JP2010160521A

JP2010160521A - Noise canceller, and communication device equipped with the same

Info

Publication number: JP2010160521A
Application number: JP2010089555A
Authority: JP
Inventors: Yuriko Tsukahara; 由利子塚原
Original assignee: Toshiba Corp
Current assignee: Toshiba Corp
Priority date: 2010-04-08
Filing date: 2010-04-08
Publication date: 2010-07-22

Abstract

<P>PROBLEM TO BE SOLVED: To reduce generation of voice distortion in noise suppressing by performing stable noise suppression processing by avoiding influence of change, even when noise energy is changed. <P>SOLUTION: In a noise minimum value estimation circuit 29, a minimum value of acoustic noise power of each band is calculated, and a spectrum feature of the noise minimum value is used for determination of a band classified gain by a band classified gain determination section 33. <P>COPYRIGHT: (C)2010,JPO&INPIT

Description

この発明は、ディジタル自動車・携帯電話装置やディジタルコードレス電話機、ディジタル有線電話装置等のように音声信号を符号化して伝送する通信装置に設けられるノイズキャンセラおよびこのノイズキャンセラを備えた通信装置に関する。 The present invention relates to a noise canceller provided in a communication device that encodes and transmits an audio signal, such as a digital automobile / mobile phone device, a digital cordless phone, a digital wired phone device, and the like, and a communication device including the noise canceller.

ディジタル携帯電話装置では、一般にＣＥＬＰ（Code Excited Linear Prediction）方式等の低ビットレートの音声符号化方式が使用されている。この種の符号化方式を使用すると、背景雑音が比較的大きい環境下でも良好な音声通話を行なうことが可能である。なお、ＣＥＬＰ方式の詳細については、M.R.Schroeder 氏とB.S.Atal氏の“Code-Excited Linear Prediction(ＣＥＬＰ):High-Quality Speech At Very Low Bit Rates”in Proc.ICASSP,1985.pp.937-939に述べられている。 In a digital cellular phone device, a low bit rate speech encoding method such as a CELP (Code Excited Linear Prediction) method is generally used. By using this type of coding system, it is possible to perform a good voice call even in an environment where the background noise is relatively large. For details of the CELP method, see MR Schroeder and BSAtal in “Code-Excited Linear Prediction (CELP): High-Quality Speech At Very Low Bit Rates” in Proc.ICASSP, 1985.pp.937-939. It is stated.

しかし、鉄道のホームや幹線道路等のような高雑音環境下では、背景雑音は音声の明瞭感を著しく低下させる。このため、雑音を除去して音声のみを符号化に供するノイズキャンセラの研究が種々なされている。その一例として、米国で規格化された可変レート音声符号化方式においてオプションとして規定されたノイズキャンセラがある（例えば、非特許文献１を参照。）。 However, in a noisy environment such as a railway platform or a main road, background noise significantly reduces voice clarity. For this reason, various studies have been made on noise cancellers that remove noise and use only speech for encoding. As an example, there is a noise canceller defined as an option in the variable rate speech coding system standardized in the United States (see Non-Patent Document 1, for example).

図９は、この非特許文献１に記載された規格において規定されたノイズキャンセラの機能ブロック図である。同図において、５１は高速フーリエ変換部（ＦＦＴ:fast Fourier transform ）であり、このＦＦＴ５１にはフレーム化された送話信号が入力される。なお、フレーム化送話信号は、Ａ／Ｄ変換された送話信号を例えば８０サンプルのフレームに分割したのち、オーバラップ分を含め１２８サンプルに整えることにより生成される。ＦＦＴ５１は、上記フレーム化送話信号に対し高速フーリエ変換処理を行い、これにより例えば１６帯域に周波数分割された変換係数が得られる。 FIG. 9 is a functional block diagram of a noise canceller defined in the standard described in Non-Patent Document 1. In the figure, 51 is a fast Fourier transform (FFT), and a framed transmission signal is input to the FFT 51. Note that the framed transmission signal is generated by dividing the A / D converted transmission signal into, for example, 80-sample frames and then adjusting it to 128 samples including the overlap. The FFT 51 performs a fast Fourier transform process on the framed transmission signal, thereby obtaining, for example, transform coefficients that are frequency-divided into 16 bands.

ＦＦＴ５１により得られた１６帯域の変換係数は、帯域エネルギ推定部５２に入力される。帯域エネルギ推定部５２は、上記１６帯域の変換係数のエネルギを計算して、この帯域別エネルギの算出値を帯域ＳＮＲ推定部５３及び雑音推定部５４に入力する。雑音推定部５４は、上記帯域別エネルギの推定値をもとに雑音部分の帯域別エネルギを推定し、この推定した帯域別雑音エネルギを帯域ＳＮＲ推定部５３に与える。帯域ＳＮＲ推定部５３は、上記帯域別エネルギの算出値と上記帯域別雑音エネルギの推定値とをもとに、帯域別の信号エネルギと雑音エネルギとの対数値（ＳＮＲ）を計算する。そして、この帯域別ＳＮＲをボイスメトリック計算部５５に与える。ボイスメトリック計算部５５は、上記帯域別ＳＮＲにその大きさに応じた重み係数を掛けたのちその総和を求めるもので、その算出結果がボイスメトリック（Voice metric）である。 The 16-band conversion coefficient obtained by the FFT 51 is input to the band energy estimation unit 52. The band energy estimation unit 52 calculates the energy of the conversion coefficients of the 16 bands and inputs the calculated values of the band-specific energy to the band SNR estimation unit 53 and the noise estimation unit 54. The noise estimation unit 54 estimates the band-specific energy of the noise portion based on the estimated value of the band-specific energy, and supplies the estimated band-specific noise energy to the band SNR estimation unit 53. The band SNR estimator 53 calculates a logarithmic value (SNR) of signal energy and noise energy for each band based on the calculated value of the band-specific energy and the estimated value of the noise energy for each band. Then, this band-specific SNR is given to the voice metric calculation unit 55. The voice metric calculation unit 55 multiplies the band-specific SNR by a weighting coefficient corresponding to the size and then obtains the sum, and the calculation result is a voice metric.

帯域ＳＮＲ修正部５６は、上記ボイスメトリックがしきい値より小さい場合には音声が含まれないと判断して帯域ＳＮＲを小さい値に修正し、帯域ゲイン計算部５７に与える。帯域ゲイン計算部５７は、上記修正されたＳＮＲをもとに各帯域の雑音抑圧量を決定し、乗算部６１に与える。この結果乗算部６１では、前記ＦＦＴ５１から出力された１６帯域の変換係数と上記帯域別の雑音抑圧量とが帯域別に乗算され、これにより帯域別に雑音抑圧された信号が出力される。そして、この帯域別に雑音抑圧された変換係数は、逆高速フーリエ変換部（ＩＦＦＴ）５８において逆高速フーリエ変換され、これにより時間領域の信号に戻されて出力される。 The band SNR correction unit 56 determines that the voice is not included when the voice metric is smaller than the threshold value, corrects the band SNR to a small value, and provides the band gain calculation unit 57 with the band SNR. The band gain calculation unit 57 determines the noise suppression amount of each band based on the modified SNR, and gives it to the multiplication unit 61. As a result, the multiplication unit 61 multiplies the 16-band conversion coefficient output from the FFT 51 and the noise suppression amount for each band for each band, and outputs a signal with noise suppression for each band. Then, the transform coefficient subjected to noise suppression for each band is subjected to inverse fast Fourier transform in an inverse fast Fourier transform unit (IFFT) 58, thereby returning to a time domain signal and outputting it.

なお、上記雑音エネルギの更新は、スペクトル偏差推定部５９及び雑音更新判定部６０において更新の可否が判定された上で行われる。
すなわち、上記ノイズキセャンセラでは、帯域別ノイズ抑圧量を送話信号のＳＮＲ、つまり信号のエネルギと雑音エネルギとの比をもとに決定している。また、音声が含まれるかどうかを判断するために、各帯域別ＳＮＲを重み付け加算して求めた値であるボイスメトリックを用いている。 The noise energy is updated after the spectrum deviation estimation unit 59 and the noise update determination unit 60 determine whether or not the update is possible.
That is, in the noise canceller, the noise suppression amount for each band is determined based on the SNR of the transmission signal, that is, the ratio between the signal energy and the noise energy. Further, in order to determine whether or not speech is included, a voice metric which is a value obtained by weighted addition of each band SNR is used.

”Enhanced Variable Rate Codec, Speech Service Option 3 for Wideband Spread Spectrum Digital Systems”（TIA IS127）“Enhanced Variable Rate Codec, Speech Service Option 3 for Wideband Spread Spectrum Digital Systems” (TIA IS127)

ところで、雑音（Background Noise）は一般に定常と仮定されるが、屋外では変動する場合がある。特に、自動車が通り過ぎるときに発生する雑音のエネルギは自動車の接近とともに大きくなる。この状態で送話音声が入力されると、音声と雑音とのエネルギ差が小さいため、抑圧後の音声を歪ませることがある。また、雑音のスペクトル形状と音声のスペクトル形状が似ている場合も、雑音エネルギをもとに抑圧を行うと音声のスペクトルに干渉しやすくなるため、抑圧後の音声に歪みが発生する。 By the way, although noise (Background Noise) is generally assumed to be steady, it may fluctuate outdoors. In particular, the energy of noise generated when a vehicle passes by increases as the vehicle approaches. When the transmitted voice is input in this state, the energy difference between the voice and noise is small, and thus the suppressed voice may be distorted. Further, even when the noise spectrum shape is similar to the speech spectrum shape, if suppression is performed based on noise energy, interference with the speech spectrum is likely to occur, and thus the suppressed speech is distorted.

音声の検出に用いるボイスメトリックは、ＳＮＲに重み係数を掛けてその総和をとったものであるため、突発的な雑音が混入した場合には音声と見なされやすい。 Since the voice metric used for voice detection is obtained by multiplying the SNR by a weighting coefficient and taking the sum, it is likely to be regarded as voice when sudden noise is mixed.

この発明は上記事情に着目してなされたもので、その目的は、雑音エネルギが変動した場合でもその影響を排除して安定な雑音抑圧処理を行えるようにし、これにより雑音抑圧における音声歪みの発生を低減したノイズキャンセラとこのノイズキャンセラを備えた通信装置を提供することである。 The present invention has been made paying attention to the above circumstances, and its purpose is to eliminate the influence of noise energy fluctuations and to perform stable noise suppression processing, thereby generating voice distortion in noise suppression. The present invention is to provide a noise canceller with reduced noise and a communication apparatus including the noise canceller.

上記目的を達成するためにこの発明の一つの観点は、入力信号を一定時間長のフレームに分割するとともに、これらのフレームの信号をそれぞれ複数の周波数帯域に分け、これらの周波数帯域ごとに雑音の抑圧処理を行うノイズキャンセラにおいて、上記周波数帯域ごとに信号のパワーを求め、この帯域パワーをもとに帯域別の雑音パワーを推定する雑音パワー推定手段と、上記帯域パワーおよび帯域別雑音パワーのうちの少なくとも一方について、複数のフレーム期間にわたりパワーの最小値を検出する最小値検出手段と、上記周波数帯域ごとにその帯域パワーと上記最小値検出手段により検出された帯域別最小値との差を求め、この差をもとに周波数帯域別の雑音抑圧量を決定する抑圧量決定手段とを具備したものである。 In order to achieve the above object, one aspect of the present invention is to divide an input signal into frames having a fixed time length, and divide the signals of these frames into a plurality of frequency bands, respectively. In a noise canceller that performs suppression processing, the power of a signal is obtained for each frequency band, noise power estimation means for estimating noise power for each band based on the band power, and the band power and the noise power for each band For at least one, a minimum value detecting means for detecting a minimum value of power over a plurality of frame periods, and obtaining a difference between the band power for each frequency band and the minimum value for each band detected by the minimum value detecting means, And a suppression amount determining means for determining a noise suppression amount for each frequency band based on this difference.

具体的には、上記抑圧量決定手段は、音声区間と雑音区間のうちの少なくとも音声区間において、上記帯域パワーと上記帯域別最小値との差に基づく帯域別の雑音抑圧量決定処理を行う。 Specifically, the suppression amount determination means performs a noise suppression amount determination process for each band based on a difference between the band power and the minimum value for each band in at least the speech section of the speech section and the noise section.

したがってこの発明の一観点によれば、周波数帯域ごとにその帯域パワー又は帯域別雑音パワーの最小値が検出され、帯域パワーとこの最小値との差をもとに周波数帯域別の雑音抑圧量が決定される。このため、通話中に例えば自動車が通過して雑音エネルギが一時的に増加しても、雑音抑圧量はあくまでも帯域パワーと上記最小値との差をもとに決定されるため、抑圧量は安定に保持され、この結果常に安定な雑音抑圧処理が可能となる。 Therefore, according to one aspect of the present invention, the minimum value of the band power or the noise power for each band is detected for each frequency band, and the noise suppression amount for each frequency band is determined based on the difference between the band power and the minimum value. It is determined. For this reason, even if, for example, a vehicle passes during a call and the noise energy temporarily increases, the noise suppression amount is determined based on the difference between the band power and the minimum value, so the suppression amount is stable. As a result, stable noise suppression processing is always possible.

また、雑音のスペクトル形状と音声のスペクトル形状が似ている場合でも、帯域パワーと上記最小値との差をもとに抑圧処理が行われるため、雑音エネルギの影響を受け難くなり、この結果音声のスペクトルに対する干渉は減少されて音声歪みの発生は防止される。 In addition, even if the noise spectrum shape is similar to the speech spectrum shape, suppression processing is performed based on the difference between the band power and the above minimum value, making it less susceptible to noise energy. Interference with the spectrum of is reduced and the occurrence of speech distortion is prevented.

またこの発明は、上記抑圧量決定手段において、フレームごとに異なる帯域共通の調整値を生成し、上記周波数帯域ごとに、その帯域パワーと、最小値検出手段により検出された帯域別最小値に上記帯域共通の調整値を加えた値との差を求め、この差をもとに周波数帯域別の雑音抑圧量を決定することも特徴とする。 Further, according to the present invention, the suppression amount determining means generates a common adjustment value for each band different for each frame, and for each frequency band, the band power and the minimum value for each band detected by the minimum value detecting means are It is also characterized in that a difference from a value obtained by adding an adjustment value common to the bands is obtained, and a noise suppression amount for each frequency band is determined based on the difference.

具体的には、雑音区間においては前記帯域別最小値間の平均値と前記帯域別雑音パワー間の平均値との差に基づいて帯域共通の調整値を決定し、一方音声区間においては１フレームにおける複数の帯域パワーの中の最小値と複数の帯域別最小値の中の最大値との差に基づいて帯域共通の調整値を決定する。
このようにすることで、音声区間及び雑音区間の各々についてより一層適切な雑音抑圧量を決定することが可能となる。 Specifically, in the noise period, a common adjustment value is determined based on the difference between the average value between the minimum values for each band and the average value between the noise powers for each band. An adjustment value common to the bands is determined based on the difference between the minimum value among the plurality of band powers and the maximum value among the minimum values according to the bands.
By doing so, it becomes possible to determine a more appropriate noise suppression amount for each of the speech section and the noise section.

すなわちこの発明によれば、雑音エネルギが変動した場合でもその影響を排除して安定な雑音抑圧処理を行うことができ、これにより雑音抑圧における音声歪みの発生を低減したノイズキャンセラとこのノイズキャンセラを備えた通信装置を提供することができる。 That is, according to the present invention, even when the noise energy fluctuates, it is possible to eliminate the influence and perform stable noise suppression processing, thereby including a noise canceller that reduces the occurrence of voice distortion in noise suppression and the noise canceller. A communication device can be provided.

この発明に係わるノイズキャンセラを備えたディジタル携帯電話装置の一実施形態を示す回路ブロック図。1 is a circuit block diagram showing an embodiment of a digital cellular phone device provided with a noise canceller according to the present invention. この発明に係わるノイズキャンセラの一実施形態を示す機能ブロック図。The functional block diagram which shows one Embodiment of the noise canceller concerning this invention. 図２に示したノイズキャンセラの有意値計算部における処理手順とその内容を示すフローチャート。The flowchart which shows the processing procedure and the content in the significant value calculation part of the noise canceller shown in FIG. 図２に示したノイズキャンセラの音声重み計算部における処理手順とその内容を示すフローチャート。The flowchart which shows the process sequence and the content in the audio | voice weight calculation part of the noise canceller shown in FIG. 図２に示したノイズキャンセラの雑音最小値推定部における処理手順とその内容の前半部分を示すフローチャート。The flowchart which shows the processing procedure in the noise minimum value estimation part of the noise canceller shown in FIG. 2, and the first half part of the content. 図２に示したノイズキャンセラの雑音最小値推定部における処理手順とその内容の後半部分を示すフローチャート。The flowchart which shows the latter half part of the process sequence in the noise minimum value estimation part of the noise canceller shown in FIG. 雑音区間における帯域パワーと雑音最小値及び調整後の雑音最小値との関係を示す図。The figure which shows the relationship between the band power in a noise area, a noise minimum value, and the noise minimum value after adjustment. 音声区間における帯域パワーと雑音最小値及び調整後の雑音最小値との関係を示す図。The figure which shows the relationship between the band power in a speech area, the noise minimum value, and the noise minimum value after adjustment. 従来のノイズキャンセラの構成の一例を示す図。The figure which shows an example of a structure of the conventional noise canceller.

以下、図面を参照してこの発明に係わる実施形態を説明する。
図１は、この発明に係わるノイズキャンセラを備えたディジタル携帯電話装置の一実施形態を示す回路ブロック図である。
同図において、図示しない基地局から無線チャネルを介して送られた無線搬送波信号は、アンテナ１で受信されたのちアンテナ共用器（ＤＵＰ）２を介して受信回路（ＲＸ）３に入力され、ここで周波数シンセサイザ（ＳＹＮ）４から出力された受信局部発振信号とミキシングされて中間周波信号に周波数変換される。そしてこの受信中間周波信号は、図示しないＡ／Ｄ変換器においてサンプリングされたのちディジタル復調器（ＤＥＭ）６に入力される。 Embodiments according to the present invention will be described below with reference to the drawings.
FIG. 1 is a circuit block diagram showing an embodiment of a digital cellular phone device provided with a noise canceller according to the present invention.
In the figure, a radio carrier signal transmitted from a base station (not shown) via a radio channel is received by an antenna 1 and then input to a receiving circuit (RX) 3 via an antenna duplexer (DUP) 2. Thus, the received local oscillation signal output from the frequency synthesizer (SYN) 4 is mixed and frequency-converted to an intermediate frequency signal. The received intermediate frequency signal is sampled by an A / D converter (not shown) and then input to a digital demodulator (DEM) 6.

ディジタル復調器６は、上記ディジタル受信中間周波信号に対するフレーム同期およびビット同期を確立したうえで、ディジタル復調処理を行なう。この復調処理により得られたベースバンドのディジタル復調信号は、時分割多元接続回路（ＴＤＭＡ）８に入力され、ここで各伝送フレームごとに自己宛てのタイムスロットが分離抽出される。なお、上記ディジタル復調器６において得られたフレーム同期及びビット同期に関する情報は制御回路１８に入力される。 The digital demodulator 6 performs digital demodulation processing after establishing frame synchronization and bit synchronization for the digital reception intermediate frequency signal. The baseband digital demodulated signal obtained by this demodulation processing is input to a time division multiple access circuit (TDMA) 8, where the time slot addressed to itself is separated and extracted for each transmission frame. Information regarding frame synchronization and bit synchronization obtained in the digital demodulator 6 is input to the control circuit 18.

上記ＴＤＭＡ回路８から出力されたディジタル復調信号は、続いて誤り訂正符号復号回路（ＣＨ−ＣＯＤ）９に入力され、ここで誤り訂正復号処理される。そして、この誤り訂正復号されたディジタル復調信号は、音声復号回路（ＤＥＣ）１０に入力されて音声復号処理され、これによりディジタル受話信号が再生される。このディジタル受話信号は、Ｄ／Ａ変換器１１でアナログ受話信号に戻されたのち図示しない音声増幅器を介してスピーカ１２に供給され、このスピーカ１２から拡声出力される。 The digital demodulated signal output from the TDMA circuit 8 is subsequently input to an error correction code decoding circuit (CH-COD) 9 where error correction decoding processing is performed. Then, this error-corrected decoded digital demodulated signal is input to a voice decoding circuit (DEC) 10 and subjected to voice decoding processing, whereby a digital received signal is reproduced. The digital reception signal is converted back to an analog reception signal by the D / A converter 11 and then supplied to the speaker 12 via a voice amplifier (not shown), and is output from the speaker 12 as a loud voice.

一方、話者の送話音声は、マイクロホン１３で集音されて電気信号に変換されたのちＡ／Ｄ変換器１４に入力され、このＡ／Ｄ変換器１４で所定のサンプリング周期でサンプリングされてディジタル送話信号に変換される。このディジタル送話信号は、後述するノイズキャンセラ１７に通されたのち音声符号化回路（ＣＯＤ）１６に入力されて音声符号化される。 On the other hand, the transmitted voice of the speaker is collected by the microphone 13 and converted into an electrical signal, which is then input to the A / D converter 14 and sampled by the A / D converter 14 at a predetermined sampling period. It is converted into a digital transmission signal. This digital transmission signal is passed through a noise canceller 17 which will be described later, and then input to a voice encoding circuit (COD) 16 for voice encoding.

この音声符号化回路１６から出力された符号化音声データは、制御回路１８から出力された制御信号とともに誤り訂正符号復号回路（ＣＨ−ＣＯＤ）９に入力され、ここで誤り訂正符号化される。そして、この誤り訂正符号化されたディジタル送信信号はＴＤＭＡ回路８に入力される。このＴＤＭＡ回路８では、時分割多元接続（ＴＤＭＡ）方式に対応した伝送フレームが生成され、この伝送フレーム中の自装置に割り当てられたタイムスロットに上記ディジタル送信信号を挿入するための処理が行なわれる。このＴＤＭＡ回路８から出力されたディジタル送信信号は、続いてディジタル変調器（ＭＯＤ）７に入力される。 The encoded speech data output from the speech encoding circuit 16 is input to the error correction code decoding circuit (CH-COD) 9 together with the control signal output from the control circuit 18, and is subjected to error correction encoding. The error-encoded digital transmission signal is input to the TDMA circuit 8. In this TDMA circuit 8, a transmission frame corresponding to a time division multiple access (TDMA) system is generated, and processing for inserting the digital transmission signal into a time slot assigned to the own apparatus in this transmission frame is performed. . The digital transmission signal output from the TDMA circuit 8 is subsequently input to the digital modulator (MOD) 7.

ディジタル変調器７では、上記ディジタル送信信号によりディジタル変調された送信中間周波信号が発生され、この送信中間周波信号は図示しないＤ／Ａ変換器によりアナログ信号に変換されたのち送信回路（ＴＸ）５に入力される。なお、ディジタル変調方式としては、例えばπ／４シフトＱＰＳＫ（π／４ shifted quadrature phase shift keying）方式が使用される。 The digital modulator 7 generates a transmission intermediate frequency signal digitally modulated by the digital transmission signal. The transmission intermediate frequency signal is converted into an analog signal by a D / A converter (not shown), and then transmitted to a transmission circuit (TX) 5. Is input. As a digital modulation method, for example, a π / 4 shift QPSK (π / 4 shifted quadrature phase shift keying) method is used.

送信回路５では、上記変調された送信中間周波信号がまず周波数シンセサイザ４から出力された送信局部発振信号とミキシングされ、これにより通話チャネルに対応する無線搬送波周波数に変換される。そして、この無線変調波信号は、送信電力増幅器において制御回路１８から制御信号ＴＣＳにより指示された所定の送信電力レベルに制御されたのち、アンテナ共用器２を介してアンテナ１から図示しない基地局へ向けて送信される。 In the transmission circuit 5, the modulated transmission intermediate frequency signal is first mixed with the transmission local oscillation signal output from the frequency synthesizer 4, and thereby converted to a radio carrier frequency corresponding to the speech channel. This radio modulated wave signal is controlled to a predetermined transmission power level instructed by the control signal TCS from the control circuit 18 in the transmission power amplifier, and then from the antenna 1 to the base station (not shown) via the antenna duplexer 2. Sent to.

なお、１９は操作パネル部であり、この操作パネル部１９には発信キー、終了キー、ダイヤルキー、および各種機能キーを有するキー入力部と、液晶表示器（ＬＣＤ）や発光ダイオード（ＬＥＤ）を有する表示部とが配設されている。
ところで、ノイズキャンセラ１７は例えばＤＳＰ（Digital Signal Processor）により実現されるもので、その処理プログラムはノイズキャンセラ内のメモリ又は制御回路１８に付属するメモリに格納されている。図２はこの処理プログラムにより実現される機能の構成を示すブロック図である。 Reference numeral 19 denotes an operation panel unit. The operation panel unit 19 includes a key input unit having a call key, an end key, a dial key, and various function keys, a liquid crystal display (LCD), and a light emitting diode (LED). And a display unit having the same.
Incidentally, the noise canceller 17 is realized by, for example, a DSP (Digital Signal Processor), and the processing program is stored in a memory in the noise canceller or a memory attached to the control circuit 18. FIG. 2 is a block diagram showing a configuration of functions realized by this processing program.

Ａ／Ｄ変換器１４から出力されたディジタル送話信号は、先ずフレーム分割部２１に入力される。フレーム分割部２１は、上記ディジタル送話信号を例えば８０サンプルのフレームに分割したのち、ウィンドウがけを行うことによりフレーム端をオーバラップさせ、これによりオーバラップ分を含め１２８サンプルに整えられたフレームを出力する。そして、このディジタル送話信号フレームを高速フーリエ変換部（ＦＦＴ）２２に入力する。 The digital transmission signal output from the A / D converter 14 is first input to the frame dividing unit 21. The frame dividing unit 21 divides the digital transmission signal into, for example, 80-sample frames, and then overlaps the frame ends by performing windowing. As a result, the frames adjusted to 128 samples including the overlaps are arranged. Output. The digital transmission signal frame is input to the fast Fourier transform unit (FFT) 22.

ＦＦＴ２２は、入力されたディジタル送話信号フレームに対し高速フーリエ変換処理を行い、これにより例えば低域から高域まで順に１６帯域に周波数分割された変換係数を得る。なお、各帯域ごとの変換係数の数は同一でなくてもよい。そして、この帯域分割された変換係数を、雑音抑圧処理用の乗算器２３に入力すると共に帯域パワー計算部２６に入力する。 The FFT 22 performs a fast Fourier transform process on the input digital transmission signal frame, thereby obtaining, for example, transform coefficients that are frequency-divided into 16 bands from low to high. Note that the number of conversion coefficients for each band may not be the same. The band-divided conversion coefficient is input to the noise suppression multiplier 23 and also to the band power calculation unit 26.

帯域パワー計算部２６は、各帯域ごとにエネルギ（変換変数の二乗平均値）を求めて対数をとり、帯域パワーchannel_power(m,k)を出力する。ただし、ｍはフレーム番号、ｋは帯域番号（k=0,…,15）である。そして、この帯域パワーを、後述する雑音リーク積分値更新部３２及び帯域別ゲイン決定部３３にそれぞれ入力すると共に、有意値計算部２７に入力する。 The band power calculation unit 26 obtains energy (root mean square value of the conversion variable) for each band, takes a logarithm, and outputs band power channel_power (m, k). Here, m is a frame number, and k is a band number (k = 0,..., 15). The band power is input to a noise leak integrated value update unit 32 and a band-specific gain determination unit 33, which will be described later, and also input to the significant value calculation unit 27.

有意値計算部２７は、各帯域ごとに、後述する雑音リーク積分値更新部３２から出力される雑音リーク積分値noise_power(m,k)と、上記帯域パワーchannel_power(m,k)との差tmp を求め、これらの帯域別の差tmp を所定のしきい値と比較する。そして、周波数順に配列された上記各帯域別の差tmp のうち隣接する複数の帯域の帯域別差tmp がしきい値を超えると判定された場合に、これらの帯域別差tmp に所定の重み付けを行った上で相互に加算する。そして、この重み付け後の値suby(m,k)と、このsubyの条件付総和を有意値ｙとして出力する。 For each band, the significant value calculator 27 calculates a difference tmp between a noise leak integrated value noise_power (m, k) output from a noise leak integrated value updater 32 described later and the band power channel_power (m, k). The difference tmp for each band is compared with a predetermined threshold value. When it is determined that the band-specific differences tmp of adjacent bands among the band-specific differences tmp arranged in order of frequency exceed the threshold value, the band-specific differences tmp are given a predetermined weight. And then add to each other. Then, the weighted value suby (m, k) and the conditional sum of the suby are output as a significant value y.

図３は、この有意値計算部２７の処理手順とその内容を示すフローチャートである。
同図において、有意値計算部２７は、先ずステップ３ａでフレーム番号ｍを０にリセットしたのち、ステップ３ｂでグループ番号ｍをインクリメントすると共に、有意値y 、帯域番号k 、及びしきい値を越える帯域別差tmp の連続数flagをそれぞれ０に初期設定する。 FIG. 3 is a flowchart showing the processing procedure and contents of the significant value calculation unit 27.
In the figure, the significant value calculator 27 first resets the frame number m to 0 in step 3a, then increments the group number m in step 3b, and exceeds the significant value y, the band number k, and the threshold value. The continuous number flag of the band-specific difference tmp is initialized to 0 respectively.

次に有意値計算部２７は、ステップ３ｃで帯域k=0 について、その帯域パワーと雑音リーク積分値との差tmp と、この帯域別差tmp に対し重み付けを行った値suby(m,k)とをそれぞれ以下のように計算する。
tmp ＝channel_power(m,k)−noise_power(m,k)
suby(m,k)＝{200-(k-1)2 }／100*(tmp-1)
ただし、{200-(k-1)2 }は重み係数である。 Next, the significant value calculation unit 27 weights the difference tmp between the band power and the noise leak integrated value for the band k = 0 in step 3c, and the value suby (m, k) obtained by weighting the band-specific difference tmp. And are calculated as follows.
tmp = channel_power (m, k) −noise_power (m, k)
suby (m, k) = {200- (k-1) 2} / 100 * (tmp-1)
However, {200- (k-1) 2} is a weighting coefficient.

そして、帯域k=0 における上記帯域別差tmp が算出されると、有意値計算部２７はステップ３ｄでこの帯域別差tmp をしきい値「１」と比較し、帯域別差tmp がしきい値「１」を越えていると、音声である可能性があると判断してステップ３ｅ及びステップ３ｇを経てステップ３ｉに移行し、ここで連続数flag＝１に設定する。そして、ステップ３ｋで帯域番号k をインクリメントしてk=1 としたのち、ステップ３ｃに戻って今度は帯域k=1 について同様の処理を実行する。 When the band-specific difference tmp in the band k = 0 is calculated, the significant value calculating unit 27 compares the band-specific difference tmp with the threshold “1” in step 3d, and the band-specific difference tmp is the threshold. If the value exceeds “1”, it is determined that there is a possibility of voice, and the process proceeds to step 3i through steps 3e and 3g, where the continuous number flag = 1 is set. Then, after incrementing the band number k in step 3k to set k = 1, the process returns to step 3c and the same processing is executed for the band k = 1.

さて、この帯域k=1 においても、前記帯域k=0 に続き帯域別差tmp がしきい値「１」を越えたとする。そうすると有意値計算部２７は、既にflag＝１であるためステップ３ｅからステップ３ｆに移行して、ここで
y ＝y ＋suby(m,k-1)
なる演算を行う。すなわち、前記帯域k=0 におけるsuby(m,k-1)を有意値y とする。そして、連続数flag＝２としたのち、ステップ３ｇを経てステップ３ｈに移行し、ここで
y ＝y ＋suby(m,k)
なる演算を行い、これにより前記帯域k=0 におけるsuby(m,k-1)に、今回の帯域k=1 で求めたsuby(m,k)を加算する。そして、ステップ３ｋで帯域番号k をさらにインクリメントしてk=2 としたのち、ステップ３ｃに戻って帯域k=2 についての処理を実行する。 Now, also in this band k = 1, it is assumed that the band-specific difference tmp exceeds the threshold value “1” following the band k = 0. Then, since significant value calculation unit 27 has already flag = 1, the process proceeds from step 3e to step 3f, where
y = y + suby (m, k-1)
Perform the following operation. That is, let suby (m, k-1) in the band k = 0 be a significant value y. Then, after setting the continuous number flag = 2, the process proceeds to step 3h through step 3g.
y = y + suby (m, k)
Thus, the suby (m, k) obtained in the current band k = 1 is added to the suby (m, k-1) in the band k = 0. In step 3k, the band number k is further incremented to become k = 2, and then the process returns to step 3c to execute processing for the band k = 2.

以後同様に、隣接する帯域k=2 ，k=3，k=4 ，…の帯域別差tmp がしきい値「１」を越えるごとに、その帯域のsuby(m,k)が一つ前の帯域までに得られた有意値y に順次加算され、これにより帯域別差tmp の重み付け加算値y が求められる。 Thereafter, similarly, every time the band-specific difference tmp of the adjacent bands k = 2, k = 3, k = 4,... Exceeds the threshold value “1”, the suby (m, k) of that band is decreased by one. Are sequentially added to the significant value y obtained up to the band of, thereby obtaining the weighted addition value y of the band-specific difference tmp.

なお、いずれかの帯域k=i において、帯域別差tmp がしきい値「１」以下になると、有意値計算部２７はステップ３ｄからステップ３ｊに移行してここでflagを０にリセットする。
そうして、１フレームを構成する１６個すべての帯域k=0〜k=15についての処理が終了すると、有意値計算部２７はステップ３ｍからステップ３ｎに移行して、ここで上記有意値y と、各帯域ごとに算出した重み付け後の帯域別差suby(m,k)(k=0,1,…,15)をそれぞれ出力する。 Note that, in any band k = i, when the band-specific difference tmp is equal to or less than the threshold value “1”, the significant value calculation unit 27 proceeds from step 3d to step 3j and resets flag to 0 here.
Then, when the processing for all 16 bands k = 0 to k = 15 constituting one frame is completed, the significant value calculation unit 27 proceeds from step 3m to step 3n, where the significant value y And the weighted subband difference suby (m, k) (k = 0, 1,..., 15) calculated for each band are output.

かくして、各フレームごとに、しきい値が「１」を越える連続する複数の帯域の帯域別差tmp の重み付け加算値y が求められ、この重み付け加算値y が後述する音声重み計算部２８における音声重みの計算、つまりそのフレームが音声フレームであるか雑音フレームであるか、さらにはその中間の過渡域フレームであるかの判定に供される。すなわち、ただ１つの帯域のみで帯域別差tmp がしきい値「１」を越えた場合には、これは雑音と見なされて排除され、上記音声重み計算部２８における音声／雑音／過渡域の判定には供されない。 Thus, for each frame, the weighted addition value y of the band-specific differences tmp of a plurality of continuous bands whose threshold value exceeds “1” is obtained, and this weighted addition value y is used for the voice weight calculation unit 28 described later. This is used for calculating the weight, that is, determining whether the frame is a speech frame, a noise frame, or an intermediate transition frame. That is, when the band-specific difference tmp exceeds the threshold value “1” in only one band, this is regarded as noise and eliminated, and the voice / noise / transient area in the voice weight calculation unit 28 is excluded. It is not used for judgment.

上記有意値計算部２７から重み付け加算値y が供給されると、音声重み計算部２８では雑音抑圧ゲインの決定に用いる音声重みspの計算が行われる。音声重みspは、１フレーム中に音声が含まれる度合いを０≦sp≦６の範囲で表す数値であり、前記重み付け加算値y から計算される。なお、sp＝０は雑音区間、sp＝６は音声区間を表す。 When the weighted addition value y is supplied from the significant value calculation unit 27, the audio weight calculation unit 28 calculates the audio weight sp used for determining the noise suppression gain. The sound weight sp is a numerical value representing the degree of sound included in one frame in a range of 0 ≦ sp ≦ 6, and is calculated from the weighted addition value y. Note that sp = 0 represents a noise section, and sp = 6 represents a voice section.

図４は、この音声重み計算部２８における音声重みspの計算手順とその処理内容を示すフローチャートである。
音声重み計算部２８は、先ずステップ４ａでフレーム番号ｍを０にリセットしたのち、ステップ４ｂでグループ番号ｍをインクリメントする。次に、ステップ４ｃで上記重み付け加算値y を任意のしきい値「１３」と比較し、y ＜１３であれば雑音フレームと判断してステップ４ｄに移行し、ここで音声重みspを
sp(m)＝sp(m-1)−0.5
に設定する。一方、y ≧１３だった場合には、ステップ４ｅに移行してここで
z ＝(y−13)*1.5+1
を計算する。すなわち、y をもとに音声重みz を１〜６の範囲で仮に設定する。 FIG. 4 is a flowchart showing the procedure for calculating the voice weight sp in the voice weight calculator 28 and the processing content thereof.
The voice weight calculator 28 first resets the frame number m to 0 in step 4a, and then increments the group number m in step 4b. Next, in step 4c, the weighted addition value y is compared with an arbitrary threshold value “13”. If y <13, it is determined as a noise frame, and the process proceeds to step 4d.
sp (m) = sp (m-1) −0.5
Set to. On the other hand, if y ≧ 13, the process proceeds to step 4e where
z = (y−13) * 1.5 + 1
Calculate That is, the voice weight z is temporarily set in the range of 1 to 6 based on y.

続いて音声重み計算部２８は、ステップ４ｆにおいてsp(m-1)≦0.5を判定する。すなわち、１フレーム前の音声重みsp(m-1)が雑音フレームだったかどうかを判定する。そして、雑音フレームだった場合にはステップ４ｇに移行し、ここで現フレームの音声重みsp(m) を上記z に設定する。これに対し、１フレーム前の音声重みsp(m-1)が雑音フレームではなかった場合には、ステップ４ｈに移行してここでz ＞sp(m-1)＋0.5を判定し、z ＞sp(m-1)＋0.5であればステップ４ｉで現フレームの音声重みsp(m) をsp(m-1)＋0.5に設定する。一方、z ＞sp(m-1)＋0.5でなければ、ステップ４ｊに移行してここでz ＞sp(m-1)−0.5を判定し、z ＞sp(m-1)−0.5であればステップ４ｋで現フレームの音声重みsp(m) をsp(m-1)−0.5に設定する。また、z ＞sp(m-1)−0.5でもない場合には、ステップ４ｍに移行してここで現フレームの音声重みをsp(m) ＝MIN(sp(m),6)又はsp(m) ＝MAX(sp(m),0)に設定する。 Subsequently, the voice weight calculation unit 28 determines sp (m−1) ≦ 0.5 in Step 4f. That is, it is determined whether or not the voice weight sp (m−1) one frame before is a noise frame. If it is a noise frame, the process proceeds to step 4g, where the speech weight sp (m) of the current frame is set to z. On the other hand, if the speech weight sp (m−1) one frame before is not a noise frame, the process proceeds to step 4h where z> sp (m−1) +0.5 is determined, and z If> sp (m-1) +0.5, the voice weight sp (m) of the current frame is set to sp (m-1) +0.5 in step 4i. On the other hand, if z> sp (m-1) +0.5, the process proceeds to step 4j, where z> sp (m-1) -0.5 is determined, and z> sp (m-1) -0.5. If there is, the voice weight sp (m) of the current frame is set to sp (m-1) -0.5 in step 4k. If z> sp (m-1) −0.5 is not satisfied, the process proceeds to step 4m where the voice weight of the current frame is set to sp (m) = MIN (sp (m), 6) or sp (m ) = Set to MAX (sp (m), 0).

すなわち、上記ステップ４ｆからステップ４ｍにおいて、現フレームで算出した仮の音声重みz が、１つ前のフレームで設定した音声重みsp(m-1) を考慮して補正される。したがって、このように求めた音声重みsp(m) を使用することで、フレーム間の連続性を考慮した音声／雑音／過渡域の判定が可能となる。 That is, in step 4f to step 4m, the temporary audio weight z calculated in the current frame is corrected in consideration of the audio weight sp (m-1) set in the previous frame. Therefore, by using the speech weight sp (m) obtained in this way, it is possible to determine speech / noise / transient range in consideration of continuity between frames.

上記音声重み計算部２８により求められた音声重みsp(m) は、ステップ４ｎにおいて出力され、雑音最小値推定部２９及び帯域別ゲイン決定部３３に入力される。
雑音最小値推定部２９は、上記音声重みがsp＝０となる１００フレームの期間ごとに、各帯域における雑音のリーク積分値noise_power(m,k)の最小値を調べる。そして、この最小値を次の１００フレームの期間において、雑音最小値noise_min(m,k)として使用する。またそれと共に、各帯域の雑音最小値の帯域間平均値min_allを求める。 The speech weight sp (m) obtained by the speech weight calculation unit 28 is output in step 4n and input to the noise minimum value estimation unit 29 and the band-specific gain determination unit 33.
The noise minimum value estimation unit 29 checks the minimum value of the noise leakage integral value noise_power (m, k) in each band for every 100 frames in which the voice weight is sp = 0. Then, this minimum value is used as the minimum noise value noise_min (m, k) in the next 100 frames. At the same time, the average value min_all of the minimum noise values in each band is obtained.

図５及び図６は、この雑音最小値推定部２９において実行される最小値推定処理の手順と内容を示すフローチャートである。
同図において、雑音最小値推定部２９は先ずステップ５ａで、フレーム番号ｍをｍ＝０にリセットすると共に、フレームカウンタの値をｆc ＝９６に、雑音最小値をnoise_min(k)＝３６に、帯域をk ＝0,…,15にそれぞれ初期設定し、さらにnoise_min(k)_h(k)＝MAX(noise_power(m,2k)，noise_power(m,2k+1))，k=0,…,7に、雑音最小値の帯域間平均min_allを
7
min_all＝Σ noise_min(k)_h(n)/8
n=0
にそれぞれ初期設定する。 5 and 6 are flowcharts showing the procedure and contents of the minimum value estimation process executed in the noise minimum value estimation unit 29.
In the figure, the noise minimum value estimation unit 29 first resets the frame number m to m = 0 in step 5a, sets the frame counter value to fc = 96, and sets the noise minimum value to noise_min (k) = 36. Bands are initially set to k = 0,..., 15, respectively, and noise_min (k) _h (k) = MAX (noise_power (m, 2k), noise_power (m, 2k + 1)), k = 0,. 7 shows the average noise minimum value between bands min_all.
7
min_all ＝ Σ noise_min (k) _h (n) / 8
n = 0
Initialize each.

次に雑音最小値推定部２９は、ステップ５ｂでフレーム番号ｍをインクリメントしたのち、ステップ５ｃで上記音声重みがsp＝０であるか否か、つまり雑音フレームであるか否かを判定する。そして、雑音フレームであれば、ステップ５ｂに戻ってフレーム番号ｍをインクリメントし、上記ステップ５ｃによる雑音フレームの判定を行う。すなわち、上記ステップ５ｂ及びステップ５ｃにより音声フレーム又は過渡域フレームの検出が行われる。 Next, the noise minimum value estimation unit 29 increments the frame number m in step 5b, and then determines in step 5c whether or not the speech weight is sp = 0, that is, whether or not it is a noise frame. If it is a noise frame, the process returns to step 5b to increment the frame number m, and the noise frame is determined in step 5c. In other words, the audio frame or the transient frame is detected by the above steps 5b and 5c.

さて、音声フレーム又は過渡域フレームが検出されると、雑音最小値推定部２９はステップ５ｄに移行してここでフレームカウンタｆc をインクリメントすると共に、帯域k=0を選択する。そして、ステップ５ｅで
x ＝MAX(noise_power(m,2k)，noise_power(m,2k+1))
に設定したのち、ステップ５ｆに移行してnoise_min(k)_h(k)＞x であるか否か判定し、noise_min(k)_h(k)＞x であればステップ５ｇに移行してここで雑音最小値をnoise_min(k)_h(k)＝x に設定する。そして、ステップ５ｈに移行する。 When a voice frame or a transient frame is detected, the minimum noise estimation unit 29 proceeds to step 5d, where the frame counter fc is incremented and the band k = 0 is selected. And in step 5e
x = MAX (noise_power (m, 2k), noise_power (m, 2k + 1))
After moving to step 5f, it is determined whether or not noise_min (k) _h (k)> x. If noise_min (k) _h (k)> x, the process proceeds to step 5g where Set the noise minimum value to noise_min (k) _h (k) = x. Then, the process proceeds to step 5h.

これに対しnoise_min(k)_h(k)＞x でなければ、そのままステップ５ｈに移行して次の帯域k=1を選択し、帯域k=8に達するまでは上記ステップ５ｅ〜ステップ５ｇによる雑音最小値noise_min(k)_h(k)の設定処理を繰り返す。 On the other hand, if noise_min (k) _h (k)> x is not satisfied, the process proceeds directly to step 5h to select the next band k = 1, and until the band k = 8 is reached, the noise generated in steps 5e to 5g. The setting process of the minimum value noise_min (k) _h (k) is repeated.

そして、帯域k=8に達すると、雑音最小値推定部２９はステップ５ｊでフレームカウンタｆc が１００に達したか否かを判定する。そして、１００フレームに達するまではステップ５ｂに戻って次のフレームを選択し、この選択したフレームについて上記ステップ５ｃ〜ステップ５ｉによる処理を繰り返す。 When the band k reaches 8, the noise minimum value estimation unit 29 determines whether or not the frame counter fc reaches 100 in step 5j. Until the number of frames reaches 100, the process returns to step 5b to select the next frame, and the processes in steps 5c to 5i are repeated for the selected frame.

一方、上記１００フレームに対する処理を終了すると、雑音最小値推定部２９は図６に示すステップ６ａに移行し、ここで雑音最小値の帯域間平均min_allを
7
min_all＝Σ noise_min(k)_h(n)/8
n=0
により求める。 On the other hand, when the processing for the 100 frames is completed, the noise minimum value estimation unit 29 proceeds to step 6a shown in FIG. 6, where the noise minimum value interband average min_all is calculated.
7
min_all ＝ Σ noise_min (k) _h (n) / 8
n = 0
Ask for.

またそれと共に、noise_min(0)及びnoise_min(1)をそれぞれ
noise_min(0)＝noise_min_h(0)
noise_min(1)＝0.75 noise_min_h(0)＋0.25 noise_min_h(1)
とすると共に、帯域をk=1とする。 Along with that, noise_min (0) and noise_min (1)
noise_min (0) = noise_min_h (0)
noise_min (1) = 0.75 noise_min_h (0) +0.25 noise_min_h (1)
And the bandwidth is k = 1.

さらに雑音最小値推定部２９は、ステップ６ｂに移行してここで、先に帯域k=0〜k=7について求めた８個の雑音最小値をもとに、残りの帯域k=8〜k=15について雑音最小値を
noise_min(2k)＝0.75 noise_min_h(k)＋0.25 noise_min_h(k-1)
noise_min(2k+1)＝0.75 noise_min_h(k)＋0.25 noise_min_h(k+1)
のように算出する。 Further, the noise minimum value estimation unit 29 proceeds to step 6b, where the remaining bands k = 8 to k are based on the eight noise minimum values previously obtained for the bands k = 0 to k = 7. Noise minimum value for = 15
noise_min (2k) = 0.75 noise_min_h (k) +0.25 noise_min_h (k-1)
noise_min (2k + 1) = 0.75 noise_min_h (k) +0.25 noise_min_h (k + 1)
Calculate as follows.

そして、以上の演算が終了すると、雑音最小値推定部２９はステップ６ｄからステップ６ｅに移行し、ここで
noise_min(14)＝0.75 noise_min_h(7)＋0.25 noise_min_h(6)
noise_min(15)＝noise_min_h(7)
を算出する。 When the above calculation is completed, the noise minimum value estimation unit 29 proceeds from step 6d to step 6e, where
noise_min (14) = 0.75 noise_min_h (7) +0.25 noise_min_h (6)
noise_min (15) ＝ noise_min_h (7)
Is calculated.

すなわち、雑音最小値推定部２９は、上記ステップ６ａ〜ステップ６ｅにおいて８個のmin_allをもとに16個のmin_allを補間している。
そうして16個のmin_allを算出すると、雑音最小値推定部２９はステップ６ｆにおいて、フレームカウンタｆc を０にリセットすると共に、雑音最小値をnoise_min_h(k)=36に、また帯域をk=0,…,7に設定し直す。そして、ステップ６ｇにおいて、先に算出した雑音最小値の帯域間平均値min_all、及び雑音最小値noise_min(m,k),k=0,…,15を出力したのち、ステップ５ｂに戻って次のフレームm=m+1について同様の雑音最小値及びその帯域間平均値の算出処理を繰り返す。 That is, the noise minimum value estimation unit 29 interpolates 16 min_alls based on the 8 min_alls in Steps 6a to 6e.
After calculating 16 min_all, the noise minimum value estimation unit 29 resets the frame counter fc to 0 in step 6f, sets the noise minimum value to noise_min_h (k) = 36, and sets the band to k = 0. , ..., reset to 7. Then, in step 6g, the average value min_all of the minimum noise values calculated previously and the minimum noise values noise_min (m, k), k = 0,..., 15 are output, and then the process returns to step 5b to return to the next. The same processing for calculating the minimum noise value and the average value between the bands is repeated for the frame m = m + 1.

また、更新判定部３１および雑音リーク積分値更新部３２では、雑音のリーク積分値noise_power(m,k)の更新処理が行われる。すなわち、更新判定部３１は、y ＜１５のときに更新可、それ以外のときには更新不可とする。更新可のとき雑音リーク積分値更新部３２は、雑音パワーnoise_powerを例えば、
noise_power(m+1,k)＝noise_power(m,k)*0.9＋channel_power(m,k)*0.1, k=0,…,15
のように更新する。 In addition, the update determination unit 31 and the noise leak integration value update unit 32 perform update processing of the noise leak integration value noise_power (m, k). That is, the update determination unit 31 determines that updating is possible when y <15, and updating is not possible otherwise. When updating is possible, the noise leak integrated value updating unit 32 sets the noise power noise_power to, for example,
noise_power (m + 1, k) = noise_power (m, k) * 0.9 + channel_power (m, k) * 0.1, k = 0, ..., 15
Update like this.

帯域別ゲイン決定部３３は、前記帯域パワー計算部２６から出力された帯域パワーchannel_power(m,k)、雑音リーク積分値更新部３２から出力された雑音パワーnoise_power(m,k)、音声重み計算部２８から出力された音声重みsp(m,k)、及び雑音最小値推定部２９から出力された雑音最小値noise_min(m,k)をもとに、帯域別ゲインgain(m,k)を決定する。 The band-specific gain determination unit 33 includes the band power channel_power (m, k) output from the band power calculation unit 26, the noise power noise_power (m, k) output from the noise leak integration value update unit 32, and a voice weight calculation. Based on the speech weight sp (m, k) output from the unit 28 and the noise minimum value noise_min (m, k) output from the noise minimum value estimation unit 29, the gain by band gain (m, k) is obtained. decide.

先ず雑音リーク積分値noise_power(m,k)の帯域平均値noise_allを、
15
noise_all= Σ noise_power(m,k)/16
k=0
により求める。 First, the band average value noise_all of the noise leak integrated value noise_power (m, k)
15
noise_all = Σ noise_power (m, k) / 16
k = 0
Ask for.

続いて、帯域パワーchannel_power(m,k)の帯域最低値min_band、及び雑音最小値noise_min(m,k)の帯域最大値max_bandをそれぞれ、
min_band=MIN (channel_power(m,k),k=2,…,11
max_band=MAX(noise_power(m,k),k=0,…,15)
により求める。 Subsequently, the band minimum value min_band of the band power channel_power (m, k) and the band maximum value max_band of the noise minimum value noise_min (m, k), respectively,
min_band = MIN (channel_power (m, k), k = 2,…, 11
max_band = MAX (noise_power (m, k), k = 0,…, 15)
Ask for.

次に、帯域共通の調整値mdを
md=(noise_all−min_all)*(1−sp/6)＋(min_band−max_band)*sp/6
により決定する。この式によると、
sp=0すなわち雑音区間のとき、md=noise_all−min_all
sp=6すなわち音声区間のとき、md=min_band−max_band
となり、過渡域はこれらの中間の値をとることがわかる。 Next, the adjustment value md common to all bands
md = (noise_all−min_all) * (1−sp / 6) + (min_band−max_band) * sp / 6
Determined by According to this formula:
When sp = 0, that is, in the noise interval, md = noise_all−min_all
sp = 6, that is, md = min_band−max_band when the voice interval
Thus, it can be seen that the transition region takes an intermediate value between these.

雑音フレームの場合、及び音声フレームの場合の周波数対パワー特性の一例を、それぞれ図７及び図８に示す。
雑音フレームでは、図７に示すように、帯域パワーは雑音最小値に近くなる。雑音最小値に調整値を加えた値は、雑音最小値のスペクトル特性はそのままで平均値が雑音パワーの平均値noise_allに変更されたものとなる。 Examples of frequency versus power characteristics in the case of a noise frame and in the case of a speech frame are shown in FIGS. 7 and 8, respectively.
In the noise frame, as shown in FIG. 7, the band power is close to the noise minimum value. The value obtained by adding the adjustment value to the minimum noise value is obtained by changing the average value to the noise power average value noise_all without changing the spectral characteristics of the minimum noise value.

これに対し音声フレームの場合には、図８に示すように、雑音最小値に調整値を加えた値は、最小値のスペクトル特性はそのままで帯域の最大値が帯域パワーの最低値と一致するよう調整されることになる。
帯域別ゲインgain(m,k)は、帯域パワーchannel_power(m,k)と、雑音最小値noise_min(m,k)と、調整値とから次のように決定される。すなわち、帯域k において、
tmp=channel_power(m,k)−noise_min(m,k)−md−1.625
gain(m,k)={sqrt(1.4+0.49*tmp2)＋0.7*tmp−9.5}*2
をk=0,…,15についてそれぞれ独立に求める。
なお、Sp=0のときは雑音であるので、上記の計算式に依らずすべての帯域でゲインを定数に、例えばgain(m,k)=−20に設定してもよい。 On the other hand, in the case of an audio frame, as shown in FIG. 8, the value obtained by adding the adjustment value to the minimum noise value is the same as the minimum value of the band power while the minimum spectral characteristics remain unchanged. Will be adjusted as follows.
The gain by band gain (m, k) is determined from the band power channel_power (m, k), the minimum noise value noise_min (m, k), and the adjustment value as follows. That is, in the band k,
tmp = channel_power (m, k) −noise_min (m, k) −md−1.625
gain (m, k) = {sqrt (1.4 + 0.49 * tmp2) + 0.7 * tmp−9.5} * 2
Are independently obtained for k = 0, ..., 15.
In addition, since it is noise when Sp = 0, the gain may be set to a constant in all bands regardless of the above formula, for example, gain (m, k) = − 20.

そして、以上のように求められた帯域別ゲインgain(m,k)は、乗算器２３において帯域ごとに変換係数に乗算され、これによりノイズキャンセルがなされる。そして、このノイズキャンセルされた各帯域ごとの変換係数は、ＩＦＦＴ２４において逆高速フーリエ変換されて時間軸上の信号フレームに戻されたのち、フレーム合成部２５においてフレーム合成されて音声符号化回路１６に供給される。 Then, the gain by band gain (m, k) obtained as described above is multiplied by the conversion coefficient for each band in the multiplier 23, and noise cancellation is thereby performed. The noise-canceled transform coefficients for each band are subjected to inverse fast Fourier transform in IFFT 24 and returned to the signal frame on the time axis, and then frame-synthesized in the frame synthesizer 25 to the speech encoding circuit 16. Supplied.

以上述べたようにこの実施形態によれば、雑音最小値推定回路２９において各帯域の雑音パワーの最小値を求め、この雑音最小値のスペクトル形状を帯域別ゲイン決定部３３による帯域別ゲインの決定に用いるようにしているため、例えば自動車の通過時のような雑音スペクトルの短期的な変化に影響されず、音声スペクトルを歪ませにくいノイズキャンセル処理を実現することができる。 As described above, according to this embodiment, the noise minimum value estimation circuit 29 obtains the minimum value of the noise power of each band, and the spectrum shape of this noise minimum value is determined by the band-specific gain determination unit 33. Therefore, it is possible to realize a noise canceling process that is not affected by a short-term change in the noise spectrum, such as when passing through an automobile, and that hardly distorts the audio spectrum.

またこの実施形態によれば、有意値計算部２７において、しきい値が「１」を越える連続する複数の帯域の帯域別差の重み付け加算値を求め、この重み付け加算値を音声重み計算部２８における音声重みの計算、つまりそのフレームが音声フレームであるか雑音フレームであるか、さらにはその中間の過渡域フレームであるかの判定に供している。このため、ただ１つの帯域のみで帯域別差がしきい値「１」を越えた場合には、これは雑音と見なして排除することができ、これにより音声／雑音／過渡域の判定を正確に行って、ノイズキャンセル性能を高めることができる。 In addition, according to this embodiment, the significant value calculation unit 27 obtains a weighted addition value of a difference for each band of a plurality of continuous bands whose threshold value exceeds “1”, and uses the weighted addition value as a voice weight calculation unit 28. Is used for the calculation of the voice weight in, that is, whether the frame is a voice frame, a noise frame, or an intermediate transition frame. For this reason, when the difference between the bands exceeds the threshold value “1” with only one band, it can be regarded as noise and eliminated, thereby accurately determining the voice / noise / transient range. Thus, noise cancellation performance can be improved.

なお、この発明は上記実施形態に限定されるものではない。例えば、前記実施形態ではＴＤＭＡ方式を採用したディジタル携帯電話装置を例にとって説明したが、ＣＤＭＡ方式を採用したディジタル携帯電話装置にもこの発明は適用可能である。
その他、ノイズキャンセラ内の各機能部の処理手順と処理内容、及びこの処理を実現するための回路構成又は処理プログラム等についても、この発明の要旨を逸脱しない範囲で種々変形して実施できる。 The present invention is not limited to the above embodiment. For example, in the above-described embodiment, the digital mobile phone device adopting the TDMA system has been described as an example. However, the present invention can also be applied to a digital mobile phone device adopting the CDMA system.
In addition, the processing procedure and processing contents of each functional unit in the noise canceller and the circuit configuration or processing program for realizing the processing can be variously modified without departing from the gist of the present invention.

１…アンテナ、２…アンテナ共用器（ＤＵＰ）、３…受信回路（ＲＸ）、４…周波数シンセサイザ（ＳＹＮ）、５…送信回路（ＴＸ）、６…ディジタル復調器（ＤＥＭ）、７…ディジタル変調器（ＭＯＤ）、８…時分割多元接続回路（ＴＤＭＡ）、９…誤り訂正符号復号回路（ＣＨ−ＣＯＤ）、１０…音声復号回路（ＤＥＣ）、１１…Ｄ／Ａ変換器、１２…スピーカ、１３…マイクロホン、１４…Ａ／Ｄ変換器、１６…音声符号化回路（ＣＯＤ）、１７…ノイズキャンセラ、１８…制御回路、１９…操作パネル部、２１…フレーム分割部、２２…高速フーリエ変換部（ＦＦＴ）、２３…乗算部、２４…逆高速フーリエ変換部（ＩＦＦＴ）、２５…フレーム合成部、２６…帯域パワー計算部、２７…有意値計算部、２８…音声重み計算部、２９…雑音最小値推定部、３１…更新判定部、３２…雑音リーク積分値更新部、３３…帯域別ゲイン決定部。 DESCRIPTION OF SYMBOLS 1 ... Antenna, 2 ... Antenna duplexer (DUP), 3 ... Reception circuit (RX), 4 ... Frequency synthesizer (SYN), 5 ... Transmission circuit (TX), 6 ... Digital demodulator (DEM), 7 ... Digital modulation Unit (MOD), 8 ... time division multiple access circuit (TDMA), 9 ... error correction code decoding circuit (CH-COD), 10 ... voice decoding circuit (DEC), 11 ... D / A converter, 12 ... speaker, DESCRIPTION OF SYMBOLS 13 ... Microphone, 14 ... A / D converter, 16 ... Voice coding circuit (COD), 17 ... Noise canceller, 18 ... Control circuit, 19 ... Operation panel part, 21 ... Frame division part, 22 ... Fast Fourier transform part ( FFT), 23 ... multiplier, 24 ... inverse fast Fourier transform (IFFT), 25 ... frame synthesis unit, 26 ... band power calculation unit, 27 ... significant value calculation unit, 28 ... voice weight calculation unit, 29 The minimum noise value estimation unit, 31 ... update determination unit, 32 ... Noise leaky integration value updating unit, 33 ... band-specific gain determination unit.

Claims

In the noise canceller that divides the input signal into frames of a certain time length, divides the signals of these frames into a plurality of frequency bands, and performs noise suppression processing for each of these frequency bands,
Noise power estimating means for obtaining signal power for each frequency band and estimating noise power for each band based on the band power;
Minimum value detecting means for detecting a minimum value of power over a plurality of frame periods for at least one of the band power and the noise power for each band;
For each frequency band, a difference between the band power and the minimum value for each band detected by the minimum value detecting means is obtained, and a suppression amount determining means for determining a noise suppression amount for each frequency band based on the difference. A noise canceller characterized by comprising.

Further comprising means for determining a speech interval and a noise interval for the input signal;
The suppression amount determination means performs a noise suppression amount determination process for each band based on a difference between the band power and the minimum value for each band in at least a speech section of a speech section and a noise section. Item 2. The noise canceller according to Item 1.

The suppression amount determination unit further includes an adjustment value generation unit that generates an adjustment value common to different bands for each frame, and for each frequency band, the band power and the band detected by the minimum value detection unit The noise suppression amount for each frequency band is determined based on a difference between a minimum value and a value obtained by adding an adjustment value common to the band generated by the adjustment value generation unit. The noise canceller according to 1.

The adjustment value generation means determines an adjustment value common to the band based on the difference between the average value between the minimum values for each band and the average value between the noise powers for each band in the noise section, 4. The band common adjustment value is determined based on a difference between a minimum value among a plurality of band powers in one frame and a maximum value among a plurality of minimum values for each band. Noise canceller.

In a communication apparatus that encodes and transmits a transmission input signal by a voice encoding unit,
The transmission input signal is divided into frames of a certain time length, and the signals of these frames are divided into a plurality of frequency bands, respectively, and a noise canceller that performs noise suppression processing for each of these frequency bands is provided,
The noise canceller is
Noise power estimating means for obtaining signal power for each frequency band and estimating noise power for each band based on the band power;
Minimum value detecting means for detecting a minimum value of power over a plurality of frame periods for at least one of the band power and the noise power for each band;
For each frequency band, a difference between the band power and the minimum value for each band detected by the minimum value detecting means is obtained, and a suppression amount determining means for determining a noise suppression amount for each frequency band based on the difference. A communication device comprising: