JP2012225982A

JP2012225982A - Speech processing device

Info

Publication number: JP2012225982A
Application number: JP2011090886A
Authority: JP
Inventors: Arikatsu Wada; 存功和田
Original assignee: Audio Technica KK
Current assignee: Audio Technica KK
Priority date: 2011-04-15
Filing date: 2011-04-15
Publication date: 2012-11-15

Abstract

PROBLEM TO BE SOLVED: To attain soft mute for reducing abnormal noise to be generated due to deterioration of a reception state with a speech processing device which handles a digital signal.SOLUTION: A speech processing device includes: fast Fourier transform means 1 for performing Fourier transform to a digital input signal in a voice band at high speed; absolute value calculation means 2 for calculating an absolute value of the signal to which the Fourier transform is performed to generate an absolute value signal; auto-correlation function calculation means 3 for calculating an auto-correlation function of the calculated absolute value signal; difference calculation means 8 for calculating variation of output of the auto-correlation function calculation means 3; noise determination means 9 for calculating a noise component by the size of difference; fade-out coefficient derivation means 10 for calculating a fade-out amount based on a calculation result by the noise determination means 9; and volume control means 11 for controlling a level of the digital input signal based on a calculation result by the fade-out coefficient derivation means 10 to be output.

Description

本発明は、例えば、赤外線などを用いるワイヤレス方式音声会議システムなどに好適な音声処理装置に関するもので、特に、デジタル信号を扱う受信機において、受信状態が悪化した場合などに違和感なくフェードアウトするようにしたものである。 The present invention relates to an audio processing apparatus suitable for, for example, a wireless audio conference system using infrared rays and the like, and in particular, in a receiver that handles digital signals, it fades out without a sense of incongruity when the reception state deteriorates. It is a thing.

ワイヤレス方式音声会議システムでは、マイクロホンでとらえた音声信号を赤外線や電波を利用して送信し、この送信信号を受信器で受信して音声信号に復元し、スピーカから音声として放出するようになっている。ワイヤレス方式音声会議システムに限らず、ワイヤレスで音声信号を送信し、これを受信して音声信号に戻すシステムにおいては、送受信条件が悪く受信状態が悪化すると、音声に混入するノイズの割合が多くなり、音声を聴く者に不快感を与える。 In a wireless audio conference system, an audio signal captured by a microphone is transmitted using infrared rays or radio waves, this transmission signal is received by a receiver, restored to an audio signal, and emitted from a speaker as audio. Yes. Not only in wireless audio conferencing systems, but in systems that wirelessly transmit audio signals, receive them, and return them to audio signals, if the transmission and reception conditions are poor and the reception status deteriorates, the proportion of noise mixed in the audio increases. Discomfort to those who listen to audio.

特に、ＦＭ方式の送受信システムにおいては、受信状態が悪化すると、「ザー」というようなホワイトノイズが発生し、受信状態がさらに悪化すると突然「ポッ」というような不愉快な音を発して音が途絶えてしまう。また、受信状態が好転すると、突然音声が発せられて聴く者を驚かせる。 In particular, in the FM transmission / reception system, when the reception state deteriorates, white noise such as “Zar” occurs, and when the reception state further deteriorates, suddenly an unpleasant sound such as “pop” is emitted and the sound stops. End up. In addition, when the reception state improves, a voice is suddenly emitted and surprises the listener.

そこで、受信機において、受信状態の悪化により不愉快な雑音を発するような状況に至ると、復調した音声信号のレベルを連続的に減衰させるようにしたいわゆるソフトミュート回路が提案されている。特許文献１に記載されている受信機のソフトミュート回路はその一つで、復調した音声信号のレベルを制御するレベル制御回路と、ＲＦ信号と局部発振信号を混合して生成される中間周波信号のレベルに応じた第１制御信号を生成する電界強度検出回路と、上記音声信号の出力レベルに応じた第２制御信号を生成する変調度検出回路とを具備し、上記第１制御信号が所定の電界強度以下を示す値になったときに、上記レベル制御回路が、上記第２制御信号の出力に応じて音声信号の出力レベルを制御するように構成されている。 In view of this, a so-called soft mute circuit has been proposed in which the level of the demodulated audio signal is continuously attenuated when the receiver generates an unpleasant noise due to the deterioration of the reception state. One of the soft mute circuits of the receiver described in Patent Document 1 is a level control circuit that controls the level of a demodulated audio signal, and an intermediate frequency signal that is generated by mixing an RF signal and a local oscillation signal. And a modulation degree detection circuit for generating a second control signal corresponding to the output level of the audio signal, wherein the first control signal is predetermined. The level control circuit is configured to control the output level of the audio signal in accordance with the output of the second control signal when the value becomes equal to or less than the value of the electric field strength.

特許文献１記載の発明によれば、電界強度が低下したときに増大する復調ノイズを低減するとともに、変調度の程度に応じて出力レベルの減衰特性を変化させることで、聴感上の違和感が生じないという効果がある。しかし、特許文献１記載の発明で扱う信号はアナログ信号である。デジタル信号を扱う受信機において、受信状態の悪化によってノイズが発生しやすい状況になったとき、特許文献１記載の発明と同様の技術思想ではいわゆるソフトミュートを実現することができず、特許文献１記載の発明とは全く異なった発想が求められる。 According to the invention described in Patent Document 1, the demodulation noise that increases when the electric field strength is reduced is reduced, and the attenuation characteristic of the output level is changed according to the degree of the modulation degree. There is no effect. However, the signal handled in the invention described in Patent Document 1 is an analog signal. In a receiver that handles digital signals, when noise is likely to occur due to deterioration of the reception state, so-called soft mute cannot be realized with the same technical idea as the invention described in Patent Document 1, and Patent Document 1 A completely different idea from the described invention is required.

また、赤外線を用いる従来のマイクロホンシステムでは、受信状態が悪化すると、悪化の程度に応じた量のホワイトノイズが発生する。そこで、例えば、アナログ信号用のＦＭ復調回路から発生する可聴帯域以外の指定周波数（例えば５０ｋＨz）のエネルギーが閾値を超えた場合、ミュートを行う（例えば、マイクロホンの出力をオフする）ようにしたものがある。 In the conventional microphone system using infrared rays, when the reception state deteriorates, white noise of an amount corresponding to the degree of deterioration is generated. Therefore, for example, when the energy of a specified frequency (for example, 50 kHz) other than the audible band generated from the FM demodulation circuit for analog signals exceeds a threshold value, muting is performed (for example, the output of the microphone is turned off). There is.

しかし、上記従来のマイクロホンシステムでは、受信状態が不安定で、指定周波数のエネルギーが閾値前後で変動している場合、マイクロホン出力がオンとオフを繰り返し、その都度「バツ、バツ」というような異音が発生する。また、上記ＦＭの復調回路をデジタル処理回路で構成した場合は、上記の可聴帯域以外の指定周波数のエネルギーでミュートを行うことは困難である。 However, in the above conventional microphone system, when the reception state is unstable and the energy of the specified frequency fluctuates around the threshold value, the microphone output is repeatedly turned on and off, and each time a difference such as “X, X” is given. Sound is generated. Further, when the FM demodulating circuit is constituted by a digital processing circuit, it is difficult to mute with the energy of the designated frequency other than the audible band.

特公平６−２６３１６号公報Japanese Patent Publication No. 6-26316

本発明は、受信状態が悪化することによって生ずる異音を低減するためのいわゆるソフトミュートを、デジタル信号を扱う音声処理装置で実現することを目的とする。 An object of the present invention is to realize so-called soft mute for reducing abnormal noise caused by deterioration of the reception state in a sound processing apparatus that handles digital signals.

本発明に係る音声処理装置は、
音声帯域のデジタル入力信号を高速でフーリエ変換する高速フーリエ変換手段と、
上記フーリエ変換された信号の絶対値を算出して絶対値信号を生成する絶対値算出手段と、
算出された上記絶対値信号の自己相関関数を算出する自己相関関数算出手段と、
上記自己相関関数算出手段の出力の変化量を算出する差分算出手段と、
上記差分の大きさによってノイズ成分を算出するノイズ判定手段と、
上記ノイズ判定手段による算出結果をもとにフェードアウト量を算出するフェードアウト係数導出手段と、
上記フェードアウト係数導出手段による算出結果をもとに上記デジタル入力信号のレベルを調節して出力する音量調節手段と、
を備えていることを最も主要な特徴とする。 The speech processing apparatus according to the present invention
A fast Fourier transform means for performing a Fourier transform of a digital input signal in a voice band at a high speed;
Absolute value calculating means for generating an absolute value signal by calculating an absolute value of the Fourier transformed signal;
Autocorrelation function calculating means for calculating an autocorrelation function of the calculated absolute value signal;
Difference calculating means for calculating the amount of change in output of the autocorrelation function calculating means;
Noise determining means for calculating a noise component according to the magnitude of the difference;
A fade-out coefficient deriving unit that calculates a fade-out amount based on the calculation result by the noise determination unit;
Volume adjusting means for adjusting and outputting the level of the digital input signal based on the calculation result by the fade-out coefficient deriving means;
It has the most important feature.

本発明によれば、自己相関法を用いて受信状態を検知し、受信状態の悪化に伴い発生するノイズを定量化し、このノイズ成分に対応してデジタル入力信号のレベルを調節してフェードアウトさせるようになっていて、いわゆるソフトミュートをデジタル音声信号回路で実現することができる。 According to the present invention, the reception state is detected using the autocorrelation method, the noise generated as the reception state deteriorates is quantified, and the level of the digital input signal is adjusted in accordance with the noise component to be faded out. Therefore, so-called soft mute can be realized by a digital audio signal circuit.

本発明に係る音声処理装置の一実施例を示す機能ブロック図である。It is a functional block diagram which shows one Example of the audio processing apparatus which concerns on this invention. 本発明に係る音声処理装置の別の実施例を示す機能ブロック図である。It is a functional block diagram which shows another Example of the audio processing apparatus which concerns on this invention. 周期性のある信号と非周期性の信号の相関関数波形を比較して示す波形図である。It is a wave form diagram which compares and shows the correlation function waveform of a signal with periodicity, and a non-periodic signal. 受信レベルに対するノイズ量、フェードアウト量、出力音量のレベルを百分率で示すグラフである。It is a graph which shows the level of the noise amount with respect to a reception level, fade-out amount, and an output volume in percentage.

以下、本発明に係る音声処理装置の実施例を、図面を参照しながら説明する。 Embodiments of a speech processing apparatus according to the present invention will be described below with reference to the drawings.

図１に示す音声処理装置は、例えば、一般的な放送の受信機、通信機の受信機、あるいは赤外線や電波を用いるワイヤレス方式音声会議システムの受信機の一部であって、復調された後の音声処理装置である。図１において、「入力」とあるのは、音声帯域信号の入力のことであって、ここではデジタル信号で入力される。送受信システムがデジタル方式であるものに限らず、送受信システムはアナログ方式で、復調された信号がデジタル信号に変換され、変換されたデジタル信号を上記「入力」とするものであってもよい。 The audio processing apparatus shown in FIG. 1 is, for example, a part of a general broadcast receiver, a communication receiver, or a wireless audio conference system using infrared rays or radio waves, and is demodulated. This is a voice processing apparatus. In FIG. 1, “input” refers to an input of a voice band signal, and here is input as a digital signal. The transmission / reception system is not limited to a digital system, and the transmission / reception system may be an analog system, and a demodulated signal may be converted into a digital signal, and the converted digital signal may be used as the “input”.

上記入力信号は、高速フーリエ変換手段１、絶対値算出手段２、自己相関関数算出手段３、差分算出手段８、ノイズ判定手段９、フェードアウト係数導出手段１０、音量調節手段１１を順に経ることによって、それぞれの手段で所定の処理が行われる。上記各手段で行われる処理は以下のとおりである。 The input signal passes through the fast Fourier transform means 1, the absolute value calculation means 2, the autocorrelation function calculation means 3, the difference calculation means 8, the noise determination means 9, the fade-out coefficient derivation means 10, and the volume control means 11 in this order. Predetermined processing is performed by each means. The processing performed by each of the above means is as follows.

高速フーリエ変換手段１は、入力される音声帯域のデジタル信号を高速フーリエ変換（ＦａｓｔＦｏｕｒｉｅｒＴｒａｎｓｆｏｒｍ：ＦＦＴ）するもので、時間や空間座標が変数の関数を、周波数が変数の関数に変換する。入力される音声帯域のデジタル信号は時間領域の信号であり、この入力信号は、高速フーリエ変換手段１で周波数領域の信号に変換される。 The fast Fourier transform means 1 performs fast Fourier transform (FFT) on a digital signal in an input audio band, and converts a function having a variable time and space coordinates into a function having a variable frequency. The input digital signal in the voice band is a time domain signal, and this input signal is converted into a frequency domain signal by the fast Fourier transform means 1.

高速フーリエ変換手段１で周波数領域の信号に変換された上記デジタル音声帯信号は複素数で表されていて、この信号のゲインを算出するために、絶対値算出手段２で上記変換信号の絶対値信号を生成する。 The digital voice band signal converted into the frequency domain signal by the fast Fourier transform means 1 is represented by a complex number. In order to calculate the gain of this signal, the absolute value calculation means 2 calculates the absolute value signal of the converted signal. Is generated.

自己相関関数算出手段３は、絶対値算出回路２で算出された絶対値信号の自己相関関数を算出する。自己相関とは、信号がそれ自身を時間シフトした信号とどれだけ良く整合しているかを図る尺度であり、時間シフトの大きさの関数として表され、この関数を自己相関関数という。時間的に一定の長さの信号と、これよりも時間的に遅れた信号の振幅が大きく、同様な繰り返し成分があれば、上記二つの信号の相関値ないしは自己相関関数は大きくなる。したがって、音楽などの音声信号に比べ、様々な音が含まれているノイズ信号は、相関値がすぐに小さいため、ノイズ信号の自己相関関数は音声信号の相関関数よりも小さくなる。自己相関関数算出手段３は、上記絶対値信号の自己相関関数を算出するために、高速フーリエ逆変換を行い、時間領域の信号に戻す。 The autocorrelation function calculation means 3 calculates an autocorrelation function of the absolute value signal calculated by the absolute value calculation circuit 2. Autocorrelation is a measure for how well a signal matches itself with a time-shifted signal, and is expressed as a function of the magnitude of the time shift, and this function is called an autocorrelation function. If the amplitude of a signal of a certain length in time and a signal delayed in time are large and there are similar repetitive components, the correlation value or autocorrelation function of the two signals becomes large. Therefore, since a correlation value of a noise signal including various sounds is immediately smaller than that of an audio signal such as music, the autocorrelation function of the noise signal is smaller than the correlation function of the audio signal. The autocorrelation function calculation means 3 performs fast Fourier inverse transform in order to calculate the autocorrelation function of the absolute value signal, and returns it to the time domain signal.

差分算出手段８は、自己相関関数算出手段３で得られる自己相関関数に含まれる相関波形の振幅値の差分すなわち変化量を求めるためにハイパスフィルタ処理を行う。このハイパスフィルタ処理を行うと、すなわち上記相関波形をハイパスフィルタに通すと、ノイズ、例えばホワイトノイズやピンクノイズなど非周期性の信号は差分すなわち変化量が大きいため、急峻な波形となり、すぐに小さくなる。これに対して周期性の信号成分が多い音楽などの音声信号をハイパスフィルタに通すと、なだらかな波形になるとともに、すぐに小さくなってしまうことはなく持続性がある。図３は、周期性のある信号と非周期性のノイズ信号の相関関数波形を比較して示す。図３において波形Ａは周期性のある音声信号成分が多い場合の相関関数波形、波形Ｂは非周期性のノイズ信号成分が多い場合の相関関数波形を示している。 The difference calculation means 8 performs high-pass filter processing in order to obtain the difference, that is, the amount of change in the amplitude value of the correlation waveform included in the autocorrelation function obtained by the autocorrelation function calculation means 3. When this high-pass filter processing is performed, that is, when the correlation waveform is passed through a high-pass filter, noise, for example, non-periodic signals such as white noise and pink noise have a large difference, that is, a change amount. Become. On the other hand, when a sound signal such as music having a large number of periodic signal components is passed through a high-pass filter, it becomes a gentle waveform and does not quickly become small, and is durable. FIG. 3 shows a comparison of correlation function waveforms of a periodic signal and an aperiodic noise signal. In FIG. 3, a waveform A shows a correlation function waveform when there are many periodic sound signal components, and a waveform B shows a correlation function waveform when there are many non-periodic noise signal components.

図３からもわかるように、相関関数波形の急峻度の違いすなわち変化量の違いから、非周期性のノイズ信号成分が多い信号か否かを判定することができる。そこで、ノイズ判定手段９において、差分算出手段８で求められた差分の値から、差分が大きいときは、相関関数波形の傾きが大きいということで非周期性のノイズ信号成分が多いと判定する。逆に、差分が小さいときは、相関関数波形の傾きが小さいということで非周期性のノイズ信号成分が少ないすなわち周期性のある音声信号成分が多いと判定する。この判定結果を表す信号が次のフェードアウト係数導出手段１０に向けて出力される。後で述べる音量調節をきめ細かに行うために、ノイズ判定手段９におけるノイズ判定は多段にわたって行うことが望ましいが、一定の閾値を設定して、この閾値以上か否かで判断してもよい。 As can be seen from FIG. 3, it is possible to determine whether or not the signal has a large number of non-periodic noise signal components from the difference in the steepness of the correlation function waveform, that is, the difference in change amount. Therefore, when the difference is large from the difference value obtained by the difference calculation unit 8, the noise determination unit 9 determines that there are many non-periodic noise signal components because the inclination of the correlation function waveform is large. On the other hand, when the difference is small, it is determined that the non-periodic noise signal component is small, that is, the periodic audio signal component is large, because the slope of the correlation function waveform is small. A signal representing the determination result is output to the next fade-out coefficient deriving means 10. In order to finely adjust the volume, which will be described later, it is desirable that the noise determination in the noise determination means 9 is performed in multiple stages. However, it may be determined by setting a certain threshold and determining whether or not the threshold is exceeded.

前記フェードアウト係数導出手段１０は、ノイズ判定手段９から出力されるノイズ成分の判定信号に基づいて音量調節するために、フェードアウト係数を導出する。フェードアウト係数導出手段１０は、上記ノイズ成分の判定信号に基づき、これを演算することによってフェードアウト係数を導出してもよい。あるいは、予めノイズ成分の判定信号に対応したフェードアウト係数を設定してこれをテーブルにしておき、ノイズ判定手段９から出力されるノイズ成分の判定信号を上記テーブルにあてはめてフェードアウト係数を導出するようにしてもよい。 The fade-out coefficient deriving unit 10 derives a fade-out coefficient in order to adjust the volume based on the noise component determination signal output from the noise determination unit 9. The fade-out coefficient deriving unit 10 may derive the fade-out coefficient by calculating the noise component based on the determination signal of the noise component. Alternatively, a fade-out coefficient corresponding to the noise component determination signal is set in advance and this is set in a table, and the noise component determination signal output from the noise determination means 9 is applied to the table to derive the fade-out coefficient. May be.

前記音量調節手段１１には、前記音声帯域のデジタル入力信号とフェードアウト係数導出手段１０で導出されたフェードアウト係数信号が入力される。音量調節手段１１は、上記音声帯域のデジタル入力信号のレベルを、上記フェードアウト係数に基づいて調節し、フェードアウト係数の大小に応じて音量を調節した音声信号を出力する。図４は赤外線ワイヤレス受信機における赤外線受光ユニットの受信レベルに対するフェードアウト特性の一例で、点線による曲線はノイズ量を、破線による曲線はフェードアウト量を、実線による曲線は出力音量を示している。横軸は受信レベル（ｄＢμＶ）であり、縦軸は上記ノイズ量、フェードアウト量、出力音量をそれぞれ百分率（％）で表している。図４から明らかなように、受信レベルがある程度以上であればノイズ量はごくわずかであり、フェードアウトさせる必要はないから、出力音量は１００％に維持される。受信レベルが３０．０（ｄＢμＶ）以下になるとノイズ成分の割合が無視することができないほどに多くなるため、この辺りからフェードアウト量を立ち上がらせ、フェードアウト量が多くなるに従って出力音量を低下させるようになっている。 The sound volume adjusting means 11 is inputted with the digital input signal of the voice band and the fade-out coefficient signal derived by the fade-out coefficient deriving means 10. The volume adjusting means 11 adjusts the level of the digital input signal in the audio band based on the fade-out coefficient, and outputs an audio signal whose volume is adjusted according to the magnitude of the fade-out coefficient. FIG. 4 is an example of a fade-out characteristic with respect to the reception level of the infrared light receiving unit in the infrared wireless receiver. A dotted curve indicates the amount of noise, a broken curve indicates the fade-out amount, and a solid curve indicates the output volume. The horizontal axis represents the reception level (dBμV), and the vertical axis represents the noise amount, the fade-out amount, and the output volume in percentage (%). As apparent from FIG. 4, if the reception level is a certain level or more, the amount of noise is negligible and there is no need to fade out, so the output volume is maintained at 100%. When the reception level is 30.0 (dBμV) or less, the ratio of the noise component increases so much that it cannot be ignored. Therefore, the fade-out amount is increased from this area, and the output volume is lowered as the fade-out amount increases. It has become.

音量調節手段１１で音量調節され出力される信号は、Ｄ／Ａ変換器でアナログの音声信号に変換され、増幅器、その他適宜の回路を経由し、スピーカから音声が放出される。 The signal whose volume is adjusted by the volume adjusting means 11 and output is converted into an analog audio signal by the D / A converter, and the sound is emitted from the speaker via an amplifier and other appropriate circuits.

音量調節手段１１は、例えば減衰器（アッテネータ）で構成することができる。図４に示すフェードアウト特性、すなわちノイズ量に対するフェードアウト量の関係は一例であって、使用目的、通信方式、求められる通信の品質などによりフェードアウト特性を適宜調整できるようにするとよい。例えば、上記音量調節手段１１を構成する減衰器による減衰特性を調整することによってフェードアウト特性を調整することができる。 The volume control means 11 can be configured by an attenuator, for example. The fade-out characteristic shown in FIG. 4, that is, the relationship between the fade-out amount and the noise amount is merely an example, and the fade-out characteristic may be appropriately adjusted depending on the purpose of use, communication method, required communication quality, and the like. For example, the fade-out characteristic can be adjusted by adjusting the attenuation characteristic of the attenuator constituting the volume control means 11.

音量調節手段１１は、フェードアウト係数の大小に応じて音量すなわち音声信号の出力レベルを連続的に調節するように構成してもよいし、段階的に調節するようにしてもよい。 The volume control means 11 may be configured to continuously adjust the volume, that is, the output level of the audio signal, according to the magnitude of the fade-out coefficient, or may be adjusted in stages.

図１に示す各ブロックは、これらのブロックごとに物理的に存在するものであっても差し支えないが、各ブロックの果たす機能を全て、デジタル・シグナル・プロセッサ（ＤｉｇｉｔａｌＳｉｇｎａｌＰｒｏｃｅｓｓｏｒ：ＤＳＰ）に持たせるとよい。 Each block shown in FIG. 1 may physically exist for each of these blocks, but all functions performed by each block are given to a digital signal processor (DSP). Good.

以上説明した第１実施例によれば、デジタル方式の音声処理装置であっても、受信状態に応じてフェードアウト量を変化させることにより、違和感のないいわゆるソフトミュートを実現することができる。 According to the first embodiment described above, even a digital audio processing device can realize so-called soft mute without a sense of incongruity by changing the fade-out amount according to the reception state.

次に、図２に示す第２実施例について説明する。第２実施例は、図１に示す第１実施例の構成に、ノイズ低減手段４、第２の高速フーリエ変換手段５、第２の絶対値算出手段６、音声復元手段７を付加したものである。これによって信号伝達経路が若干異なっていることが第１実施例と異なっているので、この異なった構成部分を重点的に説明する。 Next, a second embodiment shown in FIG. 2 will be described. In the second embodiment, the noise reduction means 4, the second fast Fourier transform means 5, the second absolute value calculation means 6, and the sound restoration means 7 are added to the configuration of the first embodiment shown in FIG. is there. Since the signal transmission path is slightly different from that of the first embodiment, the different components will be described mainly.

図２において、音声帯域信号のデジタル入力信号が、高速フーリエ変換手段１、絶対値算出手段２、自己相関関数算出手段３、差分算出手段８、ノイズ判定手段９、フェードアウト係数導出手段１０、音量調節手段１１を順に経ることによって、それぞれの手段で所定の処理が行われる点では第１実施例と同じである。高速フーリエ変換手段１は、第２の高速フーリエ変換手段５と区別するため、「第１の高速フーリエ変換手段１」という。同様に、絶対値算出手段２は「第１の絶対値算出手段２」という。 In FIG. 2, the digital input signal of the audio band signal is a fast Fourier transform means 1, an absolute value calculation means 2, an autocorrelation function calculation means 3, a difference calculation means 8, a noise determination means 9, a fade-out coefficient derivation means 10, a volume control. It is the same as the first embodiment in that a predetermined process is performed by each means by passing through the means 11 in order. The fast Fourier transform unit 1 is referred to as “first fast Fourier transform unit 1” in order to distinguish it from the second fast Fourier transform unit 5. Similarly, the absolute value calculation means 2 is referred to as “first absolute value calculation means 2”.

自己相関関数算出手段３から出力される自己相関関数信号が差分算出手段８に入力されるようになっている点では第１実施例と同じであるが、自己相関関数信号がノイズ低減手段４に入力されるように構成されている点は第１実施例と異なっている。ノイズ低減手段４は例えばローパスフィルタなどで構成され、上記自己相関関数信号から高域成分を取り除く。これによって、周波数領域の信号からホワイトノイズなどの非周期性ノイズ成分を低減する。 The autocorrelation function signal output from the autocorrelation function calculation unit 3 is the same as that of the first embodiment in that the autocorrelation function signal is input to the difference calculation unit 8. The point of being configured to input is different from the first embodiment. The noise reduction means 4 is composed of, for example, a low-pass filter and removes high-frequency components from the autocorrelation function signal. This reduces non-periodic noise components such as white noise from the frequency domain signal.

ノイズ低減手段４の出力信号は第２の高速フーリエ変換手段５に入力され、ノイズ低減手段４の出力信号が高速フーリエ変換されることによって周波数領域の信号に戻される。この周波数領域の信号は複素数で表されており、この複素数で表されている信号からゲインを算出するために、第２の絶対値算出手段６において絶対値を算出する。 The output signal of the noise reduction means 4 is input to the second fast Fourier transform means 5, and the output signal of the noise reduction means 4 is fast Fourier transformed to be returned to the frequency domain signal. The signal in the frequency domain is represented by a complex number. In order to calculate the gain from the signal represented by the complex number, the second absolute value calculating unit 6 calculates an absolute value.

第２の絶対値算出手段６において算出された上記周波数領域の信号の絶対値信号すなわち周波数領域の信号のゲイン信号は音声復元手段７に入力される。音声復元手段７には、前記第１の高速フーリエ変換手段１においてデジタル音声帯域入力信号を時間領域の信号から周波数領域の信号に変換することによって得られる位相信号が入力される。音声復元手段７では、上記ゲイン信号と上記周波数領域の信号を組み合わせることによって、デジタル音声帯域信号を復元する。このように、ノイズ低減手段４から、第２の高速フーリエ変換手段５、第２の絶対値算出手段６、音声復元手段７に至る構成部分は、自己相関関数算出手段３の出力からデジタル音声帯域信号を復元し生成するための回路を構成している。こうして復元されるデジタル音声帯域信号は音量調節手段１１に入力される。 The absolute value signal of the frequency domain signal calculated by the second absolute value calculation means 6, that is, the gain signal of the frequency domain signal is input to the sound restoration means 7. The audio restoration means 7 receives a phase signal obtained by converting the digital audio band input signal from the time domain signal to the frequency domain signal in the first fast Fourier transform means 1. The audio restoration means 7 restores the digital audio band signal by combining the gain signal and the frequency domain signal. As described above, the components from the noise reduction unit 4 to the second fast Fourier transform unit 5, the second absolute value calculation unit 6, and the audio restoration unit 7 are connected to the digital audio band from the output of the autocorrelation function calculation unit 3. A circuit for restoring and generating the signal is configured. The digital audio band signal restored in this way is input to the volume control means 11.

差分算出手段８から、ノイズ判定手段９、フェードアウト係数導出手段１０、音量調節手段１１に至る回路構成は、前記第１実施例における回路構成と同じである。差分算出手段８は、自己相関関数算出手段３で得られる自己相関関数に含まれる相関波形の振幅の差分すなわち変化量を求めるためにハイパスフィルタ処理を行い、非周期性のノイズ信号成分が多い信号か否かを判定する。フェードアウト係数導出手段１０は、ノイズ判定手段９から出力されるノイズ成分の判定信号に基づいて音量調節するために、フェードアウト係数を導出する。音量調節手段１１は、上記音声帯域のデジタル入力信号のレベルを、上記フェードアウト係数に基づいて調節し、フェードアウト係数の大小に応じてフェードアウト量を調節する。音量調節手段１１は、上記音声帯域のデジタル入力信号のレベルを、上記フェードアウト係数に基づいて調節し、フェードアウト係数の大小に応じて音量を調節した音声信号を出力する。 The circuit configuration from the difference calculating unit 8 to the noise determining unit 9, the fade-out coefficient deriving unit 10, and the volume adjusting unit 11 is the same as the circuit configuration in the first embodiment. The difference calculation means 8 performs high-pass filter processing to obtain the amplitude difference, that is, the amount of change of the correlation waveform included in the autocorrelation function obtained by the autocorrelation function calculation means 3, and a signal having a large amount of non-periodic noise signal components It is determined whether or not. The fade-out coefficient deriving unit 10 derives a fade-out coefficient in order to adjust the volume based on the noise component determination signal output from the noise determination unit 9. The volume control unit 11 adjusts the level of the digital input signal in the audio band based on the fade-out coefficient, and adjusts the fade-out amount according to the magnitude of the fade-out coefficient. The volume adjusting means 11 adjusts the level of the digital input signal in the audio band based on the fade-out coefficient, and outputs an audio signal whose volume is adjusted according to the magnitude of the fade-out coefficient.

第１実施例と同様に、音量調節手段１１で音量調節され出力される信号は、Ｄ／Ａ変換器でアナログの音声信号に変換され、増幅器、その他適宜の回路を経由し、スピーカから音声が放出される。また、音量調節手段１１は、例えば減衰器（アッテネータ）で構成することができる。さらに、図２に示す各機能ブロックは全て、デジタル・シグナル・プロセッサ（ＤＳＰ）で構成するとよい。音量調節手段１１による音量調節は、フェードアウト係数の大小に応じて連続的に行ってもよいし、段階的に行ってもよい。 Similar to the first embodiment, the signal whose volume is adjusted by the volume adjusting means 11 and output is converted into an analog audio signal by the D / A converter, and the sound is output from the speaker via an amplifier and other appropriate circuits. Released. Moreover, the volume control means 11 can be comprised with an attenuator (attenuator), for example. Further, all the functional blocks shown in FIG. 2 may be constituted by a digital signal processor (DSP). The volume adjustment by the volume adjustment unit 11 may be performed continuously or stepwise depending on the magnitude of the fade-out coefficient.

第２実施例によれば、第１実施例と同様の効果に加えて、音声帯域のデジタル入力信号に含まれるノイズ成分を低減したことにより、さらにノイズ感の少ない音声出力を得ることができる。 According to the second embodiment, in addition to the same effects as those of the first embodiment, the noise component included in the digital input signal in the audio band is reduced, so that an audio output with less noise can be obtained.

本発明は、音声帯域信号をワイヤレスで送受信し、音声を再生する各種装置ないしはシステム、例えば、電波や光を伝達媒体とした音声会議システム、放送あるいは通信システムにおける音声処理装置として好適である。 The present invention is suitable as an audio processing apparatus in various apparatuses or systems for transmitting and receiving audio band signals wirelessly and reproducing audio, for example, an audio conference system using radio waves or light as a transmission medium, broadcasting or a communication system.

１高速フーリエ変換手段
２絶対値算出手段
３自己相関関数算出手段
４ノイズ低減手段
５第２の高速フーリエ変換手段
６第２の絶対値算出手段
７音声復元手段
８差分算出手段
９ノイズ判定手段
１０フェードアウト係数導出手段
１１音量調節手段
DESCRIPTION OF SYMBOLS 1 Fast Fourier transform means 2 Absolute value calculation means 3 Autocorrelation function calculation means 4 Noise reduction means 5 Second fast Fourier transform means 6 Second absolute value calculation means 7 Speech restoration means 8 Difference calculation means 9 Noise determination means 10 Fade out Coefficient derivation means 11 Volume control means

Claims

A fast Fourier transform means for performing a Fourier transform of a digital input signal in a voice band at a high speed;
Absolute value calculating means for generating an absolute value signal by calculating an absolute value of the Fourier transformed signal;
Autocorrelation function calculating means for calculating an autocorrelation function of the calculated absolute value signal;
Difference calculating means for calculating the amount of change in output of the autocorrelation function calculating means;
Noise determining means for calculating a noise component according to the magnitude of the difference;
A fade-out coefficient deriving unit that calculates a fade-out amount based on the calculation result by the noise determination unit;
Volume adjusting means for adjusting and outputting the level of the digital input signal based on the calculation result by the fade-out coefficient deriving means;
A speech processing apparatus comprising:

First fast Fourier transform means for fast Fourier transform of a digital input signal in a voice band;
First absolute value calculating means for generating an absolute value signal by calculating an absolute value of the Fourier transformed signal;
Autocorrelation function calculating means for calculating an autocorrelation function of the calculated absolute value signal;
Difference calculating means for calculating the amount of change in output of the autocorrelation function calculating means;
Noise determining means for calculating a noise component according to the magnitude of the difference;
A fade-out coefficient deriving unit that calculates a fade-out amount based on the calculation result by the noise determination unit;
Second fast Fourier transform means for fast Fourier transforming the output of the autocorrelation function calculating means;
Second absolute value calculating means for generating an absolute value signal by calculating an absolute value of the signal subjected to Fourier transform by the second fast Fourier transform means;
Speech restoration means for restoring speech from a phase signal obtained by fast Fourier transforming the digital input signal by the first fast Fourier transform means and an absolute value signal output from the second absolute value calculation means;
Volume adjusting means for adjusting and outputting the level of the audio signal restored by the audio restoring means based on the calculation result by the fade-out coefficient deriving means;
A speech processing apparatus comprising:

The speech processing apparatus according to claim 2, wherein a noise reduction means is provided between the autocorrelation function calculation means and the second fast Fourier transform means.

4. The speech processing apparatus according to claim 3, wherein the noise reduction means is a filter that removes a high frequency component of the function calculated by the autocorrelation function calculation means.

The sound processing apparatus according to claim 1, wherein the volume adjusting means continuously adjusts the level of the sound signal.

The sound processing apparatus according to claim 1, wherein the volume adjusting means adjusts the level of the sound signal stepwise.