JP4206409B2

JP4206409B2 - Audio processing apparatus, method thereof, program, and recording medium recording the program

Info

Publication number: JP4206409B2
Application number: JP2006073434A
Authority: JP
Inventors: 末廣島内; 暁江村
Original assignee: Nippon Telegraph and Telephone Corp
Current assignee: Nippon Telegraph and Telephone Corp
Priority date: 2006-03-16
Filing date: 2006-03-16
Publication date: 2009-01-14
Anticipated expiration: 2026-03-16
Also published as: JP2007251676A

Abstract

<P>PROBLEM TO BE SOLVED: To attain acceleration of a reaction speed by suppressing or intensifying an input signal in accordance with a change in the character of the relevant signal. <P>SOLUTION: The input signal is converted into a discrete frequency domain signal x(ωn) for each frame and divided into a plurality of groups so as to include one or more signals, an average root AX of total power sums of input signals within the frame is determined, an amplitude average AHm of signals within the group is determined for each group, and magnitudes of a first target amplitude TX and AX are compared. If AX is greater, a first standardized average value AHm×TX/AX is determined and the magnitude of the standardized average value is compared with that of amplitudes ¾x(ωn)¾ of signals in that group. If ¾x(ωn)¾ is greater, a first compression function is applied to that amplitude and the signal is output after the amplitude is suppressed to become closer to TX. If ¾x(ωn)¾ is not greater, the signal is output while keeping its amplitude as it is, and the entire output signal is converted into a time domain signal. <P>COPYRIGHT: (C)2007,JPO&INPIT

Description

この発明はディジタル化された入力信号のダイナミックレンジを所望の範囲に圧縮する音声処理装置、その方法、プログラム、及びそのプログラムを記録した記録媒体に関する。 The present invention relates to an audio processing apparatus for compressing a dynamic range of a digitized input signal to a desired range, a method and a program thereof, and a recording medium on which the program is recorded.

通信会議や補聴器システムなどにおいて、スピーカなどに対する再生ボリューム（音声調整量）を一定に保ったままでも、再生される音声内容が容易に理解できるようにするために、再生すべき音声信号を入力信号とし、その入力信号のダイナミックレンジを圧縮し、一定の音量範囲に納まるように処理して出力することにより、スピーカなどで再生させる音声処理方法がある。
例えば、従来、リミッタやコンプレッサと呼ばれる手法では、予め定義された非線形圧縮関数に基づき、一定の大きさ以上の音声信号を入力すると、その大きさが強制的に抑圧された信号を出力する。これらの技術は、抑圧量が大きくなるに従い、音声の聴感上の歪みが増大する問題がある。「非特許文献１」に示されるように、信号を複数の周波数帯域に分割または、変換し、それぞれの周波数帯域ごとに異なる圧縮関数を適用すれば、上記の歪みの問題を低減できることが期待される。「非特許文献１」に示される手法では、周波数帯域ごとに予め与えられた固定の圧縮関数を持つが、それら圧縮関数には、各周波数帯域に変換された入力信号そのものではなく、それら各離散周波数領域信号の振幅の時間平均値などが入力として与えられる。これより得られる圧縮された振幅の平均時間値と、圧縮関数適用前の振幅の時間平均値の比を、各周波数帯域ごとに計算し、これらの比を各離散周波数領域信号に乗じることで、各周波数帯域における圧縮処理を完了させる。これら各周波数帯域ごとに圧縮処理された信号は、周波数合成処理を施され、最終的な出力信号となる。
Ｔ．ＳｃｈｎｅｉｄｅｒａｎｄＲ．Ｂｒｅｎｎａｎ，“Ａｍｕｌｔｉｃｈａｎｎｅｌｃｏｍｐｒｅｓｓｉｏｎａｔｒａｔｅｇｙｆｏｒａｄｉｇｉｔａｌｈｅａｒｉｎｇａｉｄ”，１９９７ＩＥＥＥＩｎｔｅｒｎａｔｉｏｎａｌＣｏｎｆｅｒｅｎｃｅｏｎＡｃｏｕｓｔｉｃｓ，Ｓｐｅｅｃｈ，ａｎｄＳｉｇｎａｌＰｒｏｃｅｓｓｉｎｇ（ＩＣＡＳＳＰ−９７），ｖｏｌ．１，ｐｐ．４１１−４１４，Ａｐｒｉｌ１９９７ In teleconferences and hearing aid systems, the audio signal to be played is used as an input signal so that the audio content to be played can be easily understood even if the playback volume (sound adjustment amount) for the speaker is kept constant. In addition, there is an audio processing method in which the dynamic range of the input signal is compressed, processed so as to fall within a certain volume range, and output by a speaker or the like.
For example, conventionally, in a technique called a limiter or a compressor, when an audio signal having a certain level or more is input based on a predefined nonlinear compression function, a signal whose size is forcibly suppressed is output. These techniques have the problem that the distortion in the audibility of the sound increases as the amount of suppression increases. As shown in “Non-patent Document 1,” it is expected that the above-described distortion problem can be reduced by dividing or converting a signal into a plurality of frequency bands and applying different compression functions for each frequency band. The The technique disclosed in “Non-patent Document 1” has a fixed compression function given in advance for each frequency band, but these compression functions include not the input signal itself converted into each frequency band but each of these discrete functions. The time average value of the amplitude of the frequency domain signal is given as an input. By calculating the ratio between the average time value of the compressed amplitude obtained from this and the time average value of the amplitude before applying the compression function for each frequency band, and multiplying these discrete frequency domain signals by these ratios, The compression process in each frequency band is completed. The signals compressed for each frequency band are subjected to frequency synthesis processing and become final output signals.
T.A. Schneider and R.M. Brennan, “A multichannel compression attribution for a digital healing aid”, 1997 IEEE International Conference on Acoustics, Speech, and Signal Processing 97 (IC). 1, pp. 411-414, April 1997

「非特許文献１」に示される手法は、周波数帯域を細かく分割・変換するほど、よりきめの細かい品質の高い処理の実現が期待される反面、各離散周波数領域信号と元の入力信号の振幅の相関関係が複雑となり、各周波数帯域ごとに固定的に与えられる圧縮関数では、信号の性質の変化に柔軟に対応できず、ある性質の信号を入力した場合、品質高く圧縮できるが、他の性質の信号を入力した場合では、必ずしも、同等な品質で圧縮できないなどの問題が生じる。ここで、信号の性質の違いとは、例えば、音声の母音部分など周期性の高い信号と１音声の子音部分など、周波数成分が広範囲に分布する信号の違いなどを含む。 Although the technique shown in “Non-patent Document 1” is expected to realize finer and higher quality processing as the frequency band is divided and converted more finely, the amplitude of each discrete frequency domain signal and the original input signal is expected. The compression function given by each frequency band is complex and the compression function that is given fixedly for each frequency band cannot flexibly cope with changes in the signal properties. When a signal with a certain property is input, it can be compressed with high quality. When a signal having a characteristic is input, there is a problem that the signal cannot always be compressed with an equivalent quality. Here, the difference in signal characteristics includes, for example, a difference in signals having a wide range of frequency components such as a highly periodic signal such as a vowel part of speech and a consonant part of one speech.

また、各周波数帯域の圧縮関数の入力として、離散周波数領域信号の振幅の時間平均値が適用されるため、時間平均に必要な時定数に応じて、反応速度の遅れが生じる問題がある。 In addition, since the time average value of the amplitude of the discrete frequency domain signal is applied as the input of the compression function of each frequency band, there is a problem in that the reaction rate is delayed according to the time constant required for the time average.

離散時間の入力信号をフレームごとに離散周波数領域信号に変換し、上記離散周波数領域信号を、少なくとも１つのグループは複数の離散周波数領域信号を含むように複数のグループに分割し、上記フレーム内の入力信号の電力総和の平方根を求め、上記各分割されたグループごとに、そのグループ内の離散周波数領域信号の振幅平均を求め、圧縮処理後の期待される所望の第１の目標振幅と上記電力総和の平方根との値の大小を比較する第１の判定をし、上記第１の判定により上記電力総和の平方根の方が大であることを示す信号が入力された場合、上記第１の目標振幅と上記電力総和の平方根との比で、グループごとの振幅平均値を正規化して、第１の規格化平均値を求め、各グループごとに、上記第１の規格化平均値とそのグループの各上記離散周波数領域信号の振幅の大小を比較する第２の判定をし、上記第２の判定が、上記離散周波数領域信号の振幅の方が大であれば、その振幅に対し、出力信号が上記第１の目標振幅に近づくように抑圧する第１の圧縮関数を第１圧縮関数演算部で適用し、上記第２の判定部の判定が、上記離散周波数領域信号の振幅の方が大でなければ、その離散周波数領域信号をそのまま出力し、各グループごとに出力された離散周波数領域信号の全体を時間領域信号に変換する。 A discrete-time input signal is converted into a discrete frequency domain signal for each frame, and the discrete frequency domain signal is divided into a plurality of groups so that at least one group includes a plurality of discrete frequency domain signals. The square root of the total power of the input signal is obtained, and for each of the divided groups, the average amplitude of the discrete frequency domain signals in the group is obtained, and the desired desired first target amplitude after compression processing and the power When the first determination is made to compare the value of the square root of the sum, and when the signal indicating that the square root of the power sum is larger is input by the first determination, the first target The amplitude average value for each group is normalized by the ratio of the amplitude and the square root of the power sum to obtain a first normalized average value. For each group, the first normalized average value and the group's A second determination comparing the magnitudes of the amplitudes of the discrete frequency domain signals is made. If the amplitude of the discrete frequency domain signals is greater than the second judgment, the output signal is compared to the amplitude. A first compression function that suppresses the first target amplitude so as to approach the first target amplitude is applied by the first compression function calculation unit, and the determination by the second determination unit must be greater in amplitude of the discrete frequency domain signal. For example, the discrete frequency domain signal is output as it is, and the entire discrete frequency domain signal output for each group is converted into a time domain signal.

以上の構成によれば、入力信号が離散周波数領域信号に変換することにより、反応速度の向上を図ることが出来、また離散周波数領域信号をグループごとに分割し、そのグループごとの当該信号の性質に応じた圧縮関数により抑圧または強調することにより、安定したダイナミックレンジの圧縮が可能となる。 According to the above configuration, the response speed can be improved by converting the input signal into a discrete frequency domain signal, and the discrete frequency domain signal is divided into groups, and the characteristics of the signal for each group are divided. Stable or dynamic range compression is possible by suppressing or emphasizing with a compression function corresponding to.

実施例１
図１にこの発明の実施例１を示す。入力信号ｘ（ｔ）が周波数領域変換部２に入力されると、定時間（フレーム）、例えば入力信号ｘ（ｔ）のサンプリング周波数が１６ｋＨｚの場合、サンプル数が２５６や５１２ごとに、短時間フーリエ変換（ＦＦＴ）などにより、ω１〜ωＮまでのＮ個の周波数に対応するＮ個の離散周波数領域信号、ｘ（ω１）、．．．、ｘ（ωＮ）に変換される。ただし、入力信号は一定周期でサンプリングされ、各サンプルがディジタル値に変換されたディジタル信号であり、Ｎは整数とする。Ｎ個の離散周波数領域信号ｘ（ω１）、．．．、ｘ（ωＮ）は電力総和平方根計算部４とグループ帯域分割部６に入力される。 Example 1
FIG. 1 shows a first embodiment of the present invention. When the input signal x (t) is input to the frequency domain conversion unit 2, a fixed time (frame), for example, when the sampling frequency of the input signal x (t) is 16 kHz, the number of samples is short for every 256 or 512. N discrete frequency domain signals corresponding to N frequencies from ω1 to ωN, such as x (ω1),. . . , X (ωN). However, the input signal is a digital signal sampled at a constant period and each sample is converted to a digital value, and N is an integer. N discrete frequency domain signals x (ω1),. . . , X (ωN) are input to the power sum square root calculation unit 4 and the group band division unit 6.

グループ帯域分割部６で、離散周波数領域信号は周波数番号１〜Ｎについて順に、Ｍ個のグループに分割される。ただし、ＭはＮより小さく、１以上の整数とする。グループの分割の方法については、例えば、周波数について等分割するか、もしくは、低い周波数グループは比較的細かく、高い周波数領域は比較的粗く分割するなどの方法が考えられる。また、１つのグループに１つの離散周波数領域信号のみ含まれるグループが存在しても良いが、２つ以上の離散周波数領域信号を含むグループが最低１つは存在するものとする。例えば、１ｋＨｚごとに等分割する。離散周波数領域信号ｘ（ω１）、．．．、ｘ（ωＮ）は各分割グループごとに、グループ振幅平均計算部８ｍ、第１の圧縮関数制御部１０ｍ、第１の圧縮関数適用部１２ｍに入力される。ただしｍは１〜Ｍまでの整数とする。 In the group band dividing unit 6, the discrete frequency domain signals are divided into M groups in order for the frequency numbers 1 to N. However, M is smaller than N and is an integer of 1 or more. As a method of dividing the group, for example, a method of dividing the frequency equally or a method of dividing the low frequency group relatively finely and the high frequency region relatively coarsely can be considered. Further, a group including only one discrete frequency domain signal may exist in one group, but it is assumed that at least one group including two or more discrete frequency domain signals exists. For example, equal division is performed every 1 kHz. Discrete frequency domain signals x (ω1),. . . , X (ωN) is input to the group amplitude average calculation unit 8m, the first compression function control unit 10m, and the first compression function application unit 12m for each divided group. However, m is an integer from 1 to M.

一方、電力総和平方根計算部４で、ｘ（ω１）、．．．、ｘ（ωＮ）の電力の総和の平方根ＡＸを算出し、電力の総和の平方根ＡＸは各グループごとの第１の圧縮関数制御部１０ｍと第１の圧縮関数適用部１２ｍに入力される。なお、電力総和平方根計算部４では図１中に破線で示すように、入力信号ｘ（ｔ）を入力し、そのフレームの対象サンプルの電力の総和の平方根の計算をしてもよい。各グループごとに、グループ振幅平均計算部８ｍで、そのグループ中の離散周波数領域信号の振幅平均ＡＨｍを計算する。振幅平均ＡＨｍはそれぞれ対応する圧縮関数制御部１０ｍと圧縮関数適用部１２ｍに入力される。入力部１４から、入力信号に対して、圧縮処理後の期待される所望の振幅（以下、第１の目標振幅）ＴＸが入力され、第１の目標振幅ＴＸは、各グループごとの第１の圧縮関数制御部１０ｍと第１の圧縮関数適用部１２ｍに入力される。第１の目標振幅ＴＸの値は、この装置が適用されるシステムのスピーカや出力増幅器のダイナミックレンジなどにより決定され、信号歪みが生じないような値が選定される。 On the other hand, in the power sum square root calculation unit 4, x (ω1),. . . , X (ωN), and the square root AX of the sum of powers is input to the first compression function controller 10m and the first compression function application unit 12m for each group. The power sum square root calculation unit 4 may input the input signal x (t) as shown by a broken line in FIG. 1 and calculate the square root of the sum of the powers of the target samples in the frame. For each group, the group amplitude average calculation unit 8m calculates the average amplitude AHm of the discrete frequency domain signals in the group. The amplitude average AHm is input to the corresponding compression function control unit 10m and compression function application unit 12m, respectively. An expected desired amplitude after compression processing (hereinafter referred to as a first target amplitude) TX is input to the input signal from the input unit 14, and the first target amplitude TX is the first target amplitude for each group. The data is input to the compression function control unit 10m and the first compression function application unit 12m. The value of the first target amplitude TX is determined by the dynamic range of the speaker or output amplifier of the system to which this apparatus is applied, and a value that does not cause signal distortion is selected.

図２に第１の圧縮関数制御部１０ｍと第１の圧縮関数適用部１２ｍの詳細例と、これに関連する部分の図を示す。第１の圧縮関数制御部１０ｍは、振幅絶対値化部１０１ｍ、第２の判定部１０２ｍ、第１の規格化平均値計算部１０４ｍ、により構成され、
第１の圧縮関数適用部１２ｍは切替スイッチ１０６ｍ、第１の圧縮関数演算部１３０ｍ、位相付与部１２６ｍ、位相計算部１２８ｍにより構成され、
入力部１４はＴＸ入力部１４００とα入力部１４０２により構成される。
電力総和平方根計算部４よりの電力総和の平方根ＡＸと、ＴＸ入力部１４００よりの第１の目標振幅ＴＸと、グループ振幅平均計算部８ｍよりグループ内の離散周波数領域信号の振幅平均ＡＨｍがそれぞれ、第１の規格化平均値計算部１０４ｍに入力され、第１の規格化平均値計算部１０４ｍで、ＡＨｍ・ＴＸ／ＡＸを計算することにより、第１の規格化平均値が計算され、第２の判定部１０２ｍに入力される。各グループの振幅平均値ＡＨｍは第１の目標振幅ＴＸと電力総和平方根ＡＸとの比ＴＸ／ＡＸにより規格化される。 FIG. 2 shows a detailed example of the first compression function control unit 10m and the first compression function application unit 12m, and a diagram of parts related thereto. The first compression function control unit 10m includes an amplitude absolute value conversion unit 101m, a second determination unit 102m, and a first normalized average value calculation unit 104m.
The first compression function application unit 12m includes a changeover switch 106m, a first compression function calculation unit 130m, a phase applying unit 126m, and a phase calculation unit 128m.
The input unit 14 includes a TX input unit 1400 and an α input unit 1402.
The square root AX of the power sum from the power sum square root calculation unit 4, the first target amplitude TX from the TX input unit 1400, and the amplitude average AHm of the discrete frequency domain signals in the group from the group amplitude average calculation unit 8m, respectively. The first normalized average value is input to the first normalized average value calculation unit 104m, and the first normalized average value calculation unit 104m calculates AHm · TX / AX to calculate the first normalized average value. Is input to the determination unit 102m. The amplitude average value AHm of each group is normalized by the ratio TX / AX of the first target amplitude TX and the power sum square root AX.

一方、入力されたそのグループの離散周波数領域信号ｘ（ωｎ）が振幅絶対値化部１０１ｍ、位相計算部１２８ｍに入力される。振幅絶対値化部１０１ｍで離散周波数領域信号ｘ（ωｎ）の振幅｜ｘ（ωｎ）｜が求められ、第２の判定部１０２ｍと、切替スイッチ１０６ｍに入力される。第２の判定部１０２ｍで｜ｘ（ωｎ）｜とＡＨｍ・ＴＸ／ＡＸの値の大小が比較される。振幅｜ｘ（ωｎ）｜の方が大であれば、第２の判定部１０２ｍの出力により切替スイッチ１０６ｍが固定接点１０６２ｍ側に切り替えられ、振幅｜ｘ（ωｎ）｜の方が大でなければ、固定接点１０６１ｍ側に切り替えられる。固定接点１０６２ｍ側に切り替えられた場合、離散周波数領域信号の振幅ｘ（ωｎ）に第１の圧縮関数が第１の圧縮関数演算部１３０ｍで適用される。第１の圧縮関数は例えば、次式で表せる。 On the other hand, the input discrete frequency domain signal x (ωn) of the group is input to the amplitude absolute value converting unit 101m and the phase calculating unit 128m. The amplitude | x (ωn) | of the discrete frequency domain signal x (ωn) is obtained by the amplitude absolute value converting unit 101m and input to the second determination unit 102m and the changeover switch 106m. The second determination unit 102m compares | x (ωn) | with the magnitude of the values of AHm · TX / AX. If the amplitude | x (ωn) | is larger, the changeover switch 106m is switched to the fixed contact 1062m side by the output of the second determination unit 102m, and the amplitude | x (ωn) | is not larger. , Switching to the fixed contact 1061m side. When switched to the fixed contact 1062m side, the first compression function calculation unit 130m applies the first compression function to the amplitude x (ωn) of the discrete frequency domain signal. The first compression function can be expressed by the following equation, for example.

α｜ｘ（ωｎ）｜＋（１−α）ＡＨｍ・ＴＸ／ＡＸ
ここでαはα入力部１４０２より入力され、抑圧の程度を決定する０から１の範囲の実数であり、小さな値を与えるほど、大きく抑圧されることになる。なお、αは０．２〜０．５であることが望ましい。なお、この圧縮関数の演算に利用するため、第１の規格化平均値計算部１０４ｍで計算されたＡＨｍ・ＴＸ／ＡＸが第１の圧縮関数演算部１３０ｍに入力される。その演算結果は位相付与部１２６ｍに入力される。固定接点１０６１ｍより振幅｜ｘ（ωｎ）｜は位相付与部１２６ｍに直接入力される。 α | x (ωn) | + (1-α) AHm · TX / AX
Here, α is input from the α input unit 1402 and is a real number ranging from 0 to 1 that determines the degree of suppression. The smaller the value, the greater the suppression. Α is preferably 0.2 to 0.5. Note that AHm · TX / AX calculated by the first normalized average value calculation unit 104m is input to the first compression function calculation unit 130m to be used for calculation of the compression function. The calculation result is input to the phase applying unit 126m. The amplitude | x (ωn) | is directly input to the phase applying unit 126m from the fixed contact 1061m.

つまり、第１の圧縮関数適用部１２ｍではそのグループの各離散周波数領域信号の振幅｜ｘ（ωｎ）｜に対し、以下の（式１）の制御を行う。
（ａ）｜ｘ（ωｎ）｜＞ＡＨｍ・ＴＸ／ＡＸのとき
Ｆｍ（｜ｘ（ωｎ）｜）＝α｜ｘ（ωｎ）｜＋（１−α）ＡＨｍ・ＴＸ／ＡＸ
（ｂ）それ以外のとき
Ｆｍ（｜ｘ（ωｎ）｜）＝｜ｘ（ωｎ）｜（式１）
つまり、第１の圧縮関数制御部１０ｍではグループごとのその各離散周波数領域信号の振幅がそのグループの平均振幅ＡＨｍに基づき抑圧するか否かの判定が行われており、その判定結果により、第１の圧縮関数適用部１２ｍで、振幅｜ｘ（ωｎ）｜に対して抑圧するか、そのままにするかの制御が行われる。 That is, the first compression function application unit 12m performs the following control (Equation 1) for the amplitude | x (ωn) | of each discrete frequency domain signal of the group.
When (a) | x (ωn) |> AHm · TX / AX, Fm (| x (ωn) |) = α | x (ωn) | + (1-α) AHm · TX / AX
(B) Otherwise, Fm (| x (ωn) |) = | x (ωn) | (Formula 1)
That is, the first compression function control unit 10m determines whether or not the amplitude of each discrete frequency domain signal for each group is suppressed based on the average amplitude AHm of the group. 1 compression function application unit 12m controls whether the amplitude | x (ωn) | is suppressed or left as it is.

グループｍの離散周波数領域信号ｘ（ωｎ）は、位相計算部１２８ｍに入力される。位相計算部１２８ｍで、ｘ（ωｎ）の位相∠ｘ（ωｎ）が計算され、この計算結果∠ｘ（ωｎ）が位相付与部１２６ｍに入力される。位相付与部１２６ｍで、Ｙ（ωｎ）＝Ｆｍ（｜ｘ（ωｎ）｜）・∠ｘ（ωｎ）が計算され、第１の圧縮関数適用部１２ｍから出力され、時間領域変換部１６に入力される。
なお、位相付与部１２６ｍ、位相計算部１２８ｍを特に設けることなく、（式１）の演算において、いずれの場合にもｘ（ωｎ）／｜ｘ（ωｎ）｜を乗算させたものを時間領域変換部１６へ出力するようにしてもよい。各グループの第１の圧縮関数適用部１２ｍの出力を、時間領域変換部１６で例えば、短時間逆フーリエ変換などで、時間領域信号に変換されて、出力される。 The discrete frequency domain signal x (ωn) of the group m is input to the phase calculation unit 128m. The phase calculation unit 128m calculates the phase ∠x (ωn) of x (ωn), and the calculation result ∠x (ωn) is input to the phase applying unit 126m. In the phase applying unit 126m, Y (ωn) = Fm (| x (ωn) |) · で x (ωn) is calculated, output from the first compression function applying unit 12m, and input to the time domain converting unit 16. The
In addition, in the calculation of (Expression 1), x (ωn) / | x (ωn) | is multiplied in any case in the calculation of (Equation 1) without providing the phase adding unit 126m and the phase calculating unit 128m. The data may be output to the unit 16. The output of the first compression function application unit 12m of each group is converted into a time domain signal by the time domain conversion unit 16 by, for example, a short time inverse Fourier transform, and is output.

この実施例１により、入力信号ｘ（ｔ）を短時間フーリエ変換などにより、周波数成分ごとの信号に変換し、それら各離散周波数領域信号のグループにまとめ、各グループ固有の性質を有する第１の圧縮関数を与える。このとき、第１の圧縮関数は、各グループの離散周波数領域信号ｘ（ωｎ）の瞬時の振幅｜ｘ（ωｎ）｜が規格化平均値ＡＨｍ・ＴＸ／ＡＸより大きい場合のみ、対応する離散周波数領域信号ｘ（ωｎ）の振幅｜ｘ（ωｎ）｜を抑圧する。これにより、入力信号が第１の目標振幅よりも大きな振幅を持っている場合、各グループの平均振幅によって捉えられる入力信号のマクロ的な周波数特性に応じ、各グループの圧縮基準が瞬時に自動的に決定される。 According to the first embodiment, the input signal x (t) is converted into a signal for each frequency component by a short-time Fourier transform or the like, and is grouped into a group of each discrete frequency domain signal. Gives the compression function. At this time, the first compression function has a corresponding discrete frequency only when the instantaneous amplitude | x (ωn) | of the discrete frequency domain signal x (ωn) of each group is larger than the normalized average value AHm · TX / AX. The amplitude | x (ωn) | of the region signal x (ωn) is suppressed. Thereby, when the input signal has an amplitude larger than the first target amplitude, the compression standard of each group is automatically and instantaneously according to the macro frequency characteristics of the input signal captured by the average amplitude of each group. To be determined.

なお、図１中に破線で示すように、電力総和平方根ＡＸと第１の目標振幅ＴＸとの大小を第１の判定部１３６で比較判定し、ＴＸ＞ＡＸと判定されると、その判定出力で、スイッチ１１２をオンにして、当該短時間フーリエ変換区間の入力信号をそのまま出力する。また、例えば、第１の判定部１３６でＴＸ＞ＡＸでないと判定されると、各グループの第１の圧縮関数制御部１０ｍの動作を禁止する構成にしてもよい。
実施例２
次にこの発明の実施例２を説明する。 As indicated by a broken line in FIG. 1, the first determination unit 136 compares and determines the magnitude of the power sum square root AX and the first target amplitude TX. When TX> AX is determined, the determination output is obtained. Then, the switch 112 is turned on, and the input signal in the short-time Fourier transform section is output as it is. Further, for example, when the first determination unit 136 determines that TX> AX is not satisfied, the operation of the first compression function control unit 10m of each group may be prohibited.
Example 2
Next, a second embodiment of the present invention will be described.

実施例１では、第１の目標振幅ＴＸより、電力総和の平方根、つまり、入力信号の振幅平均の推定値ＡＸが大きい場合は、入力信号の振幅を抑圧した。しかし、ＴＸ＞ＡＸの場合は、振幅抑圧処理は行われていない。この実施例２では、ＴＸ以下の第２の目標振幅ＤＸを予め決め、ＤＸ＞ＡＸの場合に入力信号を強調してダイナミックレンジを更に圧縮する。
先に述べたように第１の判定部１３６で電力総和平方根計算部４よりの電力総和の平方根ＡＸと、入力部１４よりの所望の振幅ＴＸの大小を比較し、ＴＸ＜ＡＸを満たすフレームについては、第１の実施例で説明した第１の圧縮関数による抑圧する。実施例２では、ＴＸ＜ＡＸを満たさないフレームについて、強調したい所望振幅（以下第２の目標振幅）ＤＸを定め、第２の目標振幅に近づくよう、入力信号を強調して、ダイナミックレンジをさらに圧縮する。 In the first embodiment, when the square root of the power sum, that is, the estimated average value AX of the amplitude of the input signal is larger than the first target amplitude TX, the amplitude of the input signal is suppressed. However, when TX> AX, the amplitude suppression process is not performed. In the second embodiment, the second target amplitude DX equal to or lower than TX is determined in advance, and when DX> AX, the input signal is emphasized to further compress the dynamic range.
As described above, the first determination unit 136 compares the square root AX of the power sum from the power sum square root calculation unit 4 with the magnitude of the desired amplitude TX from the input unit 14, and the frame satisfies TX <AX. Are suppressed by the first compression function described in the first embodiment. In the second embodiment, for a frame that does not satisfy TX <AX, a desired amplitude (hereinafter referred to as a second target amplitude) DX to be emphasized is determined, and the input signal is enhanced so as to approach the second target amplitude to further increase the dynamic range. Compress.

図３に実施例２の具体的構成例を示す。図１を用いて説明した実施例１と比べて、第１の圧縮関数制御部１０ｍと第１の圧縮関数適用部１２ｍの具体的構成が一部変更され、第２の圧縮関数制御部１３５ｍと第２の圧縮関数適用部１２５ｍとなっている。また同一機能構成部分には同一参照番号をつける。
第１の圧縮判定部１３４の判定がＴＸ＞ＡＸの場合、このことを示す判定結果出力が第２の圧縮関数制御部１３５ｍに入力される。
図４に第２の圧縮関数制御部１３５ｍと第２の圧縮関数適用部１２５ｍの詳細例と、これに関連する部分の図を示す。同一機能構成部分には同一参照番号をつける。 FIG. 3 shows a specific configuration example of the second embodiment. Compared to the first embodiment described with reference to FIG. 1, the specific configurations of the first compression function control unit 10 m and the first compression function application unit 12 m are partially changed, and the second compression function control unit 135 m The second compression function application unit 125m is provided. The same reference numerals are assigned to the same functional components.
When the determination of the first compression determination unit 134 is TX> AX, a determination result output indicating this is input to the second compression function control unit 135m.
FIG. 4 shows a detailed example of the second compression function control unit 135m and the second compression function application unit 125m, and a diagram of parts related thereto. The same reference numerals are assigned to the same functional components.

第１の圧縮判定部１３４は第１の判定部１３６と第３の判定部１３７により構成され、第２の圧縮関数適用部１２５ｍは切替スイッチ１０８ｍ、切替スイッチ１０７ｍ、第２の圧縮関数演算部１３２ｍ、位相付与部１２６ｍ、位相計算部１２８ｍにより構成され、入力部１４はＴＸ入力部１４００、ＤＸ入力部１４０４、α入力部１４０２、β入力部１４０６により構成され、第２の圧縮関数制御部１３５ｍは第４の判定部１３８ｍで構成されている。
まず第１の判定部１３６は電力総和平方根計算部４よりの電力総和の平方根ＡＸとＴＸ入力部１４００よりの所望の振幅ＴＸと値の大小を比較し、この結果が（１）ＡＸ≦ＤＸ（２）ＤＸ＜ＡＸ≦ＴＸ（３）ＴＸ＜ＡＸの３状態を判定する３状態判定回路１４４に入力される。 The first compression determination unit 134 includes a first determination unit 136 and a third determination unit 137, and the second compression function application unit 125m includes a changeover switch 108m, a changeover switch 107m, and a second compression function calculation unit 132m. The phase adding unit 126m and the phase calculating unit 128m. The input unit 14 includes a TX input unit 1400, a DX input unit 1404, an α input unit 1402, and a β input unit 1406. The second compression function control unit 135m It is comprised by the 4th determination part 138m.
First, the first determination unit 136 compares the square root AX of the power sum from the power sum square root calculation unit 4 with the desired amplitude TX from the TX input unit 1400 and the magnitude of the value, and the result is (1) AX ≦ DX ( 2) DX <AX ≦ TX (3) Input to the three-state determination circuit 144 that determines three states of TX <AX.

ＤＸ入力部１４０４より入力された第２の目標振幅ＤＸは第３の判定部１３７と第２の圧縮関数演算部１３２ｍに入力される。第３の判定部１３７は電力総和平方根計算部４よりの電力総和の平方根ＡＸと第２の目標振幅ＤＸとの値の大小を比較し、この結果が３状態判定回路１４４に入力される。ただしＤＸ≦ＴＸであり、ＤＸはＴＸの約２分の１程度が好ましい。３状態判定回路１４４の出力が（１）の場合、全てのグループの切替スイッチ１０８ｍが固定接点１０８２ｍ側に切り替えられ、（３）の場合は、固定接点１０８１ｍに切り替えられる。また、（２）の場合は、入力信号は抑圧されずに、出力される。固定接点１０８１ｍ側に切り替えられた場合は、振幅絶対値化部１０１ｍよりの対応グループｍの各離散周波数領域信号の振幅｜ｘ（ωｎ）｜が第１の圧縮関数適用部１２ｍ内の切替スイッチ１０６ｍの固定接点１０６１ｍへ供給される。つまりこの場合は振幅｜ｘ（ωｎ）｜は抑圧圧縮、強調圧縮されることはない。 The second target amplitude DX input from the DX input unit 1404 is input to the third determination unit 137 and the second compression function calculation unit 132m. The third determination unit 137 compares the values of the square root AX of the power sum from the power sum square root calculation unit 4 and the second target amplitude DX, and the result is input to the three-state determination circuit 144. However, DX ≦ TX, and DX is preferably about one-half of TX. When the output of the three-state determination circuit 144 is (1), the selector switches 108m of all groups are switched to the fixed contact 1082m side, and in the case of (3), the switches are switched to the fixed contact 1081m. In the case of (2), the input signal is output without being suppressed. When switched to the fixed contact 1081m side, the amplitude | x (ωn) | of each discrete frequency domain signal of the corresponding group m from the amplitude absolute value converting unit 101m is changed to the changeover switch 106m in the first compression function applying unit 12m. To the fixed contact 1061m. That is, in this case, the amplitude | x (ωn) | is not subjected to suppression compression or enhancement compression.

第４の判定部１３８ｍでは、グループ振幅平均計算部８ｍよりの振幅平均ＡＨｍと振幅絶対値化部１０１ｍよりの振幅｜ｘ（ωｎ）｜との大小が比較され、振幅｜ｘ（ωｎ）｜の方が大きい場合、切替スイッチ１０７ｍが、固定接点１０７２ｍ側に切り替えられ、振幅｜ｘ（ωｎ）｜の方が大きくない場合、固定接点１０７１ｍ側に切り替えられる。切替スイッチが１０７２ｍに切り替えられている場合、すなわち、ＤＸ＞ＡＸ、｜ｘ（ωｎ）｜＞ＡＨｍ、を満たす場合は、離散周波数領域信号の振幅｜ｘ（ωｎ）｜は第２の圧縮関数演算部１３２ｍで、第２の圧縮関数が適用される。第２の圧縮関数は例えば、次式で表せる。 In the fourth determination unit 138m, the magnitude of the amplitude average AHm from the group amplitude average calculation unit 8m and the amplitude | x (ωn) | from the amplitude absolute value conversion unit 101m are compared, and the amplitude | x (ωn) | If it is larger, the changeover switch 107m is switched to the fixed contact 1072m side, and if the amplitude | x (ωn) | is not larger, it is switched to the fixed contact 1071m side. When the changeover switch is switched to 1072 m, that is, when DX> AX and | x (ωn) |> AHm are satisfied, the amplitude | x (ωn) | of the discrete frequency domain signal is calculated by the second compression function calculation. In part 132m, the second compression function is applied. For example, the second compression function can be expressed by the following equation.

β｜ｘ（ωｎ）｜＋（１−β）ＡＨｍ・ＤＸ／ＡＸ
ここで、βはβ入力部１４０６より入力され、βは強調の程度を決定する０〜１の範囲の実数であり、小さな値を与えるほど、入力された振幅｜ｘ（ωｎ）｜と比べ、より大きく強調される。なお、βは０．２〜０．５であることが望ましい。その演算結果は位相付与部１２６ｍに入力される。固定接点１０７１ｍより位相付与部１２６ｍに入力され、切替スイッチ１０７ｍが固定接点１０７１ｍ側に接続されている場合は、振幅｜ｘ（ωｎ）｜は位相付与部１２６ｍへ直接入力される。 β | x (ωn) | + (1-β) AHm · DX / AX
Here, β is input from the β input unit 1406, β is a real number in the range of 0 to 1 that determines the degree of emphasis, and the smaller the value, the larger the input amplitude | x (ωn) | Greater emphasis. Note that β is preferably 0.2 to 0.5. The calculation result is input to the phase applying unit 126m. When the fixed contact 1071m is input to the phase applying unit 126m and the changeover switch 107m is connected to the fixed contact 1071m side, the amplitude | x (ωn) | is directly input to the phase applying unit 126m.

つまり、第２の圧縮関数適用部１２５ｍでは、そのグループの各離散周波数領域信号の振幅｜ｘ（ωｎ）｜に対し、以下に示す（式２）で制御を行う。
（ａ）｜ｘ（ωｎ）｜＞ＡＨｍのとき
Ｆｍ（｜ｘ（ωｎ）｜）＝β｜ｘ（ωｎ）｜＋（１−β）ＡＨｍ・ＤＸ／ＡＸ
（ｂ）それ以外のとき
Ｆｍ（｜ｘ（ωｎ）｜）＝｜ｘ（ωｎ）｜（式２）
つまり、第２の圧縮関数制御部１３５ｍではグループごとのその各離散周波数領域信号の振幅がそのグループの平均振幅ＡＨｍに基づき強調するか否かの判定が行われており、その判定結果により、第２の圧縮関数適用部１２５ｍで、振幅｜ｘ（ωｎ）｜に対して強調するか、そのままにするかの制御が行われる。 That is, the second compression function application unit 125m controls the amplitude | x (ωn) | of each discrete frequency domain signal of the group by the following (Equation 2).
When (a) | x (ωn) |> AHm, Fm (| x (ωn) |) = β | x (ωn) | + (1-β) AHm · DX / AX
(B) Otherwise, Fm (| x (ωn) |) = | x (ωn) | (Formula 2)
That is, the second compression function control unit 135m determines whether or not the amplitude of each discrete frequency domain signal for each group is enhanced based on the average amplitude AHm of the group. The compression function applying unit 125m 2 controls whether the amplitude | x (ωn) | is emphasized or left as it is.

第２の圧縮関数適用部１２５ｍでのその後の圧縮は実施例１と同様に強調処理された信号もそのままの信号にも位相∠ｘ（ωｎ）が付与されて、時間領域変換部１６に入力される。
図５ＡにＡＸ＞ＴＸを満たすフレームについての（式１）による抑圧処理の特性を示し、図５ＢにＡＸ＞ＴＸを満たさないフレームについての（式２）による強調処理の特性を示す。図５（Ａ）、図５（Ｂ）とも、は縦軸を出力されるＦｍ（｜ｘ（ωｎ）｜）とし、横軸を入力される｜ｘ（ωｎ）｜とする。 Subsequent compression in the second compression function application unit 125m is performed by adding the phase ∠x (ωn) to the signal subjected to enhancement processing as it is in the same manner as in the first embodiment and is input to the time domain conversion unit 16. The
FIG. 5A shows the characteristics of the suppression process according to (Expression 1) for a frame satisfying AX> TX, and FIG. 5B shows the characteristics of the enhancement process according to (Expression 2) for a frame not satisfying AX> TX. In both FIGS. 5A and 5B, the vertical axis is Fm (| x (ωn) |) that is output, and the horizontal axis is | x (ωn) | that is input.

図５（Ａ）において、上述の（式１）により、｜ｘ（ωｎ）｜＞ＡＨｍ・ＴＸ／ＡＸの領域（図５（Ａ）中で抑圧領域と示している）では、
Ｆｍ（｜ｘ（ωｎ）｜）＝α｜ｘ（ωｎ）｜＋（１−α）ＡＨｍ・ＴＸ／ＡＸにより抑圧されており、｜ｘ（ωｎ）｜≦ＡＨｍ・ＴＸ／ＡＸでは、
Ｆｍ（｜ｘ（ωｎ）｜）＝｜ｘ（ωｎ）｜となるので何ら抑圧されていない。
図５（Ｂ）において、上述の（式２）により｜ｘ（ωｎ）｜＞ＡＨｍの領域（図５（Ｂ）では強調領域と示している）では、
Ｆｍ（｜ｘ（ωｎ）｜）＝β｜ｘ（ωｎ）｜＋（１−β）ＡＨｍ・ＤＸ／ＡＸ
により強調されており、｜ｘ（ωｎ）｜≦｜ＡＨｍでは、
Ｆｍ（｜ｘ（ωｎ）｜）＝｜ｘ（ωｎ）｜となるので何ら強調されていない。
図５（Ａ）において、（式１）による抑圧処理においてはＴＸ／ＡＸで規格化された規格化平均値ＡＨｍ・ＴＸ／ＡＸを｜ｘ（ωｎ）｜についての抑圧の下限値としているが、（式２）による強調においては、強調の下限値をＡＨｍにより、与えている。これは、信号を抑圧する場合は、要求される抑圧の程度により、ＡＨｍより小さい信号も抑圧する必要が生じるのに対し、強調する場合には、平均値ＡＨｍより小さい信号の中に含まれると考えられる雑音成分などの不要な増幅を避ける意図がある。また図５（Ｂ）に示す（式２）の特性において、
｜ｘ（ωｎ）｜＞ＡＨｍ・ＴＸ／ＡＸの範囲では、逆に抑圧する効果を与えてしまうが、第１の目標振幅ＴＸを第２の目標振幅ＤＸと近い値に選んだ場合は、逆に（式１）の特性との連続性が保たれることになる。 In FIG. 5A, according to the above (Equation 1), in the region of | x (ωn) |> AHm · TX / AX (shown as the suppression region in FIG. 5A),
Fm (| x (ωn) |) = α | x (ωn) | + (1-α) AHm · TX / AX is suppressed, and | x (ωn) | ≦ AHm · TX / AX,
Since Fm (| x (ωn) |) = | x (ωn) |, no suppression is performed.
In FIG. 5B, in the region of | x (ωn) |> AHm (shown as an emphasized region in FIG. 5B) according to the above (Equation 2),
Fm (| x (ωn) |) = β | x (ωn) | + (1-β) AHm · DX / AX
And | x (ωn) | ≦ | AHm,
Since Fm (| x (ωn) |) = | x (ωn) |, it is not emphasized at all.
In FIG. 5A, in the suppression processing according to (Equation 1), the normalized average value AHm · TX / AX normalized by TX / AX is set as the lower limit value of suppression for | x (ωn) | In the emphasis by (Expression 2), the lower limit value of emphasis is given by AHm. When suppressing a signal, it is necessary to suppress a signal smaller than AHm depending on the required degree of suppression. On the other hand, when emphasizing, it is included in a signal smaller than the average value AHm. The intention is to avoid unnecessary amplification such as possible noise components. In the characteristic of (Equation 2) shown in FIG.
In the range of | x (ωn) |> AHm · TX / AX, the effect of suppressing the reverse is provided. However, when the first target amplitude TX is selected to be close to the second target amplitude DX, the reverse is achieved. Therefore, continuity with the characteristic of (Equation 1) is maintained.

また図３中の第２の圧縮関数制御部１３５ｍに雑音レベル推定部１４２ｍを具備してもよい。この場合、例えば、図４中の第２の圧縮関数制御部１３５ｍ内に破線で示すように、雑音レベル推定部１４２ｍが設けられ、更に、第５の判定部１４０ｍ、も設けられる。グループごとに周波数領域信号ｘ（ωｎ）は雑音レベル推定部１４２ｍに入力され、強調不要な雑音成分の大きさの最大値もしくは平均値に１より大きい定数を乗算した雑音レベルＮＬｍが推定され、この雑音レベルＮＬｍは第５の判定部１４０ｍに入力される。第５の判定部１４０ｍでは各対応グループごとにグループ振幅平均計算部８ｍよりの振幅平均ＡＨｍと雑音レベルＮＬｍとの値の大小が比較される。第５の判定部１４０ｍよりの比較結果により、振幅平均ＡＨｍと雑音レベルＮＬｍの大きい方の値が第４の判定部１３８ｍに入力され、第４の判定部１３８ｍではこの入力された大きい方の値と振幅｜ｘ（ωｎ）｜との比較が行われる。このようにすれば、雑音成分の望ましくない強調をより確実に抑えることができる。 Moreover, the noise level estimation part 142m may be provided in the 2nd compression function control part 135m in FIG. In this case, for example, as indicated by a broken line in the second compression function control unit 135m in FIG. 4, a noise level estimation unit 142m is provided, and further, a fifth determination unit 140m is also provided. The frequency domain signal x (ωn) is input to the noise level estimation unit 142m for each group, and the noise level NLm obtained by multiplying the maximum value or the average value of the noise components that do not require enhancement by a constant larger than 1 is estimated. The noise level NLm is input to the fifth determination unit 140m. The fifth determination unit 140m compares the magnitudes of the amplitude average AHm and the noise level NLm from the group amplitude average calculation unit 8m for each corresponding group. Based on the comparison result from the fifth determination unit 140m, the larger value of the amplitude average AHm and the noise level NLm is input to the fourth determination unit 138m, and the input larger value is input to the fourth determination unit 138m. And the amplitude | x (ωn) |. In this way, unwanted enhancement of noise components can be more reliably suppressed.

実施例３
次にこの発明の実施例３を説明する。実施例１、２は入力信号の振幅値を電力総和平方根により算出しているため、インパルス性信号のように、瞬間的な振幅は大きくても、エネルギーの小さい信号を効果的に抑圧できない。この実施例３では、周波数領域に変換されたパルス性信号が各周波数において、ほぼ等しい振幅を有する性質に着目し、例えば、図６に示すように構成する。全体のブロック構成としては、実施例１とほぼ同様であるが、第２の圧縮判定部１４７を設け、また第３の圧縮関数制御部１４５ｍ、第３の圧縮関数適用部１５３ｍにおける処理内容が異なる。フレームごとに、入力信号がインパルス性信号であるか否かを判定し、インパルス性信号であるフレームについて、実施例３では以下の処理を行う。 Example 3
Next, a third embodiment of the present invention will be described. In the first and second embodiments, the amplitude value of the input signal is calculated by the square root of the sum of power, and thus a signal with low energy cannot be effectively suppressed even if the instantaneous amplitude is large, such as an impulsive signal. In the third embodiment, attention is paid to the property that the pulse signal converted into the frequency domain has substantially the same amplitude at each frequency, and is configured as shown in FIG. 6, for example. The overall block configuration is substantially the same as in the first embodiment, but the second compression determination unit 147 is provided, and the processing contents in the third compression function control unit 145m and the third compression function application unit 153m are different. . For each frame, it is determined whether or not the input signal is an impulsive signal, and the following processing is performed in the third embodiment for a frame that is an impulsive signal.

図７に第３の圧縮関数制御部１４５ｍ、第３の圧縮関数適用部１５３ｍの詳細、その他関連のある部分を示す。第３の圧縮関数制御部１４５ｍは振幅絶対値化部１０１ｍ、第２の規格化平均値計算部１４８ｍ、第６の判定部１５２ｍにより構成され、第３の圧縮関数適用部１５３ｍは切替スイッチ１１０ｍ、第３の圧縮関数演算部１５４ｍ、位相付与部１２６ｍ、位相計算部１２８ｍにより構成されている。なお、図６、図７に関して、実施例１と２と同一機能構成部分には同一参照番号をつける。
離散周波数領域信号ｘ（ωｎ）は第２の圧縮判定部１４７（インパルス性信号判定部）と位相計算部１２８ｍと振幅絶対値化部１０１ｍに入力される。第２の圧縮判定部１４７でこのフレームがインパルス性信号であるか否かが判定され、その判定結果は、第６の判定部１５２ｍに入力される。また、グループ振幅平均計算部８ｍよりの平均振幅ＡＨｍが第２の規格化平均値計算部１４８ｍに入力され、電力総和平方根計算部４よりの電力総和の平方根ＡＸとＰＸ入力部１４０８よりのインパルス性信号に対する圧縮処理後の期待振幅（以下第３の目標振幅）ＰＸがそれぞれ、第２の規格化平均値計算部１４８ｍと第２の圧縮判定部１４７に入力される。なお、第３の目標振幅ＰＸは第１の目標振幅ＴＸの１／１０程度であることが望ましい。 FIG. 7 shows details of the third compression function control unit 145m, the third compression function application unit 153m, and other related parts. The third compression function control unit 145m includes an amplitude absolute value conversion unit 101m, a second normalized average value calculation unit 148m, and a sixth determination unit 152m. The third compression function application unit 153m includes a changeover switch 110m, A third compression function calculation unit 154m, a phase applying unit 126m, and a phase calculation unit 128m are included. 6 and 7, the same reference numerals are assigned to the same functional components as those in the first and second embodiments.
The discrete frequency domain signal x (ωn) is input to the second compression determination unit 147 (impulsive signal determination unit), the phase calculation unit 128m, and the amplitude absolute value conversion unit 101m. The second compression determination unit 147 determines whether or not this frame is an impulsive signal, and the determination result is input to the sixth determination unit 152m. Further, the average amplitude AHm from the group amplitude average calculation unit 8m is input to the second normalized average value calculation unit 148m, and the square root AX of the power sum from the power sum square root calculation unit 4 and the impulsiveness from the PX input unit 1408 An expected amplitude (hereinafter, third target amplitude) PX after compression processing on the signal is input to the second normalized average value calculation unit 148m and the second compression determination unit 147, respectively. The third target amplitude PX is desirably about 1/10 of the first target amplitude TX.

第２の規格化平均値計算部１４８ｍで、第２の規格化平均値ＡＨｍ・ＰＸ／ＡＸが計算され、第６の判定部１５２ｍに入力される。第６の判定部１５２ｍでは、第２の圧縮判定部（インパルス性信号判定部）１４７よりの判定結果がインパルス性信号である場合に、振幅｜ｘ（ωｎ）｜と第２の規格化平均値ＡＨｍ・ＰＸ／ＡＸの値の大小が比較される。
振幅絶対値化部１０１ｍで、振幅｜ｘ（ωｎ）｜が求められ、切替スイッチ１１０ｍと第６の判定部１５２ｍに入力される。
第６の判定部１５２ｍの判定結果が｜ｘ（ωｎ）｜＞ＡＨｍ・ＰＸ／ＡＸの場合は、切替スイッチ１１０ｍを固定接点１１０２ｍに切り替え、｜ｘ（ωｎ）｜＞ＡＨｍ・ＰＸ／ＡＸでない場合は、切替スイッチ１１０ｍを固定接点１１０１ｍに切り替える。 The second normalized average value calculation unit 148m calculates the second normalized average value AHm · PX / AX and inputs it to the sixth determination unit 152m. In the sixth determination unit 152m, when the determination result from the second compression determination unit (impulsive signal determination unit) 147 is an impulse signal, the amplitude | x (ωn) | and the second normalized average value AHm · PX / AX values are compared in magnitude.
The amplitude | x (ωn) | is obtained by the amplitude absolute value converting unit 101m and input to the changeover switch 110m and the sixth determining unit 152m.
When the determination result of the sixth determination unit 152m is | x (ωn) |> AHm · PX / AX, the changeover switch 110m is switched to the fixed contact 1102m, and | x (ωn) |> AHm · PX / AX is not satisfied Switches the changeover switch 110m to the fixed contact 1101m.

切替スイッチ１１０ｍが固定接点１１０２ｍに切り替えられている場合に振幅｜ｘ（ωｎ）｜に対してγ｜ｘ（ωｎ）｜＋（１−γ）ＡＨｍ・ＰＸ／ＡＸが第３の圧縮関数演算部１５４ｍで演算される。なおγはγ入力部１４１０により入力されるものであり、γ＝０．２〜０．５であることが好ましい。一方、切替スイッチ１１０ｍが固定接点１１０１ｍに切り替えられている場合は、振幅｜ｘ（ωｎ）｜はそのままとされる。
つまり、第３の圧縮関数適用部１５３ｍでは、そのグループの各離散周波数領域信号の振幅｜ｘ（ωｎ）｜に対し、以下に示す（式３）で制御を行う。
（ａ）｜ｘ（ωｎ）｜＞ＡＨｍ・ＰＸ／ＡＸのとき
Ｆｍ（｜ｘ（ωｎ）｜）＝γ｜ｘ（ωｎ）｜＋（１−γ）ＡＨｍ・ＰＸ／ＡＸ
（ｂ）それ以外のとき
Ｆｍ（｜ｘ（ωｎ）｜）＝｜ｘ（ωｎ）｜（式３）
つまり、第３の圧縮関数制御部ではグループごとのその各離散周波数領域信号の振幅がそのグループの第２規格化平均値ＡＨｍ・ＰＸ／ＡＸに基づき抑圧するか否かの判定が行われており、その判定結果により、第３の圧縮関数適用部１５３ｍで、振幅｜ｘ（ωｎ）｜に対して抑圧するか、そのままにするかの制御が行われる。 When the changeover switch 110m is switched to the fixed contact 1102m, γ | x (ωn) | + (1-γ) AHm · PX / AX is the third compression function calculation unit with respect to the amplitude | x (ωn) | It is calculated at 154m. Note that γ is input by the γ input unit 1410, and preferably γ = 0.2 to 0.5. On the other hand, when the changeover switch 110m is switched to the fixed contact 1101m, the amplitude | x (ωn) | is left as it is.
That is, the third compression function application unit 153m controls the amplitude | x (ωn) | of each discrete frequency domain signal of the group by the following (Equation 3).
When (a) | x (ωn) |> AHm · PX / AX, Fm (| x (ωn) |) = γ | x (ωn) | + (1-γ) AHm · PX / AX
(B) Otherwise, Fm (| x (ωn) |) = | x (ωn) | (Formula 3)
That is, the third compression function controller determines whether or not the amplitude of each discrete frequency domain signal for each group is suppressed based on the second normalized average value AHm · PX / AX of the group. Based on the determination result, the third compression function applying unit 153m controls whether the amplitude | x (ωn) | is suppressed or left as it is.

これら処理された振幅Ｆｍ（｜ｘ（ωｎ）｜）に対し、実施例１、２と同様に位相∠ｘ（ωｎ）を付与して、時間領域変換部１６に入力する。なお、位相∠ｘ（ωｎ）の付与は、ｘ（ωｎ）／｜ｘ（ωｎ）｜をＦｍ（｜ｘ（ωｎ）｜）に乗算して行っても良い。このことは実施例２についても同様である。
図８に第２の圧縮判定部１４７の具体的構成例を示す。第２の圧縮判定部１４７は例えば、全体振幅平均算出部１４４０、全体最大振幅検出部１４４２、第７の判定部１４４５、第８の判定部１４４４、アンド回路１４４６により構成されている。 A phase ∠x (ωn) is given to the processed amplitude Fm (| x (ωn) |) in the same manner as in the first and second embodiments, and is input to the time domain conversion unit 16. The phase ∠x (ωn) may be given by multiplying x (ωn) / | x (ωn) | by Fm (| x (ωn) |). The same applies to the second embodiment.
FIG. 8 shows a specific configuration example of the second compression determination unit 147. The second compression determination unit 147 includes, for example, an overall amplitude average calculation unit 1440, an overall maximum amplitude detection unit 1442, a seventh determination unit 1445, an eighth determination unit 1444, and an AND circuit 1446.

電力総和平方根計算部４よりの電力総和の平方根ＡＸとＰＸ入力部１４０８よりの第３の目標振幅ＰＸが第７の判定部１４４５に入力され、電力総和の平方根ＡＸと第３の目標振幅ＰＸの値の大小が判定される。この判定結果がアンド回路１４４６に入力される、
また、全帯域の離散周波数領域信号ｘ（ωｎ）が全体振幅平均算出部１４４０と全体最大振幅検出部１４４２に入力され、全体振幅平均算出部１４４０で、全帯域の離散周波数領域信号ｘ（ωｎ）の振幅平均ｘ（ωｎ）_Ａが算出され、全体最大振幅検出部１４４２で、全帯域の離散周波数領域信号ｘ（ωｎ）の最大振幅ｘ（ωｎ）_Ｍが算出される。振幅平均ｘ（ωｎ）_Ａと最大振幅ｘ（ωｎ）_Ｍがそれぞれ、第８の判定部１４４４に入力され、第８の判定部１４４４で振幅平均ｘ（ωｎ）_Ａと最大振幅ｘ（ωｎ）_Ｍの差が所定の範囲内に収まる場合、例えば、振幅平均ｘ（ωｎ）_Ａが最大振幅ｘ（ωｎ）_Ｍの定数ε倍よりも大きい場合で、かつ第７の判定部１４４５で、ＰＸ＜ＡＸと判定されれば第２の圧縮判定部１４７はインパルス性信号であると判定する。なお、定数εは０〜１の実数であり、例えば約０．９であることが望ましい。なお、当該フレームの全サンプルの振幅を平均して振幅平均ｘ（ωｎ）_Ａとしてもよい。 The square root AX of the power sum from the power sum square root calculation unit 4 and the third target amplitude PX from the PX input unit 1408 are input to the seventh determination unit 1445, and the square root AX of the power sum and the third target amplitude PX The magnitude of the value is determined. The determination result is input to the AND circuit 1446.
Also, the discrete frequency domain signal x (ωn) of the entire band is input to the overall amplitude average calculating unit 1440 and the overall maximum amplitude detecting unit 1442, and the overall amplitude average calculating unit 1440 performs the discrete frequency domain signal x (ωn) of the entire band. The average amplitude x (ωn) _A is calculated, and the maximum maximum amplitude detection unit 1442 calculates the maximum amplitude x (ωn) _M of the discrete frequency domain signal x (ωn) of the entire band. The amplitude average x (ωn) _A and the maximum amplitude x (ωn) _M are respectively input to the eighth determination unit 1444, and the eighth determination unit 1444 uses the amplitude average x (ωn) _A and the maximum amplitude x (ωn) _M. Is within a predetermined range, for example, when the amplitude average x (ωn) _A is larger than the constant amplitude ε times the maximum amplitude x (ωn) _M , and the seventh determination unit 1445 determines that PX <AX If determined, the second compression determination unit 147 determines that the signal is an impulsive signal. The constant ε is a real number from 0 to 1, and is preferably about 0.9, for example. The amplitudes of all samples in the frame may be averaged to obtain an amplitude average x (ωn) _A.

実施例３により、インパルス性信号のように時間波形の振幅が時間的に過大であるにもかかわらず、周波数領域に変換後、各周波数に分散してしまうため、上記の実施例１の実施のみでは、十分な抑圧が困難な場合においても、抑圧を可能とする効果を得ることが出来る。この実施例３をまず適用することにより、実施例１のみでは十分な抑圧が困難な場合において、抑圧が可能である。
実施例３を理解しやすいように、図６、図７について、独立的に記載した。しかし実施例３は実際には、実施例１、実施例２と共に併用することが好ましい。この場合の実施例３の処理の流れを、図９を参照しながら説明する。まず第２の圧縮判定部１４７で、インパルス性信号であるか否かを判定し（Ｓ２０）、インパルス性信号であれば、当該フレームについては上述のように第３の圧縮関数適用部１５３ｍにより、（式３）で処理し、（Ｓ２２）、時間領域変換部１６へ出力する。一方ステップＳ２０でインパルス性信号でないと判定されたフレームについては、第１の判定部１３６により電力総和の平方根ＡＸと第１の目標振幅ＴＸの大小を判定し（Ｓ２４）、ＡＸ＞ＴＸの場合は第１の圧縮関数適用部１２ｍにより、（式１）で処理し（Ｓ２６）、時間領域変換部１６へ出力する。一方、ステップＳ２４でＡＸ＞ＴＸが成り立たないフレームについては、第３の判定部１３７で第２の目標振幅ＤＸと電力総和の平方根ＡＸの大小を判定し（Ｓ２８）、ＤＸ＞ＡＸであるならば、そのフレームに対し、第２の圧縮関数適用部１２５ｍにより（式２）で処理し（Ｓ３０）、時間領域変換部１６へ出力する。Ｓ２８でＤＸ＞ＡＸでないフレームについては、全グループの離散周波数領域信号を時間領域変換部１６へそのまま出力する（Ｓ３２）。 Although the amplitude of the time waveform is excessive in time as in the case of the impulsive signal according to the third embodiment, it is dispersed into each frequency after being converted into the frequency domain, so only the implementation of the first embodiment described above. Then, even when sufficient suppression is difficult, an effect of enabling suppression can be obtained. By applying the third embodiment first, it is possible to perform the suppression in the case where it is difficult to sufficiently suppress the first embodiment alone.
For easy understanding of Example 3, FIGS. 6 and 7 are described independently. However, in practice, Example 3 is preferably used in combination with Example 1 and Example 2. The processing flow of the third embodiment in this case will be described with reference to FIG. First, the second compression determination unit 147 determines whether or not the signal is an impulsive signal (S20). If the signal is an impulsive signal, the third compression function applying unit 153m as described above for the frame, Processing is performed using (Expression 3) (S22), and output to the time domain conversion unit 16. On the other hand, for the frame determined not to be an impulsive signal in step S20, the first determination unit 136 determines the magnitude of the square root AX of the power sum and the first target amplitude TX (S24), and if AX> TX. The first compression function application unit 12m performs processing according to (Expression 1) (S26) and outputs the result to the time domain conversion unit 16. On the other hand, for frames in which AX> TX does not hold in step S24, the third determination unit 137 determines the magnitude of the second target amplitude DX and the square root AX of the power sum (S28), and if DX> AX. The frame is processed by (Expression 2) by the second compression function application unit 125m (S30) and output to the time domain conversion unit 16. For frames that are not DX> AX in S28, the discrete frequency domain signals of all groups are output as they are to the time domain transform unit 16 (S32).

また、ステップＳ２０でインパルス性信号でないと判定されたフレームについては、以下の順序も考えられる。つまり、図９中に破線ブロックＢ内に示すように、第１の圧縮判定部１３４で第２の目標振幅ＤＸと電力総和の平方根ＡＸの大小をそれぞれ判定し（Ｓ３４）、ＤＸ＞ＡＸを満たすフレームについて、第２の圧縮関数適用部１２５ｍで処理して（Ｓ３６）時間領域変換部１６へ出力する。ステップＳ３４でＤＸ＞ＡＸを満たさないフレームについては、第１の判定部１３６で、第１の目標振幅ＴＸと電力総和の平方根ＡＸの大小を判定し（Ｓ３８）、ＡＸ＞ＴＸを満たすフレームについては第１の圧縮関数適用部１２ｍにより処理して（Ｓ４０）、時間領域変換部１６へ出力する。ステップＳ３８でＡＸ＞ＴＸを満たさないフレームについては、時間領域変換部１６にそのまま出力される（Ｓ４２）。 Moreover, the following order is also considered about the frame determined not to be an impulsive signal in step S20. That is, as shown in the broken line block B in FIG. 9, the first compression determination unit 134 determines the magnitude of the second target amplitude DX and the square root AX of the power sum (S34), and satisfies DX> AX. The frame is processed by the second compression function application unit 125m (S36) and output to the time domain conversion unit 16. For frames that do not satisfy DX> AX in step S34, the first determination unit 136 determines the magnitude of the first target amplitude TX and the square root AX of the power sum (S38), and for frames that satisfy AX> TX. Processing is performed by the first compression function application unit 12m (S40), and the result is output to the time domain conversion unit 16. Frames that do not satisfy AX> TX in step S38 are output to the time domain conversion unit 16 as they are (S42).

また、実施例３を実施する場合、図１０に示す処理手順も考えられる。予め第２の目標振幅ＤＸと第３の目標振幅ＰＸの大小を比較し、ＰＸ≧ＤＸを満たす場合は、第１の圧縮判定部１３４で、第２の目標振幅ＤＸと電力総和の平方根ＡＸの大小をそれぞれ判定し（Ｓ４６）、ＤＸ＞ＡＸを満たすフレームについて、第２の圧縮関数適用部１２５ｍで処理して（Ｓ５８）、時間領域変換部１６へ出力する。
一方、ステップＳ４６でＤＸ＞ＡＸを満たさないフレームについては、第２の圧縮判定部１４７でインパルス性信号であるか否かを判定する（Ｓ４８）。インパルス性信号であれば、そのフレームに対し第３の圧縮関数適用部１５３ｍで処理して（Ｓ５４）、時間領域変換部１６に入力される。ステップＳ４８でインパルス性信号でないと判定されると、第１の判定部１３６で第１の目標振幅ＴＸと電力総和の平方根ＡＸの大小を判定し、ＡＸ＞ＴＸと判定されれば、そのフレームに対し、第１の圧縮関数適用部１２ｍで処理して（Ｓ５６）、時間領域変換部１６へ出力する。ステップＳ５０でＡＸ＞ＴＸでないと判定されたフレームについては、時間領域変換部１６にそのまま出力される（Ｓ５２）。 Moreover, when Example 3 is implemented, the process sequence shown in FIG. 10 can also be considered. The second target amplitude DX and the third target amplitude PX are compared in advance, and when PX ≧ DX is satisfied, the first compression determination unit 134 determines whether the second target amplitude DX and the square root AX of the power sum are Each frame is determined (S46), and a frame satisfying DX> AX is processed by the second compression function application unit 125m (S58) and output to the time domain conversion unit 16.
On the other hand, for a frame that does not satisfy DX> AX in step S46, the second compression determination unit 147 determines whether or not it is an impulsive signal (S48). If it is an impulsive signal, the frame is processed by the third compression function application unit 153m (S54) and input to the time domain conversion unit 16. If it is determined in step S48 that the signal is not an impulsive signal, the first determination unit 136 determines the magnitude of the first target amplitude TX and the square root AX of the power sum, and if it is determined that AX> TX, On the other hand, the first compression function application unit 12m performs processing (S56) and outputs the result to the time domain conversion unit 16. Frames determined not to have AX> TX in step S50 are output to the time domain conversion unit 16 as they are (S52).

この発明の実施例１の構成例を示すブロック図。1 is a block diagram showing a configuration example of Embodiment 1 of the present invention. この発明の実施例１の第１の圧縮関数制御部１０ｍと第１の圧縮関数適用部１２ｍの詳細例と、これに関連する部分のブロック図。FIG. 3 is a detailed example of a first compression function control unit 10m and a first compression function application unit 12m according to the first embodiment of the present invention, and a block diagram of parts related thereto. この発明の実施例２の構成例を示すブロック図。The block diagram which shows the structural example of Example 2 of this invention. この発明の実施例２の第２の圧縮関数適用部１２５ｍと第２の圧縮関数制御部１３５ｍの詳細例と、これに関連する部分のブロック図。The detailed example of the 2nd compression function application part 125m and the 2nd compression function control part 135m of Example 2 of this invention, and the block diagram of the part relevant to this. （Ａ）は第１の圧縮関数適用部１２ｍによる（式１）の圧縮関数による特性であり、（Ｂ）は第２の圧縮関数適用部１２５ｍによる（式２）の圧縮関数による特性である。(A) is the characteristic by the compression function of (Formula 1) by the 1st compression function application part 12m, (B) is the characteristic by the compression function of (Formula 2) by the 2nd compression function application part 125m. この発明の実施例３の構成例を示すブロック図。The block diagram which shows the structural example of Example 3 of this invention. この発明の実施例３の第３の圧縮関数適用部１５３ｍと第３の圧縮関数制御部１４５ｍの詳細例と、これに関連する部分のブロック図。The detailed example of the 3rd compression function application part 153m of Example 3 of this invention and the 3rd compression function control part 145m, and the block diagram of the part relevant to this. 実施例３の第２の圧縮判定部１４７の具体的構成例。9 shows a specific configuration example of a second compression determination unit 147 according to the third embodiment. 実施例１〜３を組み合わせて使用する際のフローチャート図。The flowchart figure at the time of using combining Examples 1-3. 実施例１〜３を組み合わせて使用する際の他のフローチャート図。The other flowchart figure at the time of using combining Examples 1-3.

Claims

A frequency domain transform unit for transforming a discrete time input signal into a discrete frequency domain signal for each frame;
A group divider that divides the discrete frequency domain signal into a plurality of groups such that at least one group includes a plurality of discrete frequency domain signals;
A power sum square root calculation unit for obtaining the square root of the power sum of the input signals in the frame;
For each of the divided groups, a group amplitude average calculation unit that calculates an amplitude average of discrete frequency domain signals in the group;
A first determination unit that compares magnitudes of values of an expected desired first target amplitude after compression processing and the square root of the power sum;
A signal indicating that the square root of the power sum is larger than that of the first determination unit, and an amplitude average value for each group is calculated by a ratio between the first target amplitude and the square root of the power sum. Normalize and obtain the first normalized average value by the first normalized average value calculation unit,
For each group, a first compression function control unit that compares the first normalized average value and the amplitude of each discrete frequency domain signal of the group by a second determination unit;
If the second determination unit determines that the amplitude of the discrete frequency domain signal is larger, the first compression function calculation unit applies a predetermined first compression function to the amplitude. If the second determination unit determines that the amplitude of the discrete frequency domain signal is not larger, the discrete frequency domain signal is suppressed so as to approach the first target amplitude. A first compression function application unit for each group,
A time domain transforming unit that transforms the entire discrete frequency domain signal output from the first compression function applying unit for each group into a time domain signal;
A speech processing apparatus comprising:

The speech processing apparatus according to claim 1, wherein
A signal indicating that the square root of the total power from the first determination unit is not larger is input,
A first compression determination unit that compares the input desired amplitude to be emphasized (hereinafter referred to as a second target amplitude) and the power sum square root with a third determination unit;
A second compression function control unit that compares the amplitude average value of each group and the amplitude of each frequency signal of the corresponding group by a fourth determination unit;
When the determination by the first determination unit is greater in the first target amplitude, the determination by the third determination unit is greater in the second target amplitude, and the fourth determination unit If the amplitude of the frequency signal is greater than
Applying a predetermined second compression function to the amplitude in the second compression function calculation unit, emphasizing it so as to approach the second target amplitude, and outputting it to the time domain conversion unit,
If the determination of the fourth determination unit determines that the amplitude of each frequency signal is not greater, the second compression function application unit outputs the amplitude as it is to the time domain conversion unit. Voice processing device.

The speech processing apparatus according to claim 2, wherein
The second compression function control unit is configured to estimate a noise component estimation unit that estimates a value obtained by multiplying a maximum value or an average value of noise levels by a constant larger than 1 for each group;
A fifth determination unit that compares the noise level and the group amplitude average value;
If the fifth determination unit determines that the noise level is higher, the speech processing apparatus uses the noise level instead of the group amplitude average value.

The speech processing apparatus according to any one of claims 1 to 3,
A second compression determination unit that determines for each frame whether or not the input signal is an impulsive signal;
When the second compression determination unit determines that the discrete frequency domain signal is an impulsive signal, the operation of the first compression function control unit is stopped, and for each group of the frame, The ratio of the desired amplitude after compression processing (referred to as the third target amplitude) and the square root of the power sum is normalized, and the second normalized average value calculation unit calculates the second amplitude average value. Find the normalized average value,
For each group, a third compression function control unit that compares the second normalized average value and the amplitude of each discrete frequency domain signal of the group with a sixth determination unit;
If the determination by the sixth determination unit is greater in the amplitude, a third compression function is applied to the discrete frequency domain signal by a third function calculation unit, and the third desired value is obtained. To the time domain conversion unit, and output to the time domain conversion unit. If the determination by the sixth determination unit is not larger than the amplitude, the discrete frequency domain signal is output to the time domain conversion unit as it is. A third compression function application unit that
A speech processing apparatus comprising:

The speech processing apparatus according to claim 4, wherein
The second compression determination unit
A seventh determination unit that compares the third target amplitude and the square root of the power sum;
An overall amplitude average calculator for calculating an average amplitude of the input signal of the frame;
An overall maximum amplitude detector for detecting a maximum amplitude for all frequencies of the discrete frequency domain signal;
An eighth determination unit that determines whether the difference between the overall average amplitude value and the maximum amplitude is within a predetermined range;
If the square root of the power sum is larger in the seventh determination unit and falls within the range in the eighth determination unit, it is determined that the input signal of the frame is an impulsive signal. A voice processing device.

A frequency conversion procedure for converting a discrete time input signal into a discrete frequency domain signal for each frame;
A group division procedure for dividing the discrete frequency domain signal into a plurality of groups so that at least one group includes a plurality of discrete frequency domain signals;
A power sum square root calculation procedure for obtaining the square root of the power sum of the input signals in the frame;
For each of the divided groups, a group amplitude average calculation procedure for calculating the average amplitude of the discrete frequency domain signals in the group;
A first determination procedure for comparing the magnitude of a value of an expected desired first target amplitude after the compression processing and the average of the power sum;
In the determination of the first determination procedure, the square root of the power sum is larger, and the average amplitude value for each group is normalized by the ratio of the desired first target amplitude and the square root of the power sum. Find the first normalized average value,
For each group, a second determination procedure that compares the first normalized average value with the amplitude of each discrete frequency domain signal of the group;
If the determination result of the second determination procedure indicates that the amplitude of the discrete frequency domain signal is larger, a predetermined first compression function is applied to the amplitude to determine the first target. A first compression function application procedure that suppresses the amplitude so as to approach the amplitude and leaves the discrete frequency domain signal as it is if the determination of the second determination procedure is not greater than the amplitude of the discrete frequency domain signal. When,
A time domain conversion procedure for converting the entire discrete frequency domain signal processed by the first compression function application procedure for each group into a time domain signal;
A voice processing method characterized by comprising:

The voice processing method according to claim 6.
A third determination procedure for comparing the magnitude of the input desired amplitude to be emphasized (hereinafter referred to as a second target amplitude) and the total square root of the power;
A fourth determination procedure for comparing the amplitude average value of each group with the amplitude of each frequency signal of the corresponding group;
When the determination result of the first determination procedure is larger than the first target amplitude, the determination result of the third determination procedure is that the second target amplitude is large, and the fourth determination is performed. If the result of the procedure is greater than the amplitude of the frequency signal,
Applying a predetermined second compression function to the amplitude to emphasize the amplitude so as to approach the second target amplitude,
If the determination result of the fourth determination procedure determines that the amplitude of each frequency signal in the corresponding group is not greater, a second compression function application procedure that leaves the amplitude as it is;
And a procedure of converting the entire discrete frequency domain signal processed by the second compression function application procedure for each group into a time domain signal.

The voice processing method according to claim 7,
A noise component estimation procedure for estimating a value obtained by multiplying a maximum value or an average value of noise levels by a constant larger than 1 for each group;
A fifth determination procedure for comparing the noise level with the group amplitude average value,
If the determination result of the fifth determination procedure determines that the noise level is larger, the fourth determination procedure uses the noise level instead of the group amplitude average value. .

The voice processing method according to any one of claims 6 to 8,
A second compression determination procedure for determining whether or not the input signal is an impulsive signal for each frame;
When the discrete frequency domain signal is determined to be an impulsive signal by the second compression determination procedure, the compression processing for the impulsive signal is performed for each group of the frame without performing the first compression function application procedure. The ratio of the later desired amplitude (referred to as the third target amplitude) and the square root of the power sum is normalized to obtain the second normalized average value.
For each group, a sixth determination procedure for comparing the magnitude of the second normalized average value and the amplitude of each discrete frequency domain signal of the group;
If the determination result of the sixth determination procedure is larger than the amplitude, a third compression function is applied to the discrete frequency domain signal by the third function calculation unit, and the third desired vibration is applied. A third compression function application procedure that suppresses the value so as to approach the value and leaves the discrete frequency domain signal as it is if the determination result of the sixth determination procedure is not greater than the amplitude;
A procedure for converting the entire discrete frequency domain signal processed by the third compression function application procedure for each group into a time domain signal;
A voice processing method characterized by comprising:

The voice processing method according to claim 9.
The second compression determination procedure is as follows.
A seventh determination procedure for comparing the magnitudes of the third target amplitude and the square root of the power sum;
An overall amplitude average calculation procedure for calculating an average amplitude value of the input signal of the frame;
An overall maximum amplitude detection procedure for detecting a maximum amplitude for all frequencies of the discrete frequency domain signal;
An eighth determination procedure for determining whether or not the difference between the overall amplitude average value and the maximum amplitude is within a predetermined range;
The input signal of the frame in which the determination result of the seventh determination procedure is larger in the square root of the total power and the determination result of the eighth determination procedure is within a predetermined range is an impulse signal A speech processing method characterized by determining that

A voice processing program for causing a computer to execute each procedure of the voice processing method according to claim 6.

The computer-readable recording medium which recorded the audio | voice processing program of Claim 11.