JP3426871B2

JP3426871B2 - Method and apparatus for adjusting spectrum shape of audio signal

Info

Publication number: JP3426871B2
Application number: JP24673396A
Authority: JP
Inventors: 公生三関; 正浩押切; 明延山下; 政巳赤嶺; 皇天田
Original assignee: Toshiba Corp
Current assignee: Toshiba Corp
Priority date: 1995-09-18
Filing date: 1996-09-18
Publication date: 2003-07-14
Anticipated expiration: 2016-09-18
Also published as: JPH09190195A

Abstract

PROBLEM TO BE SOLVED: To stably improve the quality of voices such as decoded voice, synthesized voice and so forth with a small amt. of calculation. SOLUTION: This device cascade connects a first filter 112 having pole-zero type transfer functions A(z)/B(z) for emphasizing the spectral envelope of a voice signal and a second filter 113 for compensating inclinations of spectrums of the voice signal. Then, two filter coefficients to be used in the second filter 113 in order to compensate the inclinations of the spectrums are respectively calculated independently from the pole-zero type transfer functions and then the inclinations of spectrums respectively corresponding to pole-zero transfer functions are compensated by the calculated filter coefficients.

Description

Detailed Description of the Invention

【０００１】[0001]

【発明の属する技術分野】本発明は、復号音声や合成音
声の品質を向上させるために音声信号のスペクトル形状
を調整する方法および装置に関する。BACKGROUND OF THE INVENTION 1. Field of the Invention The present invention relates to a method and apparatus for adjusting the spectral shape of a speech signal in order to improve the quality of decoded speech and synthesized speech.

【０００２】[0002]

【従来の技術】音声信号を低ビットレートで符号化し、
得られた符号化データを伝送系または蓄積系に通した
後、符号化データを復号化する音声符号化／復号化シス
テムでは、復号化側で復号再生された音声信号の主観品
質を上げるために、復号化装置の最終段にポストフィル
タを配置することが多い。2. Description of the Related Art An audio signal is encoded at a low bit rate,
In a voice encoding / decoding system in which the obtained encoded data is passed through a transmission system or a storage system and then the encoded data is decoded, in order to improve the subjective quality of the voice signal decoded and reproduced on the decoding side. In many cases, a post filter is arranged at the final stage of the decoding device.

【０００３】ポストフィルタを組み合わせた従来の音声
復号化装置では、符号化データに含まれる各種パラメー
タがパラメータ復号部によって復号化され、これらの復
号化パラメータ情報に基づいて音声信号再生部により音
声信号が再生される。In a conventional speech decoding apparatus combined with a post filter, various parameters included in encoded data are decoded by a parameter decoding section, and a speech signal reproducing section produces a speech signal based on the decoded parameter information. Is played.

【０００４】パラメータ復号部と音声信号再生部からな
る復号化装置の後段には、ポストフィルタが配置され
る。ポストフィルタは、ピッチ成分強調フィルタ、スペ
クトル包絡強調フィルタ、高域強調フィルタおよびゲイ
ン調整部を縦続接続して構成される。A post filter is arranged at the subsequent stage of the decoding device including the parameter decoding unit and the audio signal reproducing unit. The post filter is configured by connecting a pitch component emphasis filter, a spectrum envelope emphasis filter, a high frequency emphasis filter, and a gain adjustment section in cascade.

【０００５】ポストフィルタの持つ機能は大別してピッ
チ成分の強調、スペクトル包絡の強調、高域成分の強
調、およびフィルタゲインの調整である。これらのうち
ピッチ成分とスペクトル包絡は、それぞれ音声の音程と
音韻を決める重量な要素であり、これらを強調すること
によりより明瞭で雑音感の少ない音声を生成できる効果
がある。また、フィルタゲインの調整はポストフィルタ
の入力と出力で音声信号の大きさが変わらないようにす
るために必要となる。The functions of the post filter are roughly classified into emphasis of pitch component, emphasis of spectrum envelope, emphasis of high frequency component, and adjustment of filter gain. Of these, the pitch component and the spectral envelope are important factors that determine the pitch and the phoneme of the voice, and by emphasizing these, the effect of producing a clearer and less noisy voice is obtained. Further, the adjustment of the filter gain is necessary so that the size of the audio signal does not change between the input and the output of the post filter.

【０００６】高域成分の強調は、「こもった感じ」や
「音の通りの悪さ」といった、符号化やポストフィルタ
の特性によって生じる音声の高域成分の不十分さを補う
ために行われる。特に、スペクトル包絡強調に用いるフ
ィルタは不要なスペクトルの傾き（平均的には低域強調
の傾き）を持つ場合が多く、これを補償することを目的
として高域成分の強調が用いられている。The enhancement of high frequency components is carried out in order to compensate for the insufficiency of the high frequency components of the voice caused by the characteristics of the coding and the post filter, such as "comfortable feeling" and "bad sound passage". In particular, a filter used for spectral envelope emphasis often has an unnecessary spectrum slope (on average, a low-frequency emphasis slope), and high-frequency component emphasis is used for the purpose of compensating for this.

【０００７】[0007]

【発明が解決しようとする課題】従来技術では、高域強
調フィルタとして例えばＣ（ｚ）＝１−μｚ^-1（μ＝
０．４程度の固定値）なる固定の伝達関数を持つフィル
タを用いている。このような高域強調フィルタを用いる
と、「こもった感じ」は低域し、主観音質がある程度は
向上する。しかし、その一方、例えば子音区間のような
高域強調の必要が無い区間の音声までもが過度に高域強
調されてしまうことによって、高域において異音を発生
することがあり、結果的に十分な音質改善効果が得られ
ないという問題点がある。In the prior art, for example, C (z) = 1-μz ^-1 (μ =
A filter having a fixed transfer function of (a fixed value of about 0.4) is used. When such a high-frequency emphasis filter is used, the "muffled feeling" is in the low frequency range, and the subjective sound quality is improved to some extent. On the other hand, on the other hand, the sound in a section where high-frequency emphasis does not need to be emphasized, such as a consonant section, is excessively emphasized in the high-frequency range, which may cause abnormal noise in the high-frequency range. There is a problem that a sufficient sound quality improvement effect cannot be obtained.

【０００８】すなわち、こもり感のある音声をよく聞い
て分析してみると、音声は常にこもっているわけではな
く、高域が十分に出ていない音声の区間の時間が長いた
めに、全体的にはこもっているように聞こえる。また、
音の区間によって高域が出ていない程度も様々である。
このため、固定の伝達関数の高域強調フィルタを用いる
と、高域が比較的出ている区間も高域強調がなされてし
まうことにより、音質が劣化する。That is, when listening to and analyzing a sound with a muffled sound, the sound is not always muffled, and since the time of the sound section in which the high frequency is not sufficient is long, Sounds like you're muddy. Also,
Depending on the section of sound, the extent to which the high range is not produced also varies.
For this reason, when a high-frequency emphasis filter having a fixed transfer function is used, high-frequency emphasis is performed even in a section in which a high frequency is relatively high, so that sound quality is deteriorated.

【０００９】もう一つの従来技術として、スペクトル包
絡強調フィルタの伝達関数Ｆ（ｚ）を予測分析し、それ
に基づき高域強調フィルタの伝達関数Ｃ（ｚ）における
パラメータμの値を適応的に変える方法が知られてい
る。しかし、この方法では、スペクトル包絡強調フィル
タの伝達関数Ｆ（ｚ）が一般に次数の多い極零型フィル
タのそれで表現されることから、パラメータμを求める
ための計算が非常に複雑になるという問題点がある。As another conventional technique, a method of predictively analyzing the transfer function F (z) of the spectral envelope emphasis filter and adaptively changing the value of the parameter μ in the transfer function C (z) of the high band emphasis filter based on the predictive analysis. It has been known. However, in this method, since the transfer function F (z) of the spectral envelope emphasis filter is generally expressed by that of a pole-zero filter having many orders, there is a problem that calculation for obtaining the parameter μ becomes very complicated. There is.

【００１０】上述したように、伝達関数が固定の高域強
調フィルタを用いた従来のポストフィルタでは、高域強
調の必要が無い区間の音声まで過度に高域強調すること
による高域での異音の発生という問題点があり、またス
ペクトル包絡強調フィルタの伝達関数を予測し、それに
基づいて高域強調フィルタの伝達関数を適応的に変化さ
せるポストフィルタでは、計算量が非常に多くなるため
に実施が困難であるという問題点があった。As described above, in the conventional post filter using the high-frequency emphasis filter whose transfer function is fixed, the difference in the high-frequency range is caused by the excessive high-frequency emphasis of the voice in the section where the high-frequency emphasis is unnecessary. There is a problem that sound is generated, and the post filter that predicts the transfer function of the spectral envelope emphasis filter and adaptively changes the transfer function of the high-frequency emphasis filter based on it, requires a large amount of calculation. There was a problem that it was difficult to implement.

【００１１】[0011]

【課題を解決するための手段】本発明の目的は、復号音
声や合成音声などの音声の品質を少ない計算量で安定し
て向上させることができる音声信号のスペクトル形状調
整方法および装置を提供することにある。SUMMARY OF THE INVENTION It is an object of the present invention to provide a method and apparatus for adjusting the spectral shape of a speech signal, which can stably improve the quality of speech such as decoded speech or synthesized speech with a small amount of calculation. Especially.

【００１２】また、本発明の他の目的は、音声信号のス
ペクトル形状の調整に伴うゲイン調整を行う際の音質劣
化を避けることができる音声信号のスペクトル形状調整
方法を提供することにある。Another object of the present invention is to provide a method for adjusting the spectrum shape of a voice signal, which can avoid the deterioration of the sound quality when the gain adjustment is performed along with the adjustment of the spectrum shape of the voice signal.

【００１３】本発明は音声信号をＡ（ｚ）／Ｂ（ｚ）で
表される極零型の伝達関数を持つ第１のフィルタと、こ
の第１のフィルタの特性を補償するための第２のフィル
タに通すことによって音声信号のスペクトル形状を調整
する方法において、第２のフィルタに用いる２つのパラ
メータをＡ（ｚ）とＢ（ｚ）とから別々に求めることを
特徴とする。According to the present invention, a first filter having a pole-zero type transfer function represented by A (z) / B (z) for an audio signal, and a second filter for compensating for the characteristic of the first filter are provided. In the method of adjusting the spectrum shape of the audio signal by passing through the filter (1), the two parameters used for the second filter are obtained separately from A (z) and B (z).

【００１４】また、本発明は音声信号のスペクトル形状
を調整するスペクトル形状調整装置において、Ａ（ｚ）
／Ｂ（ｚ）が表される極零型の伝達関数を持つ第１のフ
ィルタと、第１のフィルタと縦続接続され、第１のフィ
ルタの特性を補償するための第１および第２のパラメー
タを有する第２のフィルタと、第１および第２のパラメ
ータをＡ（ｚ）とＢ（ｚ）とから別々に生成するパラメ
ータ生成手段とを有することを特徴とする。Further, the present invention provides a spectrum shape adjusting device for adjusting the spectrum shape of an audio signal, wherein A (z)
/ B (z) is represented by a first filter having a pole-zero type transfer function, and first and second parameters cascade-connected to the first filter for compensating the characteristics of the first filter. And a parameter generating means for separately generating the first and second parameters from A (z) and B (z).

【００１５】ここで、パラメータ生成手段はより具体的
には、Ａ（ｚ）または１／Ａ（ｚ）の特性を予測し、そ
の予測係数を第１のパラメータとして生成する第１のパ
ラメータ生成手段と、Ｂ（ｚ）または１／Ｂ（ｚ）の特
性を予測し、その予測係数を第２のパラメータとして生
成する第２のパラメータ生成手段とからなる。Here, more specifically, the parameter generation means predicts the characteristic of A (z) or 1 / A (z), and generates the prediction coefficient as the first parameter. And a second parameter generating means for predicting the characteristic of B (z) or 1 / B (z) and generating the prediction coefficient as the second parameter.

【００１６】極零型で構成される第１のフィルタ（スペ
クトル包絡強調フィルタ）の伝達関数Ｆ（ｚ）＝Ａ
（ｚ）／Ｂ（ｚ）のスペクトルの傾きは、分子側Ａ
（ｚ）と分母側Ｂ（ｚ）のそれぞれのスペクトルの傾き
の合成で表現することができる。本発明はこの点に着目
し、Ｆ（ｚ）をＡ（ｚ）とＢ（ｚ）とに分離して、これ
らＡ（ｚ）とＢ（ｚ）のそれぞれのフィルタ係数から、
第２のフィルタ（適応フィルタ）の２つのパラメータを
直接的に求めるようにしたものである。適応フィルタで
は、このようにして得られた２つのパラメータを組み合
わせて、Ｆ（ｚ）全体のスペクトルの傾きを補償する伝
達関数を持つように構成される。The transfer function F (z) = A of the first filter (spectral envelope emphasis filter) of the pole-zero type.
The slope of the spectrum of (z) / B (z) is A on the molecular side.
(Z) and the denominator side B (z) can be expressed by combining the slopes of the respective spectra. The present invention pays attention to this point, separates F (z) into A (z) and B (z), and from these filter coefficients of A (z) and B (z),
The two parameters of the second filter (adaptive filter) are directly obtained. The adaptive filter is configured to have a transfer function that compensates for the slope of the spectrum of the entire F (z) by combining the two parameters thus obtained.

【００１７】このようにすると、従来技術のように次数
の大きな極零型であるＦ（ｚ）の全体的な特性を予測し
なくとも、Ｆ（ｚ）を構成する次数の小さな零型の形で
表現されたＡ（ｚ）やＢ（ｚ）のフィルタ係数の情報か
ら、簡単かつ精度よくＦ（ｚ）のスペクトルの傾きを補
償できる適応フィルタに必要なパラメータが求められ
る。すなわち、少ない計算量で安定的に音質の改善を図
ることが可能となる。In this way, the zero-type shape of F (z) having a small order is formed without predicting the overall characteristics of F (z), which is a pole-zero type having a large order, as in the prior art. From the information of the filter coefficients of A (z) and B (z) expressed by, the parameters necessary for the adaptive filter that can compensate the slope of the spectrum of F (z) easily and accurately can be obtained. That is, it is possible to stably improve the sound quality with a small amount of calculation.

【００１８】また、本発明によれば第２のフィルタは少
なくとも、それぞれ異なるパラメータを持つ極型フィル
タと零型フィルタによるフィルタリング処理を行うこと
を特徴とする。ここで、極型フィルタと零型フィルタは
いずれも低次のフィルタであることが計算量を少なくす
る上で望ましく、計算量の面からは１次のフィルタが最
適である。第２のフィルタを１次のフィルタで実現する
場合、第２のフィルタはｚ変換領域の伝達関数が（１−μ_z ｚ^-1）／（１−μ_p ｚ^-1）（但し、μ_z ，μ_p は絶対値が１より小さい値を持つ互
いに独立したフィルタ係数）で表される１次の極零型の
伝達関数を少なくとも含むフィルタによって実現され
る。Further, according to the present invention, the second filter is characterized by performing at least a filtering process using a polar filter and a zero filter having different parameters. Here, it is desirable that both the pole-type filter and the zero-type filter are low-order filters in order to reduce the amount of calculation, and in terms of the amount of calculation, the first-order filter is optimal. When implementing the second filter at the primary filter, the second filter has a transfer function of z-transform domain _{^{(1-μ z z -1)}} / (1-μ p z -1) ( where, mu _z , Μ _p are realized by a filter including at least a first-order pole-zero type transfer function represented by mutually independent filter coefficients each having an absolute value smaller than 1.

【００１９】このように第２のフィルタを極１次・零１
次のフィルタで構成すると、従来のは１次の零型フィル
タにより構成されたフィルタではパラメータが１個であ
ったのに対して、パラメータが２個となるため、第２の
フィルタの伝達関数の表現の自由度が高くなり、柔軟に
スペクトルの傾き補償を行うことができ、音質の改善効
果がさらに高くなる。In this way, the second filter is used as a pole first-order zero
When configured with the following filter, the conventional filter configured with a first-order zero-type filter has one parameter, but has two parameters, so that the transfer function of the second filter is The degree of freedom of expression is increased, the inclination of the spectrum can be compensated flexibly, and the effect of improving the sound quality is further enhanced.

【００２０】この場合において、Ａ（ｚ）からμ_p を求
め、Ｂ（ｚ）からμ_z を求めるようにすれば、より次数
の低いフィルタ係数でスペクトルの傾きを補償すること
ができる。[0020] In this case, we seek mu _p from A (z), if the B (z) to determine the mu _z, it is possible to compensate for the tilt of the spectrum at lower filter coefficients of orders.

【００２１】また、この場合にＣ₁ ＜Ｃ₃ ＜Ｃ₀ なる関
係の重み係数を用い、Ａ（ｚ）のパラメータから求めた
第１の自己相関係数が０に近いしきい値（Ｔｈとする）
より小のとき第１の自己相関係数に重み係数Ｃ₀ で重み
付けを行い、第１の自己相関係数がしきい値Ｔｈより大
のとき第１の自己相関係数に重み係数ｃ1 で重み付けを
行うことにより得られる値からμ_p を求め、Ｂ（ｚ）の
パラメータから求めた第２の自己相関係数に重み係数Ｃ
₃ で重み付けを行って得られる値からμ_z を求めること
とすれば、第１の自己相関係数がしきい値Ｔｈよりも小
さい場合、この区間の音声は高域の強い子音のような音
声であり、逆に第１の自己相関係数がしきい値Ｔｈより
も大きい場合は、この区間の音声は低域の強い母音のよ
うな音声であるから、上記のように自己相関係数としき
い値Ｔｈとの比較により重み係数を切り換えることによ
り、第２のフィルタを子音と母音それぞれに適合した補
償特性とすることができ、音質が効果的に改善される。Further, in this case, using the weighting coefficient of the relationship of C ₁ <C ₃ <C ₀ , the first autocorrelation coefficient obtained from the parameter of A (z) has a threshold value (Th and Do)
When it is smaller, the first autocorrelation coefficient is weighted with the weighting coefficient C ₀ , and when it is larger than the threshold Th, the first autocorrelation coefficient is weighted with the weighting coefficient c 1. Μ _p is obtained from the value obtained by performing, and the weighting factor C is added to the second autocorrelation coefficient obtained from the parameter of B (z).
If μ _z is obtained from the value obtained by weighting with ₃ , if the first autocorrelation coefficient is smaller than the threshold Th, the voice in this section is a voice like a strong consonant in the high frequency range. On the contrary, when the first autocorrelation coefficient is larger than the threshold value Th, the voice in this section is a voice like a strong vowel in the low frequency range. By switching the weighting factor by comparing with the threshold value Th, the second filter can have a compensation characteristic suitable for each consonant and vowel, and the sound quality is effectively improved.

【００２２】さらに、本発明は音声信号に対して所定の
フィルタ処理を行うことにより音声信号のスペクトル形
状を調整する方法において、フィルタ処理に伴う音声信
号のパワー変化を補償するために該音声信号に乗じるゲ
インを調整する際、音声信号に乗じるべきゲインの正負
判定を行い、これが負の場合は所定の方法で与えられる
非負の値にゲインを置き換えることを特徴とする。ここ
で、置き換える非負の値は小さな値、具体的には０以
上、１未満の値であることが望ましい。このようにする
と、音声信号のスペクトル形状の調整に伴うゲイン調整
を行う際の負のゲインに起因する音質劣化が回避され
る。Furthermore, the present invention is a method for adjusting the spectral shape of an audio signal by performing a predetermined filtering process on the audio signal, in order to compensate for the power change of the audio signal due to the filtering process. When adjusting the gain to be multiplied, the positive / negative of the gain to be multiplied by the audio signal is determined, and if the gain is negative, the gain is replaced with a non-negative value given by a predetermined method. Here, the non-negative value to be replaced is preferably a small value, specifically, a value of 0 or more and less than 1. In this way, it is possible to avoid the sound quality deterioration due to the negative gain when performing the gain adjustment accompanying the adjustment of the spectrum shape of the audio signal.

【００２３】この発明によると、音声信号をスペクトル
包絡強調するため極零型の伝達関数を持つ第１のフィル
タと、この第１のフィルタにより行われるフィルタ処理
による前記音声信号のスペクトルの傾きを補償するため
の第２のフィルタとを準備するステップと、スペクトル
の傾きを補償するために前記第２のフィルタにおいて用
いる２つのフィルタ係数を前記極零型伝達関数からそれ
ぞれ独立的に求めるステップと、求められた前記フィル
タ係数によって前記極零型伝達関数にそれぞれ対応する
スペクトルの傾きを補償するステップとによりなる音声
信号のスペクトル形状調整方法が提供される。According to the present invention, a first filter having a pole-zero type transfer function for emphasizing a spectrum envelope of a voice signal and a slope of the spectrum of the voice signal by the filtering process performed by the first filter are compensated. A second filter for adjusting the spectrum, and a step of independently obtaining two filter coefficients used in the second filter for compensating for the slope of the spectrum from the pole-zero type transfer function. And a step of compensating the slope of the spectrum corresponding to each of the pole-zero type transfer functions by the obtained filter coefficient.

【００２４】この発明によると、音声信号をスペクトル
包絡強調するため極零型の伝達関数を持つ第１のフィル
タと、第１のフィルタにより行われるフィルタ処理によ
る前記音声信号のスペクトルの傾きを補償するため、前
記第１のフィルタから入力される前記極零型伝達関数か
ら２つのフィルタ係数をそれぞれ独立的に求める計算部
と求めたフィルタ係数に従って前記第１のフィルタから
出力される音声信号をフィルタ処理し、前記極零型伝達
関数にそれぞれ対応するスペクトルの傾きを補償するフ
ィルタ部とを有する第２のフィルタとにより構成される
音声信号のスペクトル形状調整装置が提供される。According to the present invention, the first filter having a pole-zero type transfer function for enhancing the spectral envelope of the voice signal and the slope of the spectrum of the voice signal by the filtering process performed by the first filter are compensated. Therefore, the audio signal output from the first filter is filtered according to the calculation unit that independently obtains two filter coefficients from the pole-zero type transfer function that is input from the first filter and the obtained filter coefficient. And a second filter having a filter section for compensating for the slope of the spectrum corresponding to each of the pole-zero type transfer functions, and a spectrum shape adjusting device for an audio signal are provided.

【００２５】この発明によると、入力音声信号を分析
し、合成フィルタデータを出力する合成フィルタ分析器
と、合成フィルタ分析器からの合成フィルタデータに基
づいて重み付けフィルタデータを算出するフィルタデー
タ計算器と、重み付けフィルタデータに基づいて入力音
声信号をフィルタリングする重み付けフィルタとにより
構成され、重み付けフィルタが重み付けフィルタデータ
を用いて極零伝達関数を有する第１のフィルタと第１の
フィルタによるスペルトルの傾きを補償するための伝達
関数を有する第２のフィルタを含む音声信号のスペクト
ル形状調整装置が提供される。According to the present invention, a synthesis filter analyzer that analyzes an input voice signal and outputs synthesis filter data, and a filter data calculator that calculates weighted filter data based on the synthesis filter data from the synthesis filter analyzer. A weighting filter for filtering the input audio signal based on the weighting filter data, the weighting filter compensating for the slope of the spelltle by the first filter having a pole-zero transfer function using the weighting filter data. An apparatus for spectral shape adjustment of an audio signal is provided that includes a second filter having a transfer function for

【００２６】[0026]

【発明の実施の形態】図１を参照して、本発明の第１の
実施例に係るポストフィルタを組み込んだ音声復号化装
置を説明する。この音声復号化装置は、パラメータ復号
部１０１と音声信号再生部１０２およびポストフィルタ
１０３から構成される。BEST MODE FOR CARRYING OUT THE INVENTION A speech decoding apparatus incorporating a post filter according to a first embodiment of the present invention will be described with reference to FIG. This audio decoding device is composed of a parameter decoding unit 101, an audio signal reproducing unit 102, and a post filter 103.

【００２７】入力端子１００には、送信側の音声符号化
装置から伝送された符号化データが入力される。この符
号化データはパラメータ復号部１０１に入力され、音声
信号再生部１０２で使用するピッチベクトル、雑音ベク
トル、ゲインおよびＬＰＣ係数などのパラメータ情報が
復号化される。音声信号再生部１０２は、入力されたパ
ラメータ情報をもとに音声信号を再生する。To the input terminal 100, the encoded data transmitted from the voice encoding device on the transmitting side is input. This encoded data is input to the parameter decoding unit 101, and parameter information such as the pitch vector, noise vector, gain and LPC coefficient used in the audio signal reproducing unit 102 is decoded. The audio signal reproducing unit 102 reproduces an audio signal based on the input parameter information.

【００２８】音声信号再生部１０２の一例としては、Ｃ
ＥＬＰ (Code Excited Linear Prediction) 方式の音声
信号再生部を挙げることができる。この方式の音声信号
再生部では、再生されたピッチベクトルと雑音ベクトル
が再生されたゲインと乗じてから組み合わされることに
より、ＬＰＣ合成フィルタの駆動信号が生成され、この
駆動信号がＬＰＣ合成フィルタに通過させることによっ
て音声信号が再生される。As an example of the audio signal reproducing section 102, C
An example is an ELP (Code Excited Linear Prediction) audio signal reproducing unit. In the audio signal reproducing section of this method, the reproduced pitch vector and the noise vector are multiplied by the reproduced gain and then combined to generate a drive signal for the LPC synthesis filter, and the drive signal is passed to the LPC synthesis filter. By doing so, the audio signal is reproduced.

【００２９】ポストフィルタ１０３は音声復号化装置の
最終段に配置され、再生された音声信号の主観品質を向
上させるために用いられる。この実施例のポストフィル
タは、ピッチ成分強調フィルタ１１１、スペクトル包絡
強調フィルタ１１２、補償フィルタ１１３およびゲイン
調整部１１４を縦続接続して構成される。補償フィルタ
１１３は、適応フィルタ１２１とそのフィルタ係数を計
算するフィルタ係数計算部１２２から構成され、さらに
フィルタ係数計算部１２２は第１のパラメータ計算部１
２３と第２のパラメータ計算部１２４から構成される。
ゲイン調整部１１４は、ポストフィルタ１０３による処
理後の音声信号が処理前の音声信号と同程度のパワとな
るように滑らかにゲインを調整し、調整後の音声信号を
音声信号出力端子１０４へ出力する。The post filter 103 is arranged at the final stage of the speech decoding apparatus and is used to improve the subjective quality of the reproduced speech signal. The post filter of this embodiment is configured by connecting a pitch component emphasis filter 111, a spectrum envelope emphasis filter 112, a compensation filter 113 and a gain adjusting unit 114 in cascade connection. The compensation filter 113 includes an adaptive filter 121 and a filter coefficient calculation unit 122 that calculates the filter coefficient thereof, and the filter coefficient calculation unit 122 further includes the first parameter calculation unit 1
23 and the second parameter calculation unit 124.
The gain adjusting unit 114 smoothly adjusts the gain so that the audio signal after processing by the post filter 103 has the same power as the audio signal before processing, and outputs the adjusted audio signal to the audio signal output terminal 104. To do.

【００３０】以下、ポストフィルタ１０３についてさら
に詳細に説明する。ピッチ成分強調フィルタ１１１は、
音声信号のピッチ周期の繰り返しを強調する目的で用い
られるフィルタである。ピッチ成分強調フィルタ１１１
の設計法としては、ピッチ周期とピッチゲインをパラメ
ータとして用いる様々な設計法が考えられるが、その伝
達関数の一例としてＰ（ｚ）＝１／（１−εβｚ^-T）を
用いることができる。ここで、Ｔはピッチ周期である。
また、βはピッチゲイン、εはピッチ強調の程度を調節
するパラメータであり、これらは０＜εβ＜１の関係に
設定される。The post filter 103 will be described in more detail below. The pitch component emphasis filter 111 is
It is a filter used for the purpose of emphasizing the repetition of the pitch period of the audio signal. Pitch component emphasis filter 111
Although various design methods using the pitch period and the pitch gain as parameters can be considered as the design method of, the P (z) = 1 / (1-εβz ^−T ) can be used as an example of the transfer function thereof. Here, T is a pitch period.
Further, β is a pitch gain, ε is a parameter for adjusting the degree of pitch enhancement, and these are set in a relationship of 0 <εβ <1.

【００３１】スペクトル包絡強調フィルタ１１２は、音
声信号のスペクトル包絡の形状を強調する目的で用いら
れ、その伝達関数をＦ（ｚ）とする。ＣＥＬＰ方式で
は、次式に示す伝達関数Ｆ（ｚ）を持つ極零型フィルタ
をスペクトル包絡強調フィルタとして用いてスペクトル
包絡を強調する方法が一般に用いられる。The spectral envelope emphasis filter 112 is used for the purpose of emphasizing the shape of the spectral envelope of the audio signal, and its transfer function is F (z). In the CELP method, a method of emphasizing a spectrum envelope by using a pole-zero type filter having a transfer function F (z) shown below as a spectrum envelope emphasizing filter is generally used.

【００３２】Ｆ（ｚ）＝Ａ（ｚ）／Ｂ（ｚ）（１）但し、Ａ（ｚ）＝１／Ｈ（ｚ／γ₁ ）Ｂ（ｚ）＝１／Ｈ（ｚ／γ₂ ），０＜γ₁ ＜γ₂ Ｈ（ｚ）：音声信号のスペクトル包絡を表すフィルタ伝
達関数このようなスペクトル包絡強調フィルタ１１２を用いる
と、スペクトル包絡の凹凸を強調できるので、ポストフ
ィルタ１０３を通過した後の音声信号は、聴感上ノイズ
感が減って感知される。しかし、この構成では音声毎に
定まる伝達関数Ｆ（ｚ）の変化に応じて様々なスペクト
ルの傾斜が付加されてしまう。F (z) = A (z) / B (z) (1) where A (z) = 1 / H (z / γ ₁ ) B (z) = 1 / H (z / γ ₂ ) , 0 <γ ₁ <γ ₂ H (z): a filter transfer function representing the spectrum envelope of the audio signal. When such a spectrum envelope enhancement filter 112 is used, the unevenness of the spectrum envelope can be enhanced, and thus the filter passes the post filter 103. The subsequent audio signal is perceived as having a less audible noise. However, with this configuration, various spectrum slopes are added according to changes in the transfer function F (z) that is determined for each voice.

【００３３】すなわち、極零型フィルタからなるスペク
トル包絡強調フィルタ１１２の伝達関数Ｆ（ｚ）は、ス
ペクトル全体で見ると無視できない程度に低域強調型の
傾きを持つ場合がある。従来技術のポストフィルタで使
用されるＣ（ｚ）なる伝達関数の高域強調フィルタは、
符号化で劣化した高域成分を持ち上げる役割のほかに、
このようなスペクトル包絡強調フィルタが持つ不要な低
域強調のスペクトルの傾きをおおむね補償する役割を持
っていた。That is, the transfer function F (z) of the spectrum envelope emphasis filter 112 composed of the pole-zero type filter may have a low-frequency emphasis type inclination which cannot be ignored in the entire spectrum. The high-pass emphasis filter of the transfer function C (z) used in the prior art post filter is
In addition to the role of raising the high frequency components that have been degraded by encoding,
The spectrum envelope emphasis filter had a role of roughly compensating for the unnecessary slope of the spectrum of the low-frequency emphasis.

【００３４】しかし、スペクトル包絡強調フィルタ１１
２の伝達関数Ｆ（ｚ）は、処理する音声信号のスペクト
ル包絡の特性に応じて変化するように設定されるので、
そのスペクトルの傾きは時間と共に変化する。すなわ
ち、ある瞬間のＦ（ｚ）は低域強調特性を持つが、別の
瞬間（例えば子音の音声区間）のＦ（ｚ）は逆に高域強
調となることもある。この場合、従来のように固定の伝
達関数Ｃ（ｚ）の高域強調フィルタ用いると、音声の高
域を過度に強調してしまうことになり、これが異音を生
じる原因となる。However, the spectral envelope enhancement filter 11
The transfer function F (z) of 2 is set so as to change according to the characteristics of the spectral envelope of the audio signal to be processed.
The slope of the spectrum changes with time. That is, F (z) at a certain moment has low-frequency emphasis characteristics, but F (z) at another moment (for example, a voice section of a consonant) may have high-frequency emphasis. In this case, if a high-frequency emphasis filter having a fixed transfer function C (z) is used as in the conventional case, the high-frequency band of voice is excessively emphasized, which causes abnormal noise.

【００３５】これに対し、本実施例では適応フィルタ１
２１とフィルタ係数計算部１２２からなる補償フィルタ
１１３により、（１）式で示される伝達関数Ｆ（ｚ）の
スペクトル包絡強調フィルタ１１２を用いることにより
生じるスペクトルの傾斜を補償し、さらに必要に応じて
音質に明るさを与える等の調整を可能としている。フィ
ルタ係数計算部１２２のパラメータ計算部１２３および
２３は、それぞれ零フィルタ伝達関数Ａ（ｚ）および極
フィルタ伝達関数Ｂ（ｚ）を適応フィルタ１２１に入力
し、適応フィルタ１２１で使用する２つのパラメータを
計算する。On the other hand, in this embodiment, the adaptive filter 1
21 and the filter coefficient calculation unit 122 compensates the inclination of the spectrum generated by using the spectrum envelope emphasis filter 112 of the transfer function F (z) represented by the equation (1), and further, if necessary. It is possible to make adjustments such as giving brightness to the sound quality. The parameter calculation units 123 and 23 of the filter coefficient calculation unit 122 input the zero filter transfer function A (z) and the pole filter transfer function B (z) to the adaptive filter 121, respectively, and input two parameters used by the adaptive filter 121. calculate.

【００３６】次に、補償フィルタ１１３について詳細に
説明する。（１）式に示したスペクトル包絡強調フィル
タ１１２の伝達関数Ｆ（ｚ）は、Ｆ（ｚ）＝Ａ（ｚ）／
Ｂ（ｚ）であり、これは極型及び零型のフィルタに分離
した形で表すことができる。ここで、Ａ（ｚ）＝Σａ_i ｚ^-i，ａ₀ ＝１，（ｉ＝０〜１０）（２）Ｂ（ｚ）＝Σｂ_i ｚ^-i，ｂ₀ ＝１，（ｉ＝０〜１０）（３）である。Next, the compensation filter 113 will be described in detail. The transfer function F (z) of the spectral envelope emphasis filter 112 shown in Expression (1) is F (z) = A (z) /
B (z), which can be represented separately in pole-type and zero-type filters. Here, A (z) = Σa _i z ⁻ⁱ , a ₀ = 1 and (i = 0 to 10) (2) B (z) = Σb _i z ⁻ⁱ and b ₀ = 1 and (i = 0 to 0) 10) (3)

【００３７】フィルタ係数計算部１２２において、パラ
メータ計算部１２３は伝達関数Ａ（ｚ）のフィルタ係数
を伝達関数Ａ（ｚ）のインパルス応答と見なし、このイ
ンパルス応答の１次の正規化自己相関係数に相当する第
１のパラメータρ_A を求め、これを適応フィルタ１２１
に渡す。同様に、係数計算部１２４は伝達関数Ｂ（ｚ）
のフィルタ係数を伝達関数Ｂ（ｚ）のインパルス応答と
見なし、このインパルス応答の１次の正規化自己相関関
数に相当する第２のパラメータρ_B を適応フィルタ１２
１に渡す。これらのパラメータρ_A ，ρ_B は、次式で定
義できる。In the filter coefficient calculation unit 122, the parameter calculation unit 123 regards the filter coefficient of the transfer function A (z) as the impulse response of the transfer function A (z), and the first-order normalized autocorrelation coefficient of this impulse response. The first parameter ρ _A corresponding to
Pass to. Similarly, the coefficient calculation unit 124 uses the transfer function B (z).
Is regarded as the impulse response of the transfer function B (z), and the second parameter ρ _B corresponding to the first-order normalized autocorrelation function of this impulse response is used as the adaptive filter 12
Pass to 1. These parameters ρ _A and ρ _B can be defined by the following equations.

【００３８】 ρ_A ＝（Σａ_i ａ_i-1 ）／（Σａ_i ² ）（４） ρ_B ＝（Σｂ_i ｂ_i-1 ）／（Σｂ_i ² ）（５）これらのパラメータρ_A ，ρ_B の値は、それぞれ伝達関
数Ａ（ｚ），Ｂ（ｚ）のフィルタのインパルス応答に対
する１次の予測係数となっている。これらのパラメータ
ρ_A ，ρ_B を用い式（６）及び（７）に基づいてａ
（ｚ）およびｂ（ｚ）が求められる。Ρ _A = (Σa _i a _i-1 ) / (Σa _i ² ) (4) ρ _B = (Σb _i b _i-1 ) / (Σb _i ² ) (5) These parameters ρ _A , ρ The value of _B is a first-order prediction coefficient for the impulse response of the filter having the transfer functions A (z) and B (z), respectively. Based on equations (6) and (7) using these parameters ρ _A and ρ _B ,
(Z) and b (z) are determined.

【００３９】ａ（ｚ）＝１−τ_A （ρ_A ）ｚ^-1 （６）ｂ（ｚ）＝１−τ_B （ρ_B ）ｚ^-1 （７）これらａ（ｚ）およびｂ（ｚ）を用いて適応フィルタ１
２１の伝達関数が式（８）のように設定される。 _A (z) = 1−τ _A (ρ _A ) z ⁻¹ (6) b (z) = 1−τ _B (ρ _B ) z ⁻¹ (7) These a (z) and b (z ) Using adaptive filter 1
The transfer function of 21 is set as in equation (8).

【００４０】Ｄ（ｚ）＝ａ（ｚ）／ｂ（ｚ）（８）ここで、τ_A （），τ_B （）はパラメータρ_A ，ρ_B の
値を調整する関数である。このようにすることで、伝達
関数Ｆ（ｚ）のスペクトル包絡強調フィルタ１１２によ
るスペクトルの傾斜が伝達関数Ｄ（ｚ）の適応フィルタ
１２１によって効果的に補償し得る。D (z) = a (z) / b (z) (8) Here, τ _A () and τ _B () are functions that adjust the values of the parameters ρ _A and ρ _B. By doing so, the slope of the spectrum of the transfer function F (z) by the spectral envelope emphasis filter 112 can be effectively compensated by the adaptive filter 121 of the transfer function D (z).

【００４１】式（８）の伝達関数は、（１−μ_z ｚ^-1）
／（１−μ_p ｚ^-1）で表される１次の極零型の伝達関数
となっている。但し、μ_z ，μ_p は絶対値が１より小さ
い値を持つ互いに独立したフィルタ係数であり、この場
合の例では、μ_z ＝τ_A （ρ_A ）、μ_p ＝τ_B （ρ_B ）
である。即ち、これら伝達関数μ_z ，μ_p は伝達関数Ａ
（ｚ）およびＢ（ｚ）に応じてそれぞれ独立的に設定で
きる。The transfer function of the equation (8) is (1-μ _z z ^-1 )
It is a first-order pole-zero type transfer function represented by / (1-μ _p z ^-1 ). However, μ _z and μ _p are independent filter coefficients having absolute values smaller than 1, and in this example, μ _z = τ _A (ρ _A ), μ _p = τ _B (ρ _B ).
Is. That is, these transfer functions μ _z and μ _p are transfer functions A
It can be set independently depending on (z) and B (z).

【００４２】次に、上述したポストフィルタ１０３での
処理の流れを図２に示すフローチャートを用いて説明す
る。まず、最初にスペクトル包絡強調フィルタ１１２の
伝達関数Ｆ（ｚ）のパラメータ（フィルタ係数）を取得
する（ステップＳ１１）。次に、これらのパラメータか
らＦ（ｚ）を分子側伝達関数Ａ（ｚ）と分母側伝達関数
Ｂ（ｚ）とに分離して、フィルタ係数計算部１１３のパ
ラメータ計算部１２３，１２４にそれぞれ与える（ステ
ップＳ１２）。Next, the flow of processing in the above-mentioned post filter 103 will be described using the flowchart shown in FIG. First, the parameter (filter coefficient) of the transfer function F (z) of the spectrum envelope emphasis filter 112 is acquired (step S11). Next, from these parameters, F (z) is separated into a numerator-side transfer function A (z) and a denominator-side transfer function B (z), and given to the parameter calculation units 123 and 124 of the filter coefficient calculation unit 113, respectively. (Step S12).

【００４３】パラメータ計算部１２３，２３では、伝達
関数Ａ（ｚ），Ｂ（ｚ）のフィルタ係数を伝達関数Ａ
（ｚ），Ｂ（ｚ）のインパルス応答と見なして、そのイ
ンパルス応答の１次の正規化自己相関係数に相当するパ
ラメータρ_A ，ρ_B を式（４），（５）により計算し、
これらを適応フィルタ１２１に渡す。適応フィルタ１２
１では、パラメータρ_A ，ρ_B から式（６），（７）に
よって１次のフィルタであるａ（ｚ），ｂ（ｚ）を求
め、式（８）で示される伝達関数Ｄ（ｚ）に設定する
（ステップＳ１３）。適応フィルタ１２１はフィルタａ
（ｚ），ｂ（ｚ）に従って極／零フィルタの伝達関数の
傾きをそれぞれ独立的に補償しながらフィルタリング処
理を行うことよって、スペクトル包絡強調フィルタ１１
２におけるスペクトルの傾きを補償する処理を行う。In the parameter calculation units 123 and 23, the filter coefficients of the transfer functions A (z) and B (z) are transferred to the transfer function A.
(Z) and B (z) are regarded as impulse responses, and parameters ρ _A and ρ _B corresponding to the first-order normalized autocorrelation coefficient of the impulse responses are calculated by equations (4) and (5),
These are passed to the adaptive filter 121. Adaptive filter 12
1, the first-order filters a (z) and b (z) are obtained from the parameters ρ _A and ρ _B by the equations (6) and (7), and the transfer function D (z) shown in the equation (8) is obtained. Is set (step S13). The adaptive filter 121 is a filter a
(Z), b each independently compensate Shinano the slope of the transfer function pole / zero filter in accordance with (z) is I'll be performed et filtering, spectral envelope emphasis filter 11
The process of compensating the inclination of the spectrum in 2 is performed.

【００４４】次に、第２の実施例について説明する。本
実施例においては、外見的な構成は第１の実施例である
図１と同様であるが、補償フィルタ１１３の設計法が異
なっている。Next, the second embodiment will be described. In this embodiment, the external structure is similar to that of the first embodiment shown in FIG. 1, but the design method of the compensation filter 113 is different.

【００４５】第１の実施例においては、スペクトル包絡
強調フィルタ１１２の伝達関数Ｆ（ｚ）の分子側の伝達
関数Ａ（ｚ）によるスペクトルの傾斜を適応フィルタ１
２１の伝達関数Ｄ（ｚ）の分子側のａ（ｚ）で補償し、
Ｆ（ｚ）の分母側の伝達関数Ｂ（ｚ）によるスペクトル
の傾斜を分母側のｂ（ｚ）で補償している。これに対
し、第２の実施例では伝達関数Ｆ（ｚ）の零点側の伝達
関数Ａ（ｚ）によるスペクトル傾斜をＤ（ｚ）の極側の
フィルタｂ（ｚ）で補償し、伝達関数Ｆ（ｚ）の極側の
伝達関数Ｂ（ｚ）によるスペクトル傾斜をＤ（ｚ）の零
点側のａ（ｚ）で補償する。言い換えれば、伝達関数Ａ
（ｚ）から係数μ_p を求め、伝達関数Ｂ（ｚ）から係数
μ_z を求めるようにしている。これは、零点は極で補償
し、極は零点で補償することが、より次数の低いフィル
タ係数で補償を行うことができ、効率が良いという考え
に基づいている。In the first embodiment, the filter slope of the spectrum due to the transfer function A (z) on the numerator side of the transfer function F (z) of the spectral envelope emphasis filter 112 is adjusted by the adaptive filter 1.
21 is compensated by a (z) on the numerator side of the transfer function D (z),
The slope of the spectrum due to the transfer function B (z) on the denominator side of F (z) is compensated by b (z) on the denominator side. On the other hand, in the second embodiment, the spectral tilt due to the transfer function A (z) on the zero side of the transfer function F (z) is compensated for by the filter b (z) on the pole side of D (z) to obtain the transfer function F (z). The spectral tilt due to the transfer function B (z) on the pole side of (z) is compensated by a (z) on the zero point side of D (z). In other words, the transfer function A
Determined coefficient mu _p from (z), so that obtaining the coefficient mu _z transfer function B (z). This is based on the idea that the zero point is compensated by the pole and the pole point is compensated by the zero point, because the compensation can be performed by the filter coefficient having a lower order, and the efficiency is good.

【００４６】具体的には、伝達関数Ａ（ｚ）のフィルタ
係数をＬＰＣ係数と見なすことにより、Ｄｕｒｂｉｎ法
の逆のアルゴリズムを利用して、伝達関数Ａ（ｚ）のス
ペクトル包絡に近似する１次のＰＡＲＣＯＲ係数（部分
自己相関係数）ｋ_A を適応フィルタ１２１の第１のパラ
メータとして求める。同様に、Ｂ（ｚ）のスペクトル包
絡に近似する１次のＰＡＲＣＯＲ係数ｋ_B を適応フィル
タ１２１の第２のパラメータとして求める。このとき、
パラメータｋ_A とｋ_B はそれぞれ１／Ａ（ｚ）および１
／Ｂ（ｚ）のインパルス応答に対する１次の予測係数と
なっている。Specifically, by regarding the filter coefficient of the transfer function A (z) as an LPC coefficient, a first-order approximation of the spectral envelope of the transfer function A (z) is performed by using the inverse algorithm of the Durbin method. The PARCOR coefficient (partial autocorrelation coefficient) k _{A of} is obtained as the first parameter of the adaptive filter 121. Similarly, a first-order PARCOR coefficient k _B that approximates the spectral envelope of B (z) is obtained as the second parameter of the adaptive filter 121. At this time,
The parameters k _A and k _B are 1 / A (z) and 1 respectively.
It is a first-order prediction coefficient for the impulse response of / B (z).

【００４７】これらの２つのパラメータｋ_A ，ｋ_B を用
いてＡ（ｚ）とＢ（ｚ）がもたらすスペクトルの傾きを
補償するように、適応フィルタ１２１の伝達関数Ｄ
（ｚ）を設定する。その具体的な一例は、Ｄ（ｚ）＝ａ（ｚ）／ｂ（ｚ）（９）ａ（ｚ）＝１−η_B （ｋ_B ）ｚ^-1 （１０）ｂ（ｚ）＝１−η_A （ｋ_A ）ｚ^-1 （１１）とすることである。ここで、η_A （），η_B （）はパラ
メータｋ_A ，ｋ_B の値を調整する関数である。一例とし
て、η_A （ｋ_A ）＝０．５ｋ_A ，η_B （ｋ_B ）＝０．８
ｋ_B とすることができる。Using these two parameters k _A and k _B , the transfer function D of the adaptive filter 121 is adjusted so as to compensate for the spectral tilt caused by A (z) and B (z).
(Z) is set. Its specific example is, D (z) = a ( z) / b (z) (9) a (z) = 1-η B (k B) z -1 (10) b (z) = 1- η _A (k _A ) z ⁻¹ (11). Here, η _A () and η _B () are functions that adjust the values of the parameters k _A and k _B. As an example, η _A (k _A ) = 0.5 k _A , η _B (k _B ) = 0.8
It can be k _B.

【００４８】式（９）の伝達関数も、第１の実施例の場
合と同様（１−μ_z ｚ^-1）／（１−μ_p ｚ^-1）で表され
る１次の極零型の伝達関数となっている。但し、μ_z，
μ_p は絶対値が１より小さい値を持つ互いに独立したフ
ィルタ係数であり、この場合はμ_z ＝η_B （ｋ_B ）、μ
_p ＝η_A （ｋ_A ）である。Similarly to the case of the first embodiment, the transfer function of equation (9) is also represented by (1-μ _z z ^-1 ) / (1-μ _p z ^-1 ) of the first-order pole-zero type. Is the transfer function of. Where μ _z ,
μ _p is an independent filter coefficient having an absolute value smaller than 1, and in this case μ _z = η _B (k _B ), μ
_p = η _A (k _A ).

【００４９】なお、Ｄｕｒｂｉｎ法のアルゴリズムを逆
に用いるＬＰＣ係数からＰＡＲＣＯＲ係数への変換式は
公知であり、例えば「ディジタル音声処理」（東海大学
出版会、古井著）に詳しく書かれている。The conversion formula from the LPC coefficient to the PARCOR coefficient, which uses the Durbin method algorithm in reverse, is well known, and is described in detail in, for example, "Digital Speech Processing" (Tokai University Press, Furui).

【００５０】次に、本実施例におけるポストフィルタ１
０３での処理の流れを図３に示すフローチャートを用い
て説明する。まず、極零フィルタからなるスペクトル包
絡強調フィルタ１１２の伝達関数Ｆ（ｚ）＝Ａ（ｚ）／
Ｂ（ｚ）における零側の伝達関数Ａ（ｚ），極側の伝達
関数Ｂ（ｚ）のパラメータを取得する（ステップＳ２
１）。次に、パラメータ計算部１２３，２３でＤｕｒｂ
ｉｎ法の逆アルゴリズムを用いて、Ａ（ｚ）のパラメー
タから１次のフィルタｂ（ｚ）のパラメータｋ_A を、Ｂ
（ｚ）のパラメータから１次のフィルタａ（ｚ）のパラ
メータｋ_B を計算によりそれぞれ求め（ステップＳ２
２）、これらａ（ｚ），ｂ（ｚ）を式（９）に示すよう
にＤ（ｚ）のパラメータとして設定する（ステップＳ２
３）。適応フィルタ１２１で伝達関数Ｄ（ｚ）に従った
フィルタリング処理を行うことよって、スペクトル包絡
強調フィルタ１１２におけるスペクトルの傾きを補償す
る処理を行う。Next, the post filter 1 in the present embodiment.
The flow of processing in 03 will be described using the flowchart shown in FIG. First, the transfer function F (z) = A (z) / of the spectrum envelope emphasis filter 112 including a pole-zero filter.
Parameters of the zero-side transfer function A (z) and the pole-side transfer function B (z) in B (z) are acquired (step S2).
1). Next, in the parameter calculation units 123 and 23, Durb
Using the inverse algorithm of the in method, the parameter k _A of the first-order filter b (z) is changed from the parameter of _A (z) to B
From the parameters of (z), the parameters k _B of the primary filter a (z) are obtained by calculation (step S2
2), these a (z) and b (z) are set as parameters of D (z) as shown in Expression (9) (step S2).
3). The adaptive filter 121 performs a filtering process according to the transfer function D (z), thereby performing a process of compensating for the slope of the spectrum in the spectrum envelope emphasis filter 112.

【００５１】なお、第１および第２の実施例で述べた１
次極零型の適応フィルタ１２１の具体的構成は、例えば
図４や図５のシグナルフローで表すことができる。この
ように本実施例によれば、伝達関数Ａ（ｚ）からμ_p を
求め、伝達関数Ｂ（ｚ）からμ_z を求める構成とするこ
とによって、より次数の低いフィルタ係数で、すなわち
より少ない計算量でスペクトルの傾きを補償することが
できる。Incidentally, 1 described in the first and second embodiments
The specific configuration of the second-pole / zero-type adaptive filter 121 can be represented by, for example, the signal flow of FIGS. 4 and 5. According to this embodiment, transmitting seek mu _p from the function A (z), by adopting a configuration for determining the mu _z from the transfer function B (z), a lower filter coefficients of orders, i.e. less The slope of the spectrum can be compensated by the amount of calculation.

【００５２】次に、第３の実施例について述べる。第１
および第２の実施例では、スペクトル包絡強調フィルタ
１１１がもたらすスペクトルの傾きを補償することを主
目的に、極零それぞれ１次の予測に基づいて得られたパ
ラメータを用いて補償フィルタ１１３を構成する方法に
ついて述べた。Next, a third embodiment will be described. First
In the second and second embodiments, the compensation filter 113 is configured by using the parameters obtained based on the first-order prediction of poles and zeros, mainly for the purpose of compensating the slope of the spectrum provided by the spectrum envelope emphasis filter 111. Described how.

【００５３】第３の実施例では、さらに高次の予測に基
づく方法を用いると、スペクトルの傾き以外にスペクト
ルの凹凸まで補償することができるようになることにつ
いて説明する。本実施例は第１および第２の実施例にお
ける１次の予測を２次以上の予測とすることが特徴であ
り、外見的な構成は基本的に第１および第２の実施例を
示す図１と同様である。以下に、本実施例のように高次
の予測を導入した場合の効果について説明する。The third embodiment will explain that it is possible to compensate not only the slope of the spectrum but also the unevenness of the spectrum by using a method based on higher-order prediction. The present embodiment is characterized in that the primary prediction in the first and second embodiments is a secondary or higher prediction, and the external appearance is basically a diagram showing the first and second embodiments. The same as 1. The effect of introducing high-order prediction as in the present embodiment will be described below.

【００５４】もし、極零それぞれ２次の予測係数を用い
て補償フィルタ１１３を構成すると、スペクトル包絡強
調フィルタ１１１が持つスペクトル包絡の凹凸を強調す
る特性の一部を弱めることができるようになる。その理
由は、予測フィルタの性質に基づいている。すなわち、
強調が弱められるスペクトル包絡の部分は、通常のポス
トフィルタで最も強く強調される第１ホルマント付近の
周波数帯域となる。従って、２次の予測係数を用いて補
償フィルタ１１３を構成すると、通常のポストフィルタ
では強調されにくい他の周波数帯域のホルマントを優先
的に強調することができる効果がある。予測係数の次数
をさらに上げると、２次の予測係数を用いたときよりも
さらに狭い範囲の周波数で音声のスペクトル包絡の凹凸
を強調できるようになる。この方法を利用すると、従来
のポストフィルタでは強調が難しかった母音の高域のホ
ルマントを帯域フィルタを用いずに比較的簡単に強調す
ることができるようになる。If the compensation filter 113 is constructed by using second-order prediction coefficients for each pole and zero, it is possible to weaken a part of the characteristic of the spectrum envelope emphasizing filter 111 for emphasizing the unevenness of the spectrum envelope. The reason is based on the nature of the prediction filter. That is,
The part of the spectral envelope where the emphasis is weakened is the frequency band near the first formant, which is most strongly emphasized by the ordinary post filter. Therefore, when the compensating filter 113 is configured using the second-order prediction coefficient, there is an effect that the formants in other frequency bands that are difficult to be emphasized by the ordinary post filter can be emphasized preferentially. By further increasing the order of the prediction coefficient, it becomes possible to emphasize the unevenness of the spectrum envelope of the voice in a frequency in a narrower range than when the second-order prediction coefficient is used. By using this method, it becomes possible to relatively easily emphasize a high-frequency formant of a vowel, which is difficult to be emphasized by a conventional post filter, without using a bandpass filter.

【００５５】本実施例では、スペクトル包絡強調フィル
タの傾き補償だけでなく、ピッチ成分強調フィルタを使
用することによってもたらされる不要なスペクトルの傾
き（ピッチ傾きという）まで補償する、より高度なスペ
クトルの傾き補償法について述べる。ピッチ成分強調フ
ィルタは、図１のようにポストフィルタの中で使用され
る場合と音声信号再生部の中で使用される場合がある
が、ここでは音声信号再生部の中で合成フィルタの駆動
信号に対してピッチ成分強調フィルタを使用する例につ
いて述べる。In the present embodiment, not only the slope compensation of the spectrum envelope emphasis filter, but also a higher-order spectrum slope that compensates for unnecessary spectrum slope (called pitch slope) brought about by using the pitch component emphasis filter. The compensation method will be described. The pitch component emphasis filter may be used in the post filter as shown in FIG. 1 or may be used in the audio signal reproduction unit. Here, in the audio signal reproduction unit, the drive signal of the synthesis filter is used. An example of using a pitch component emphasis filter will be described.

【００５６】図６の波形ａは、現音声区間における合成
フィルタの駆動信号のスペクトルの形状とその傾き（図
では簡単のため直線で表している）を表した図である。
この波形ａに示すように、ピッチ周期性を持つ駆動信号
のスペクトルは、ピッチ周期に対応する周波数の整数倍
の周波数にスペクトルのピークがある調波構造を持つ。
理想的には合成フィルタの駆動信号のスペクトル包絡の
傾きは平坦であるが、実際の駆動信号のスペクトルを観
測すると平坦であると言えない区間が多い。この理由と
しては、スペクトル包絡の分析が正しく行われず、合成
フイルタが完全に音声のスペクトル包絡を表現しきれな
かったり、音声符号化装置において合成フィルタの符号
化ビット数の不足によるフイルタ特性の劣化などが考え
られる。The waveform a in FIG. 6 is a diagram showing the shape of the spectrum of the drive signal of the synthesis filter and its slope (represented by a straight line in the figure for simplicity) in the current voice section.
As shown in the waveform a, the spectrum of the drive signal having the pitch periodicity has a harmonic structure having a spectrum peak at a frequency that is an integral multiple of the frequency corresponding to the pitch period.
Ideally, the slope of the spectrum envelope of the drive signal of the synthesis filter is flat, but when the spectrum of the actual drive signal is observed, there are many sections that cannot be said to be flat. The reason for this is that the spectrum envelope is not correctly analyzed, the synthesis filter cannot completely express the spectrum envelope of the speech, and the deterioration of the filter characteristic due to the lack of the number of coding bits of the synthesis filter in the speech coder. Can be considered.

【００５７】ＣＥＬＰ (Code Excited Linear Predicti
on) 方式のような分析合成系の音声符号化装置では、こ
のような合成フイルタの特性の不足分を駆動信号の特性
で補うようになっている。このような場合には、本来平
坦であるような駆動信号のスペクトルが傾きや多少の凹
凸を持つことは明らかである。また、駆動信号が持つス
ペクトルの傾きは、各音声区間（例えばフレームまたは
サブフレーム）毎に異なる。CELP (Code Excited Linear Predicti
On the other hand, in the analysis / synthesis type speech coding apparatus such as the on) system, such a shortage of the characteristics of the synthesis filter is compensated by the characteristics of the drive signal. In such a case, it is obvious that the spectrum of the drive signal, which is originally flat, has an inclination or some unevenness. Further, the slope of the spectrum of the drive signal differs for each voice section (for example, frame or subframe).

【００５８】従来技術によるピッチ成分強調フイルタの
基本的な働きは、図６の波形ａ，ｂ，ｃを用いて説明す
ることができる。波形ｂは、ピッチ周期分だけ時間的に
離れた音声区間における合成フィルタの駆動信号のスペ
クトルの形状とその傾きの例を示したものである。ピッ
チ成分強調フイルタの処理は、ピッチ周期分だけ時間的
に離れた信号にピッチゲインβを乗じて現音声区間の信
号を加え合わせることで、波形ｃに示すようにピッチの
調波構造をより明瞭にする処理である。ピッチゲインβ
は、ピッチ周期分だけ離れた駆動信号の相関に応じて決
められる。The basic function of the pitch component emphasizing filter according to the prior art can be explained using the waveforms a, b and c of FIG. The waveform b shows an example of the shape of the spectrum of the drive signal of the synthesis filter and its inclination in the voice section temporally separated by the pitch period. The processing of the pitch component emphasis filter is performed by adding a signal of the current voice section by multiplying a signal temporally separated by the pitch period by the pitch gain β, thereby making the harmonic structure of the pitch clearer as shown in the waveform c. Is the process of Pitch gain β
Is determined according to the correlation of the drive signals separated by the pitch period.

【００５９】しかし、波形ａの駆動信号が持つスペクト
ルの傾き（ｚ変換領域で図のようにＱ（ｚ）と表現する
ことにする）は、これとピッチ周期分だけ時間的に離れ
ているためにスペクトルの傾きが異なっている波形ｂの
駆動信号を用いてピッチ成分を強調した結果、ピッチ成
分強調後の波形ｃの駆動信号のスペクトルの傾きは、Ｑ
（ｚ）からＱ′（ｚ）のように変質してしまっている。
すなわち、この例ではＱ（ｚ）は右上がりの傾きを示し
ているが、Ｑ′（ｚ）は右下がりの傾きになっている。
発明者らの実験によると、従来のピッチ成分強調処理は
雑音感を減らす効果はあるものの、このような駆動信号
のスペクトルの傾きの変化によって、音にこもり感を付
加したり、部分的に音韻の明瞭性を無くしてしまうこと
が判明した。特に、音声符号化／復号化処理により再生
された音声信号を再度符号化／復号化して再生するよう
なタンデム接続の条件では、上述の音のこもり感や音韻
の部分的な不明瞭が増幅される結果、極端な音質劣化と
して知覚されやすくなる。However, the slope of the spectrum of the drive signal of the waveform a (expressed as Q (z) in the z conversion region as shown in the figure) is distant from this by a pitch period. As a result of emphasizing the pitch component by using the drive signal of the waveform b having a different spectrum inclination, the spectrum inclination of the drive signal of the waveform c after the pitch component emphasis is Q
The quality has changed from (z) to Q '(z).
That is, in this example, Q (z) shows an upward slope but Q '(z) has a downward slope.
According to the experiments by the inventors, although the conventional pitch component enhancement processing has an effect of reducing the noise sensation, such a change in the slope of the spectrum of the drive signal adds a muffled feeling to the sound or partially phonologically It has been found that the clarity of is lost. In particular, under the condition of the tandem connection in which the audio signal reproduced by the audio encoding / decoding process is encoded / decoded again and reproduced, the above-mentioned muffled feeling of sound and the partial unclearness of the phoneme are amplified. As a result, it is likely to be perceived as extreme sound quality deterioration.

【００６０】この問題を解決するため、ピッチ成分強調
処理に、それに伴うスペクトルの傾き（または変形）を
補償する処理を導入することが本実施例の特徴である。
この補償は、従来のピッチ成分強調フィルタリングで得
られた波形ｃの駆動信号のスペクトルの傾きＱ′（ｚ）
を波形ｄに示すようにピッチの調波構造はそのままで傾
きだけを本来の傾きＱ（ｚ）に回復させることである。
こうすることで、ピッチ成分強調フィルタリングによっ
て生じる音のこもりや音韻の劣化の問題を大幅に改善す
ることができる。In order to solve this problem, the feature of the present embodiment is to introduce a process of compensating for the inclination (or deformation) of the spectrum accompanying the pitch component emphasis process.
This compensation is performed by the conventional method for enhancing the pitch component filtering. The inclination of the spectrum Q '(z) of the drive signal of the waveform c is obtained.
Is to recover only the inclination to the original inclination Q (z) while maintaining the pitch harmonic structure as shown by the waveform d.
By doing so, the problems of muffled sound and phoneme deterioration caused by the pitch component emphasis filtering can be significantly improved.

【００６１】すなわち、本実施例では、波形ｃのように
変形したスペクトルの傾き（または概形）Ｑ′（ｚ）を
本来のスペクトルの傾き（または概形）Ｑ（ｚ）に回復
させるために、ピッチ成分強調フィルタリング処理の前
または後に、Ｑ′（ｚ）の影響を除去して代わりにＱ
（ｚ）の特性を付加する処理か、またはＱ（ｚ）／Ｑ′
（ｚ）のフィルタリング処理を行う。この処理を実現す
るためには、少なくともＱ（ｚ）の特性を抽出すること
が必要となる。That is, in the present embodiment, in order to restore the slope (or outline) Q '(z) of the spectrum deformed like the waveform c to the original slope (or outline) Q (z) of the spectrum. , Before or after the pitch component emphasis filtering process, the effect of Q ′ (z) is removed and Q
A process of adding the characteristic of (z), or Q (z) / Q '
The filtering process of (z) is performed. In order to realize this processing, it is necessary to extract at least the characteristic of Q (z).

【００６２】図７は、このようなピッチ成分強調フィル
タリングに伴う駆動信号のスペクトルの傾き（ピッチ傾
き）を補償する機能を備えた本実施例に係る音声復号化
装置のブロック図である。この音声復号化装置は、音声
信号再生部１０２′およびポストフィルタ１０３′の構
成が図１と異なっている。音声信号再生部１０２′は、
駆動信号を合成フイルタに入力して音声信号を合成する
前に、ピッチ成分強調フィルタを用いて駆動信号のピッ
チ成分の強調を行う構成となっている。すなわち、本実
施例では、図１のポストフィルタ１０３内に設けられて
いたピッチ強調フィルタが音声信号再生部１０２′に内
蔵されており、ポストフィルタ１０３′は図１における
ポストフィルタ１０３内に設けられていたピッチ成分強
調フィルタ１１１が除去された構成となっている。FIG. 7 is a block diagram of a speech decoding apparatus according to this embodiment having a function of compensating for the inclination (pitch inclination) of the spectrum of the drive signal due to such pitch component emphasis filtering. In this voice decoding device, the configurations of a voice signal reproducing unit 102 'and a post filter 103' are different from those in FIG. The audio signal reproducing unit 102 'is
Before the drive signal is input to the synthesis filter to synthesize the audio signal, the pitch component enhancement filter is used to enhance the pitch component of the drive signal. That is, in this embodiment, the pitch enhancement filter provided in the post filter 103 of FIG. 1 is incorporated in the audio signal reproducing unit 102 ′, and the post filter 103 ′ is provided in the post filter 103 of FIG. The pitch component emphasizing filter 111 that has been used is removed.

【００６３】図８は、図７における音声信号再生部１０
２′の詳細な構成を示すブロック図である。この音声信
号再生部１０２′は、合成フィルタ情報生成部２０１、
駆動信号生成部２０２、第１の合成フィルタ２０３、ピ
ッチ成分強調フィルタ２０４、ピッチ傾き補償フィルタ
２０５、第１、第２のＬＰＣ分析部２０６、３７および
第２の合成フィルタ２０８から構成される。合成フィル
タ情報生成部２０１および駆動信号生成部２０２は、そ
れぞれ図７におけるパラメータ復号部１０１で復号され
たパラメータ情報から、合成フィルタ２０３，２０８の
フィルタ係数を決定する合成フィルタ情報および第１の
合成フィルタ２０３の駆動信号ｅ（ｎ）を生成する。FIG. 8 shows the audio signal reproducing section 10 in FIG.
It is a block diagram which shows the detailed structure of 2 '. The audio signal reproducing unit 102 'includes a synthesis filter information generating unit 201,
The drive signal generation unit 202, the first synthesis filter 203, the pitch component enhancement filter 204, the pitch inclination compensation filter 205, the first and second LPC analysis units 206 and 37, and the second synthesis filter 208 are included. The synthesis filter information generation unit 201 and the driving signal generation unit 202 determine the filter coefficients of the synthesis filters 203 and 208 from the parameter information decoded by the parameter decoding unit 101 in FIG. 7, respectively, and the first synthesis filter. The drive signal e (n) of 203 is generated.

【００６４】駆動信号生成部２０２で生成された駆動信
号ｅ（ｎ）は、第１の合成フィルタ２０３に入力される
と共に、ピッチ成分強調フィルタ２０４および第１のＬ
ＰＣ分析部２０６に入力される。ピッチ成分強調フィル
タ２０４によってピッチ成分が強調された駆動信号ｅｐ
（ｎ）は、ピッチ傾き補償フィルタ２０５および第２の
ＬＰＣ分析部２０７に入力される。第１、第２のＬＰＣ
分析部２０６，２０７では、ピッチ傾き補償フィルタ２
０５のフィルタ係数が生成される。ピッチ傾き補償フィ
ルタ２０５でピッチ傾き、すなわちピッチ成分強調フィ
ルタ２０４による駆動信号のスペクトルの傾きが補償さ
れた駆動信号は第２の合成フィルタ２０８に入力され、
音声信号が再生される。再生された音声信号は、さらに
ポストフィルタ１０３′内のスペクトル包絡強調フィル
タ１１２に入力される。また、合成フィルタ情報生成部
２０１で生成された合成フィルタ情報は、スペクトル包
絡強調フィルタ１１２の式（１）に示した伝達関数Ｆ
（ｚ）の決定に用いられる。さらに、第１の合成フィル
タ２０３の出力信号は、ポストフィルタ１０３′内のゲ
イン調整部１１４のゲイン決定に用いられる。The drive signal e (n) generated by the drive signal generation unit 202 is input to the first synthesis filter 203, and the pitch component enhancement filter 204 and the first L filter are added.
It is input to the PC analysis unit 206. The drive signal ep in which the pitch component is emphasized by the pitch component emphasis filter 204.
(N) is input to the pitch inclination compensation filter 205 and the second LPC analysis unit 207. First and second LPC
In the analysis units 206 and 207, the pitch tilt compensation filter 2
A filter coefficient of 05 is generated. The pitch gradient compensation filter 205 compensates the pitch gradient, that is, the drive gradient of the spectrum of the drive signal compensated by the pitch component enhancement filter 204 is input to the second synthesis filter 208.
The audio signal is reproduced. The reproduced audio signal is further input to the spectral envelope emphasis filter 112 in the post filter 103 '. Further, the synthesis filter information generated by the synthesis filter information generation unit 201 is the transfer function F shown in the equation (1) of the spectrum envelope emphasis filter 112.
Used to determine (z). Further, the output signal of the first synthesis filter 203 is used for the gain determination of the gain adjusting unit 114 in the post filter 103 '.

【００６５】次に、図８におけるピッチ成分強調フィル
タ２０４、ピッチ傾き補償フィルタ２０５および第１、
第２のＬＰＣ分析部２０６，２０７について、さらに詳
しく説明する。Next, the pitch component emphasizing filter 204, the pitch inclination compensating filter 205 and the first,
The second LPC analysis units 206 and 207 will be described in more detail.

【００６６】第１のＬＰＣ分析部２０６は、再生された
音声信号の所定区間、例えば１サブフレームまたは１フ
レーム区間の駆動信号ｅ（ｎ）についてＬ次の線形予測
分析を行い、Ｌ個の予測係数を求める。線形予測分析の
手法は周知であるため、ここでは詳細な説明を省略す
る。Ｌ＝１の場合の予測係数ρ₁ は、 ρ₁ ＝Σｅ（ｎ）ｅ（ｎ＋１）／Σｅ（ｎ）ｅ（ｎ）（１２）で求めることができる。Ｌ＝１のとき、図６で説明した
スペクトルの傾き特性Ｑ（ｚ）は、Ｑ（ｚ）＝１／（１−ｇ（ρ₁ ）ｚ^-1）（１３）と表すことができる。ここで、ｇ（）は予測係数を調節
するための関数である。−例として、ｇ（ρ₁ ）＝ηρ
₁ ，ηには０より大きく１以下の値を用いる。Ｌを２以
上にすると、ｅ（ｎ）のより詳細なスペクトルの概形を
Ｑ（ｚ）で表現できる効果がある。このときＱ（ｚ）
は、Ｑ（ｚ）＝１／（１−ρ₁ ｚ^-1−ρ₂ ｚ^-2−…−ρ_L ｚ^-L）と表すことができる。ここでρ₁ ，ρ₂ ，…，ρ_L はＬ
次の線形予測分析から得られるＬ個の予測係数を表す。The first LPC analysis section 206 performs an L-th order linear prediction analysis on the drive signal e (n) of a predetermined section of the reproduced audio signal, for example, one subframe or one frame section, and predicts L predictions. Find the coefficient. Since the method of linear prediction analysis is well known, detailed description is omitted here. Prediction coefficient [rho ₁ in the case of L = 1 can be obtained by _{ρ 1 = Σe (n) e} (n + 1) / Σe (n) e (n) (12). When L = 1, the slope characteristic Q (z) of the spectrum described in FIG. 6 can be expressed as Q (z) = 1 / (1-g (ρ ₁ ) z ⁻¹ ) (13). Here, g () is a function for adjusting the prediction coefficient. As an example, g (ρ ₁ ) = ηρ
_{For 1} and η, values greater than 0 and less than 1 are used. When L is 2 or more, there is an effect that a more detailed outline of the spectrum of e (n) can be expressed by Q (z). At this time Q (z)
Can be expressed as Q (z) = 1 / (1-ρ ₁ z ⁻¹ −ρ ₂ z ⁻² −... −ρ _L z ^−L ). Where ρ ₁ , ρ ₂ , ..., ρ _L are L
It represents L prediction coefficients obtained from the following linear prediction analysis.

【００６７】ピッチ成分強調フィルタ２０４は、駆動信
号ｅ（ｎ）を入力してピッチ強調された駆動信号ｅｐ
（ｎ）を出力する。このピッチ強調フィルタリングの方
法としては、例えばｅｐ（ｎ）＝ｅ（ｎ）＋βｅ（ｎ−Ｔ），ｎ＝０，１，…，Ｎ−１（１４）を用いることができる。ここで、Ｔはピッチ周期、Ｎは
ピッチ強調に用いる区間の長さ、βはピッチゲインを表
す。βの値はピッチ分析により得られた値に基づいて決
めることができ、通常０＜β＜０．７程度の値が用いら
れる。また、別の方法としてピッチ周期の有無の程度に
応じて、予め用意した固定の値をβに用いる方法も有効
である。−例として、ピッチ周期が無いときはβ＝０、
ピッチ周期性が比較的強いときはβ＝０．６のようにβ
の値を決める。The pitch component emphasizing filter 204 receives the drive signal e (n) and pitch-enhances the drive signal ep.
(N) is output. As a method of this pitch emphasis filtering, for example, ep (n) = e (n) + βe (n−T), n = 0, 1, ..., N−1 (14) can be used. Here, T is the pitch period, N is the length of the section used for pitch enhancement, and β is the pitch gain. The value of β can be determined based on the value obtained by the pitch analysis, and a value of 0 <β <0.7 is usually used. Further, as another method, a method of using a fixed value prepared in advance for β depending on the presence or absence of the pitch period is also effective. -For example, β = 0 when there is no pitch period,
When the pitch periodicity is relatively strong, β = 0.6
Determine the value of.

【００６８】第２のＬＰＣ分析部２０７では、ピッチ強
調された駆動信号ｅｐ（ｎ）についてＭ次の線形予測分
析を行い、Ｍ個の予測係数を求める。Ｍ＝１の場合の予
測係数ρ₁ ′は、 ρ₁ ′＝Σｅｐ（ｎ）ｅｐ（ｎ＋１）／Σｅｐ（ｎ）ｅｐ（ｎ）（１５）で求めることができる。The second LPC analysis section 207 performs M-th order linear prediction analysis on the pitch-enhanced drive signal ep (n) to obtain M prediction coefficients. The prediction coefficient ρ ₁ ′ when M = 1 can be obtained by ρ ₁ ′ = Σep (n) ep (n + 1) / Σep (n) ep (n) (15).

【００６９】Ｍ＝１のとき、図６で説明したスペクトル
の傾き特性Ｑ′（ｚ）はＱ′（ｚ）＝１／（１−ｆ（ρ₁ ′）ｚ^-1）（１６）と表すことができる。ここで、ｆ（）は予測係数を調節
するための関数である。−例としては、ｆ（ρ₁ ′）＝
η′ρ₁ ′、η′には０より大きく１以下の値を用い
る。Ｍを２以上にすると、ｅｐ（ｎ）のより詳細なスペ
クトルの概形をＱ′（ｚ）で表現できる効果がある。こ
のときＱ′（ｚ）はＱ′（ｚ）＝１／（１−ρ₁ ′ｚ^-1−ρ₂ ′ｚ^-2−…ρ_M ′ｚ^-M）（１７）と表すことができる。ここで、ρ₁ ’，ρ₂ ’，…，ρ
_M ’はＭ次の線形予測分析から得られるＭ個の予測係数
を表すことができる。When M = 1, the slope characteristic Q '(z) of the spectrum explained in FIG. 6 is expressed as Q' (z) = 1 / (1-f (ρ ₁ ′) z ⁻¹ ) (16). be able to. Here, f () is a function for adjusting the prediction coefficient. -As an example, f (ρ ₁ ′) =
For η′ρ ₁ ′ and η ′, values larger than 0 and 1 or less are used. When M is 2 or more, there is an effect that a more detailed spectrum outline of ep (n) can be expressed by Q '(z). At this time, Q ′ (z) can be expressed as Q ′ (z) = 1 / (1-ρ ₁ ′ z ⁻¹ −ρ ₂ ′ z ⁻² −... ρ _M ′ z ^−M ) (17). Where ρ ₁ ', ρ ₂ ', ..., ρ
_M ′ can represent M prediction coefficients obtained from an M-th order linear prediction analysis.

【００７０】ピッチ傾き補償フィルタ２０５は、ＬＰＣ
分析部２０６，２０７からの予測係数を基にＱ′（ｚ）
とＱ（ｚ）を用いてＱ（ｚ）／Ｑ′（ｚ）の特性を入力
されるピッチ成分強調後の駆動信号ｅｐ（ｎ）に与える
フィルタリング処理を行い、ピッチ傾きが補償された信
号ｅｑ（ｎ）を第２の合成フィルタ２０８に与える。Ｌ
＝１，Ｍ＝１の場合、式（１３），（１６）を用いればＱ（ｚ）／Ｑ′（ｚ）＝（１−ｆ（ρ₁ ’）ｚ^-1）／（１−ｇ（ρ₁ ）ｚ^-1）（１８）となる。また、ηとη′を用いた場合で、η＝η′＝１
のときにはＱ（ｚ）／Ｑ′（ｚ）＝（１−ρ₁ ′ｚ^-1）／（１−ρ₁ ｚ^-1）（１９）と表すことができる。The pitch inclination compensation filter 205 is an LPC.
Q '(z) based on the prediction coefficients from the analysis units 206 and 207.
And Q (z) are used to perform a filtering process to give the characteristic of Q (z) / Q '(z) to the input drive signal ep (n) after the pitch component emphasis, and a signal eq in which the pitch inclination is compensated is performed. (N) is given to the second synthesis filter 208. L
= 1 and M = 1, using equations (13) and (16), Q (z) / Q ′ (z) = (1-f (ρ ₁ ′) z ⁻¹ ) / (1-g ( ρ ₁ ) z ⁻¹ ) (18). When η and η ′ are used, η = η ′ = 1
Then, Q (z) / Q ′ (z) = (1-ρ ₁ ′ z ⁻¹ ) / (1-ρ ₁ z ⁻¹ ) (19) can be expressed.

【００７１】図９は、図６に示したスペクトルの傾き補
正の原理図をＬ＝１，Ｍ＝１，η＝η′＝１の場合につ
いてＱ（ｚ）とＱ′（ｚ）をより具体的な形で示した図
である。FIG. 9 is a more detailed view of Q (z) and Q '(z) for the case where L = 1, M = 1, and η = η' = 1 in the principle diagram of the spectrum tilt correction shown in FIG. It is the figure shown in a typical form.

【００７２】図８に戻り、音声信号再生部１０２′の説
明を続ける。ピッチ傾き補償後の駆動信号ｅｑ（ｎ）を
第２の合成フィルタ２０８に与える際に、ｅｑ（ｎ）の
パワをｅ（ｎ）のパワと同程度になるように調整したも
のを改めてｅｑ（ｎ）として合成フィルタ２０８に与え
る方法も有効である。第２の合成フィルタ２０８は、ピ
ッチ傾きつまりピッチ成分の強調によるスペクトル傾き
が補正された駆動信号ｅｑ（ｎ）を合成し、ピッチ成分
が強調された音声信号を再生する。再生された音声信号
はポストフィルタ１０３′に与えられる。音声信号再生
部１０２′からポストフィルタ１０３′のゲイン調整部
１１４にパワ情報を与えるために、ここでは駆動信号生
成部２０２で生成された駆動信号ｅ（ｎ）を第１の合成
フイルタ３３に入力して、ピッチ強調が行われていない
音声信号を求める。なお、上述したようなパワの調整が
なされた駆動信号ｅｑ（ｎ）を用いる場合は、第１の合
成フィルタ２０３を用いずに、第２の合成フイルタ３８
の出力であるピッチ成分が強調された音声信号をゲイン
調整部４１１に与える方法も有効である。Returning to FIG. 8, the description of the audio signal reproducing unit 102 'will be continued. When the drive signal eq (n) after the pitch inclination compensation is given to the second synthesis filter 208, the one adjusted so that the power of eq (n) is about the same as the power of eq (n) is newly added. The method of giving it to the synthesis filter 208 as n) is also effective. The second synthesis filter 208 synthesizes the drive signal eq (n) in which the pitch inclination, that is, the spectrum inclination due to the emphasis of the pitch component is corrected, and reproduces the audio signal in which the pitch component is emphasized. The reproduced audio signal is given to the post filter 103 '. In order to give power information from the audio signal reproducing unit 102 'to the gain adjusting unit 114 of the post filter 103', here, the drive signal e (n) generated by the drive signal generating unit 202 is input to the first synthesis filter 33. Then, an audio signal in which pitch enhancement is not performed is obtained. When the drive signal eq (n) whose power has been adjusted as described above is used, the second synthesis filter 38 is used without using the first synthesis filter 203.
It is also effective to provide the gain adjusting section 411 with an audio signal in which the pitch component, which is the output of, is emphasized.

【００７３】次に、本実施例における処理の流れを図１
０のフローチャートを用いて説明する。まず、駆動信号
生成部２０２によって第１の合成フィルタ２０３の駆動
信号ｅ（ｎ）を生成し（ステップＳ３１）、この駆動信
号ｅ（ｎ）について第１のＬＰＣ分析部２０６で１次の
自己相関係数ρ₁ を求める（ステップＳ３２）。また、
駆動信号ｅ（ｎ）をピッチ成分強調フィルタ２０４に入
力してピッチ成分が強調された駆動信号ｅｐ（ｎ）を求
め（ステップＳ３３）、この駆動信号ｅｐ（ｎ）につい
て第２のＬＰＣ分析部２０７で１次の自己相関係数ρ′
₁ を求める（ステップＳ３４）。これらの自己相関係数
ρ₁ とρ′₁ を用いてピッチ傾き補償フィルタ２０５に
よってピッチ傾き、すなわちピッチ成分が強調された駆
動信号ｅｐ（ｎ）のスペクトルの傾きを補償する（ステ
ップＳ３５）。次に、ピッチ傾きが補償された駆動信号
ｅｑ（ｎ）を第２の合成フィルタ２０８に入力して合成
フィルタリングを行うことにより、音声信号を再生す
る。以上のステップＳ３１〜Ｓ３５までが音声信号再生
部１０２′の処理である。Next, the flow of processing in this embodiment is shown in FIG.
This will be described using the flowchart of 0. First, the drive signal generation unit 202 generates the drive signal e (n) of the first synthesis filter 203 (step S31), and the drive signal e (n) is subjected to the primary self-phase by the first LPC analysis unit 206. The relation number ρ ₁ is obtained (step S32). Also,
The drive signal e (n) is input to the pitch component emphasis filter 204 to obtain the drive signal ep (n) in which the pitch component is emphasized (step S33), and the second LPC analysis unit 207 for this drive signal ep (n). And the first-order autocorrelation coefficient ρ ′
₁ is obtained (step S34). Using these autocorrelation coefficients ρ ₁ and ρ ′ ₁ , the pitch tilt compensation filter 205 compensates the pitch tilt, that is, the tilt of the spectrum of the drive signal ep (n) in which the pitch component is emphasized (step S35). Next, the drive signal eq (n) in which the pitch inclination is compensated is input to the second synthesis filter 208 to perform synthesis filtering, thereby reproducing the audio signal. The above steps S31 to S35 are the processing of the audio signal reproducing unit 102 '.

【００７４】次に、上記のようにして音声信号再生部１
０２′で再生された音声信号をポストフィルタ１０３′
に入力して、先の実施例と同様にまずスペクトル包絡強
調フィルタ１１２によってスペクトル包絡強調フィルタ
リングを行い（ステップＳ３７）、引き続き補償フィル
タ１１３によってスペクトル包絡強調フィルタリングに
よって生じるスペクトルの傾きを補正する（ステップＳ
３８）。最後に、ゲイン調整部１１４によってポストフ
ィルタ１０３′による処理後の音声信号が処理前の音声
信号と同程度のパワとなるように滑らかにゲインを調整
し、最終的な音声信号を出力する（ステップＳ３９）。Next, the audio signal reproducing section 1 is operated as described above.
The audio signal reproduced in 02 'is post-filtered 103'
In the same manner as in the previous embodiment, the spectrum envelope emphasizing filter 112 first performs the spectrum envelope emphasizing filtering (step S37), and the compensation filter 113 subsequently corrects the slope of the spectrum generated by the spectrum envelope emphasizing filtering (step S37).
38). Finally, the gain adjusting unit 114 smoothly adjusts the gain so that the audio signal after processing by the post filter 103 'has a power equivalent to that of the audio signal before processing, and outputs the final audio signal (step S39).

【００７５】第４の実施例の別の実現方法として、現区
間のピッチ強調前の駆動信号のスぺクトルの傾き（また
は概形）Ｑ（ｚ）を抽出しておき、ピッチ強調に用いる
信号に含まれるスペクトルの傾きを平坦化してからピッ
チ成分の強調フイルタリングを行い、ピッチ成分強調後
の駆動信号にＱ（ｚ）の特性を与える処理による実現も
可能である。また、ピッチ傾き補償をより安定的に調整
する方法として、Ｑ（ｚ）の代わりにＱ（ｚ／γ）、
Ｑ′（ｚ）の代わりにＱ′（ｚ／γ′）をそれぞれ用い
ることもできる。ここで、γ，γ′は、例えば０＜γ＜
１、０＜γ′＜１に設定できる。As another method of realizing the fourth embodiment, the slope (or approximate shape) Q (z) of the spectrum of the drive signal before pitch enhancement in the current section is extracted and the signal used for pitch enhancement is extracted. It is also possible to realize the processing by giving the characteristic of Q (z) to the drive signal after the pitch component emphasis by filtering the pitch component emphasis after flattening the slope of the spectrum included in the above. Further, as a method for more stably adjusting the pitch tilt compensation, Q (z / γ) instead of Q (z),
Q ′ (z / γ ′) may be used instead of Q ′ (z). Here, γ and γ ′ are, for example, 0 <γ <
It can be set to 1, 0 <γ ′ <1.

【００７６】次に、第５の実施例について説明する。本
実施例は、第２の実施例で説明した伝達関数Ｄ（ｚ）の
適応フィルタをさらに改良した伝達関数Ｔｐｚ（ｚ）の
適応フィルタを用いてスペクトルの傾き補償処理を行う
例であり、特に子音区間の明瞭さが改善され、音の歯切
れが良くなる効果を有するものである。Next, a fifth embodiment will be described. The present embodiment is an example of performing the slope compensation processing of the spectrum by using the adaptive filter of the transfer function Tpz (z) which is a further improvement of the adaptive filter of the transfer function D (z) described in the second embodiment. This has the effect of improving the clarity of the consonant section and improving the crispness of the sound.

【００７７】図１１は、本実施例に係るポストフィルタ
を音声復号化装置の最終段に適用した実施例を示し、図
１と同じ機能を有するブロックについて同じ参照番号を
付してある。すなわち、入力端子１００に入力された送
信側の音声符号化装置から伝送された符号化データ（パ
ラメータ化された音声圧縮情報）からパラメータ復号部
１０１および音声信号再生部１０２を介して再生音声信
号Ｓ（ｎ）が再生され、この再生音声信号がポストフィ
ルタ１０３を通過させて最終的な出力音声信号Ｓｏ
（ｎ）を生成する。以下、本実施例におけるポストフィ
ルタ１０３について詳細に説明する。FIG. 11 shows an embodiment in which the post filter according to the present embodiment is applied to the final stage of a speech decoding apparatus, and blocks having the same functions as in FIG. 1 are assigned the same reference numerals. That is, the reproduced audio signal S is input from the encoded data (parameterized audio compression information) transmitted from the audio encoding device on the transmitting side input to the input terminal 100 via the parameter decoding unit 101 and the audio signal reproducing unit 102. (N) is reproduced, this reproduced audio signal is passed through the post filter 103, and the final output audio signal So is obtained.
(N) is generated. Hereinafter, the post filter 103 in this embodiment will be described in detail.

【００７８】ポストフィルタ１０３は、ピッチ成分強調
フィルタ１１１、スペクトル包絡強調フィルタ１１２、
補償フィルタ１１３およびゲイン調整部１１４から構成
され、これらの各要素は次のように構成される。The post filter 103 includes a pitch component emphasis filter 111, a spectrum envelope emphasis filter 112,
The compensating filter 113 and the gain adjusting unit 114 are included, and each of these elements is configured as follows.

【００７９】スペクトル包絡強調フィルタ１１２は、そ
の伝達関数がＦ（ｚ）＝Ａ（ｚ）／Ｂ（ｚ）の構成で表
されることは前述した通りであるが、ここではスペクト
ル包絡強調フィルタ１１２の中で行われる処理がより明
確になるように、さらに詳細な処理ブロックに分けて説
明することにする。As described above, the spectral envelope enhancement filter 112 has a transfer function represented by the configuration of F (z) = A (z) / B (z), but here, the spectral envelope enhancement filter 112. In order to clarify the processing performed in the above, the processing will be divided into more detailed processing blocks.

【００８０】音声信号再生部１０２から入力される１０
個のＬＰＣ係数（ここでは、１０次のＬＰＣ係数が使用
される）が、Ａ（ｚ）のパラメータ計算部２２００とＢ
（ｚ）のパラメータ計算部２２０１に入力され、これら
の各パラメータ計算部２２００，２２０１はＡ（ｚ）の
パラメータａｗｉ（ｉ＝１〜１０）と、Ｂ（ｚ）のパラ
メータｂｗｉ（ｉ＝１〜１０）を計算して出力する。10 input from the audio signal reproducing unit 102
LPC coefficients (here, a 10th-order LPC coefficient is used) are A (z) parameter calculation units 2200 and B.
(Z) is input to the parameter calculation unit 2201, and each of these parameter calculation units 2200 and 2201 has a parameter Awi (i = 1 to 10) of A (z) and a parameter bwi (i = 1 to 1) of B (z). 10) is calculated and output.

【００８１】一方、ポストフィルタ１０３に入力される
信号はピッチ強調フィルタ１１１によりピッチ周期の繰
り返しを強調する処理が施された後、スペクトル包絡強
調特性のうちＡ（ｚ）の伝達関数を有する零型フィルタ
２２０２によるフィルタリング処理を経て、１／Ｂ
（ｚ）なる伝達関数を有する極型フィルタ２２０３でフ
ィルタリングされる。On the other hand, the signal input to the post filter 103 is subjected to the process of emphasizing the repetition of the pitch period by the pitch emphasizing filter 111, and then the zero type having the transfer function of A (z) in the spectrum envelope emphasizing characteristics. 1 / B after filtering by the filter 2202
It is filtered by the polar filter 2203 having a transfer function of (z).

【００８２】こうしてスペクトル包絡強調フィルタ１１
２によりスペクトル包絡が強調された音声信号は、さら
に補償フィルタ１１３において不要なスペクトルの傾き
が補償される。補償フィルタ１１３において、具体的な
フィルタリング処理を行う適応フィルタ２１２１の伝達
関数Ｔｐｚ（ｚ）は、Ｔｐｚ（ｚ）＝（１−μ_zeroｚ^-1）／（１−μ_poleｚ^-1）（２０）で表される。すなわち、この適応フィルタ２１２１もｚ
変換領域の伝達関数が（１−μ_z ｚ^-1）／（１−μ_p ｚ^-1）（但し、μ_z ，μ_p は絶対値が１より小さい値を持つ互
いに独立したフィルタ係数）で表される１次の極零型と
なっていることは先の実施例と同様である。Thus, the spectrum envelope emphasis filter 11
In the audio signal whose spectrum envelope is emphasized by 2, the unnecessary slope of the spectrum is further compensated by the compensation filter 113. In the compensation filter 113, the transfer function Tpz (z) of the adaptive filter 2121 that performs a specific filtering process is as follows: Tpz (z) = (1-μ _zero z ⁻¹ ) / (1-μ _pole z ⁻¹ ) (20 ) Is represented by. That is, this adaptive filter 2121 is also z
The transfer function of the conversion domain is (1-μ _z z ^-1 ) / (1-μ _p z ^-1 ) (where μ _z and μ _p are mutually independent filter coefficients with absolute values smaller than 1) The first-order pole-zero type shown is the same as in the previous embodiment.

【００８３】この適応フィルタ２１２１によるフィルタ
リング処理に際しては、まず適応フィルタ２１２１の特
性を決める２つのフィルタ係数μ_zero，μ_poleを予め求
めなければならないが、これらのフィルタ係数μ_zero，
μ_poleは以下のようにそれぞれμ_zero計算部２１２４，
μ_pole計算部２１２３において別々に求められる。In the filtering process by the adaptive filter 2121, first, the two filter coefficients μ _zero and μ _pole which determine the characteristics of the adaptive filter 2121 must be obtained in advance. These filter coefficients μ _zero ,
Each mu _pole is as follows mu _zero calculator 2124,
It is separately calculated by the μ _pole calculation unit 2123.

【００８４】μ_pole計算部２１２３は、Ａ（ｚ）のパラ
メータ計算部２２００の出力であるＡ（ｚ）のパラメー
タを入力し、これらのパラメータから後述する自己相関
係数ｒ_1zero を求め、 μ_pole′＝Ｃ₀ ｒ_1zero （ｒ_1zero ＜Ｔｈ） μ_pole′＝Ｃ₁ ｒ_1zero （ｒ_1zero ≧Ｔｈ）（２１） μ_pole ＝Ｃ₂ μ_pole′＋（１−Ｃ₂ ）last＿μ_pole （２２）によりμpoleを計算する。The μ _pole calculation unit 2123 inputs the parameters of A (z) which are the output of the parameter calculation unit 2200 of A (z), _{obtains an} autocorrelation coefficient r _1zero described later from these parameters, and μ _pole ′ = C ₀ r _1zero (r _1zero <Th) μ _pole ′ = C ₁ r _1zero (r _1zero ≧ Th) (21) μ _pole = C ₂ μ _pole ′ + (1-C ₂ ) last_μ _pole (22) Calculate μpole.

【００８５】ここで、重み係数Ｃ₀ ，Ｃ₁ ，Ｃ₂ および
しきい値Ｔｈは調整値であり、０＜Ｃ₁ ＜Ｃ₀ ≦１、０
＜Ｃ₂ ≦１、またＴｈは０に近い値である。last＿μ
_poleは直前の音声区間（例えば前サブフレーム）のμ
_poleを表す。ｒ_1zero はスペクトル包絡強調フィルタ１
１２のうち、分子側の伝達関数Ａ（ｚ）を有する零型フ
ィルタ２２０２のフィルタ係数ａｗ1 〜ａｗ10を用いて
計算される１次の自己相関係数（１次のＰＡＲＣＯＲ係
数に等しい）である。ｒ_1zero の値は、１／Ａ（ｚ）の
インパルス応答系列を用いて１サンプル時間をずらした
ときの自己相関値として求めることができるが、より効
率の良い方法として、上述のＤｕｒｂｉｎの再帰アルゴ
リズム（またはＬｅｖｉｎｓｏｎの再帰アルゴリズム、
あるいはＬｅｖｉｎｓｏｎ−Ｄｕｒｂｉｎアルゴリズ
ム）を逆に用いることにより、実際にインパルス応答を
計算すること無しに、少ない計算量で１次の自己相関係
数を求めることができる。Here, the weighting factors C ₀ , C ₁ , C ₂ and the threshold value Th are adjustment values, and 0 <C ₁ <C ₀ ≦ 1, 0
<C ₂ ≦ 1, and Th is a value close to 0. last_μ
_pole is the μ of the immediately preceding speech section (for example, the previous subframe)
Represents a _pole . r _1zero is a spectral envelope enhancement filter 1
Among them, the first-order autocorrelation coefficient (equal to the first-order PARCOR coefficient) calculated by using the filter coefficients aw1 to aw10 of the zero-type filter 2202 having the transfer function A (z) on the numerator side. The value of r _1zero can be obtained as an autocorrelation value when one sample time is shifted using an impulse response sequence of 1 / A (z), but as a more efficient method, the above-mentioned Durbin recursive algorithm is used. (Or Levinson's recursive algorithm,
Alternatively, by using the Levinson-Durbin algorithm) in reverse, the primary autocorrelation coefficient can be obtained with a small amount of calculation without actually calculating the impulse response.

【００８６】一方、μ_zero計算部２１２４は、Ｂ（ｚ）
のパラメータ計算部２２０１の出力であるＢ（ｚ）のパ
ラメータを入力し、これらから自己相関係数ｒ_1pole を
求める。係数μ_zeroは、 μ_zero＝Ｃ₃ ｒ_1pole （２３）により計算する。On the other hand, the μ _zero calculation unit 2124 uses B (z)
The parameter of B (z) which is the output of the parameter calculation unit 2201 of 1 is input, and the autocorrelation coefficient r _1pole is _obtained from them. The coefficient μ _zero is calculated by μ _zero = C ₃ r _1pole (23).

【００８７】ここで、Ｃ₃ は重み係数の調整値であり、
０＜Ｃ₃ ＜１であることが望ましい。ｒ_1pole は、スペ
クトル包絡強調フィルタ１１２のうち分母側の伝達関数
Ｂ（ｚ）を有する極型フィルタの係数ｂｗ1 〜ｂｗ10を
用いて計算される１次の自己相関係数（１次のＰＡＲＣ
ＯＲ係数に等しい）である。ｒ_1pole の値は、１／Ｂ
（ｚ）のインパルス応答系列を用いて１サンプル時間を
ずらしたときの自己相関値として求めることができる
が、より効率の良い方法として、上述のＤｕｒｂｉｎの
再帰アルゴリズム（またはＬｅｖｉｎｓｏｎの再帰アル
ゴリズム、あるいはＬｅｖｉｎｓｏｎ−Ｄｕｒｂｉｎア
ルゴリズム）を逆に用いることにより、実際にインパル
ス応答を計算すること無しに、少ない計算量でｒ_1pole
の値を求めることができる。Here, C ₃ is the adjustment value of the weighting coefficient,
It is desirable that 0 <C ₃ <1. r _1pole is a first-order autocorrelation coefficient (first-order PARC) calculated using the coefficients bw1 to bw10 of the polar filter having the transfer function B (z) on the denominator side of the spectral envelope emphasis filter 112.
Is equal to the OR coefficient). The value of r _1pole is 1 / B
It can be obtained as an autocorrelation value when one sample time is shifted using the impulse response sequence of (z), but as a more efficient method, the above-mentioned Durbin's recursive algorithm (or Levinson's recursive algorithm, or Levinson's) is used. -Durbin algorithm) in reverse, the r _1pole can be calculated with a small amount of calculation without actually calculating the impulse response.
The value of can be obtained.

【００８８】発明者らの実験では、上記の各調整値をＣ
₀ ＝０．９、Ｃ₁ ＝０．４、Ｃ₂ ＝０．７、Ｔｈ＝０．
０、Ｃ₃ ＝０．７という値にした場合に音質の改善が顕
著であることが分かった。これらの値を代入して（２
１），（２２），（２３）の各式を書き表すと、 μ_pole′＝０．９ｒ_1zero （ｒ_1zero ＜０．０） μ_pole′＝０．４ｒ_1zero （ｒ_1zero ≧０．０）（２４） μ_pole ＝０．７μ_pole′＋０．３last＿μ_pole （２５） μ_zero ＝０．７ｒ_1pole （２６）となる。In the experiments conducted by the inventors, the above adjustment values were set to C
₀ = 0.9, C ₁ = 0.4, C ₂ = 0.7, Th = 0.
It was found that the sound quality was significantly improved when the values of 0 and C ₃ = 0.7 were set. Substituting these values ((2
1), (22) and (23) are written, μ _pole ′ = 0.9r _1zero (r _1zero <0.0) μ _pole ′ = 0.4r _1zero (r _1zero ≧ 0.0) ( a _{_{24) μ pole = 0.7μ pole '}} + 0.3last_μ pole (25) μ zero = 0.7r 1pole (26).

【００８９】適応フィルタ２１２１は、以上のように計
算された係数を用いて極１次・零１次のＴｐｚ（ｚ）な
る伝達関数の適応フィルタを構成し、スペクトル包絡が
強調された音声信号を入力としてフィルタリングを行
う。The adaptive filter 2121 constitutes an adaptive filter of a pole first-order / zero-first-order transfer function Tpz (z) using the coefficients calculated as described above, and outputs a speech signal in which the spectrum envelope is emphasized. Filter as input.

【００９０】最後に、ゲイン調整部１１４によってポス
トフィルタ１０３による処理後の出力音声信号が処理前
の入力音声信号と同程度のパワとなるように、滑らかに
ゲインを調整してポストフィルタ１０３の出力音声信号
とする。Finally, the gain adjusting unit 114 smoothly adjusts the gain so that the output audio signal after being processed by the post filter 103 has the same power as the input audio signal before being processed, and the output of the post filter 103 is output. Audio signal.

【００９１】次に、本実施例におけるポストフィルタ１
０３での処理の流れを図１２のフローチャートを用いて
説明する。まず、スペクトル包絡強調フィルタＦ（ｚ）
（＝Ａ（ｚ）／Ｂ（ｚ））を構成するフィルタＡ（ｚ）
のパラメータａｗｉ（ｉ＝１〜１０）とＢ（ｚ）のパラ
メータｂｗｉ（ｉ＝１〜１０）を取得する（ステップＳ
５１）。ステップＳ５１の具体的な方法の一例は、音声
信号再生部１０２からの現音声区間でのＬＰＣ係数αｉ
（ｉ＝１〜１０）を用いてａｗｉ＝（γ1 ）ⁱ α_i （ｉ＝１〜１０）（２７）ｂｗｉ＝（γ2 ）ⁱ α_i （ｉ＝１〜１０）（２８）により計算することである。このとき、Ａ（ｚ）とＢ
（ｚ）はＡ（ｚ）＝１＋Σａｗｉｚ^-i （ｉ＝１〜１０）（２９）Ｂ（ｚ）＝１＋Σｂｗｉｚ^-i （ｉ＝１〜１０）（３０）と表すことができる。また、ＬＰＣ係数の正負の定義が
異なる場合には、式（２９），（３０）はＡ（ｚ）＝１−Σａｗｉｚ^-i （ｉ＝１〜１０）（２９′）Ｂ（ｚ）＝１−Σｂｗｉｚ^-i （ｉ＝１〜１０）（３０′）と表される。γ1 とγ2 はスペクトルの強調の程度を調
整するパラメータで、通常０＜γ1 ＜γ2 ＜１の関係が
ある値が用いられる。Next, the post filter 1 in the present embodiment.
The flow of processing in 03 will be described using the flowchart in FIG. First, the spectrum envelope emphasis filter F (z)
Filter A (z) that constitutes (= A (z) / B (z))
Parameters awi (i = 1 to 10) of B and parameters bwi (i = 1 to 10) of B (z) are acquired (step S).
51). An example of a specific method of step S51 is an LPC coefficient αi in the current voice section from the voice signal reproducing unit 102.
Using (i = 1 to 10) awi = (γ1) ⁱ α _i (i = 1 to 10) (27) bwi = (γ2) ⁱ α _i (i = 1 to 10) (28) Is. At this time, A (z) and B
(Z) can be expressed as A (z) = 1 + Σawiz ⁻ⁱ (i = 1 to 10) (29) B (z) = 1 + Σbwiz ⁻ⁱ (i = 1 to 10) (30). When the positive and negative definitions of the LPC coefficient are different, equations (29) and (30) are: A (z) = 1−Σawiz ⁻ⁱ (i = 1 to 10) (29 ′) B (z) = 1 -Σbwiz ⁻ⁱ (i = 1 to 10) (30 ′). γ1 and γ2 are parameters for adjusting the degree of emphasis of the spectrum, and values having a relation of 0 <γ1 <γ2 <1 are usually used.

【００９２】次に、入力の音声信号に対してピッチ強調
のためのフィルタリング（ステップＳ５２）と、スペク
トル包絡強調のためのフィルタリング（ステップＳ５
３）を行う。Next, filtering for pitch enhancement of the input voice signal (step S52) and filtering for spectral envelope enhancement (step S5).
Perform 3).

【００９３】次に、本実施例の特徴をなすＴｐｚ（ｚ）
なる伝達関数の適応フィルタを用いて以下のようにスペ
クトルの傾き補正を行う。まず、Ａ（ｚ）のパラメータ
ａｗｉ（ｉ＝１〜１０）から自己相関係数ｒ１_zeroを求
め（ステップＳ５４）、次いでｒ１_zeroとしきい値Ｔｈ
との大小比較を行い（ステップＳ５５）、ｒ１_zeroがＴ
ｈより小さければＣ₀ 倍したものをμ_pole′とし（ステ
ップＳ５６）、Ｔｈ以上ならばｒ１zeroをＣ₁ 倍したも
のをμ_pole′とする（ステップＳ５７）。μ_pole′と前
のμ_poleに相当するlast＿μ_poleとをＣ₂ を用いて補間
した値を現音声区間のμ_poleとする（ステップＳ５
８）。求めたμ_poleの値は次の音声区間での補間処理の
ためにlast＿μ_poleに格納する（ステップＳ５９）。Next, Tpz (z) which characterizes this embodiment
Using the adaptive filter of the transfer function, the inclination of the spectrum is corrected as follows. First, the autocorrelation coefficient r1 _zero is obtained from the parameter awi (i = 1 to 10) of A (z) (step S54), and then r1 _zero and the threshold Th.
Is compared (step S55), and r1 _zero is T
If it is smaller than h, the _value multiplied by C ₀ is set as μ _pole ′ (step S56), and if Th or more, the value obtained by multiplying r1zero by C ₁ is set as μ _pole ′ (step S57). A value obtained by interpolating μ _pole ′ and last_μ _pole corresponding to the previous μ _pole using C ₂ is set as the μ _{pole of the} current speech section (step S5).
8). The obtained μ _pole value is stored in last_μ _pole for interpolation processing in the next voice section (step S59).

【００９４】次に、Ｂ（ｚ）のパラメータｂｗｉ（ｉ＝
１〜１０）から、自己相関係数ｒ１_poleを求め（ステッ
プＳ６０）、ｒ１_poleをＣ₃ 倍してμ_zeroとする（ステ
ップＳ６１）。Next, the parameter bwi (i =
1 to 10), the autocorrelation coefficient r1 _pole is obtained (step S60), and the r1 _pole is multiplied by C ₃ to obtain μ _zero (step S61).

【００９５】こうして求められた２つのフィルタ係数μ
_pole，μ_zeroで決定された伝達関数Ｔｐｚ（ｚ）の適応
フィルタでフィルタリング処理を行うことにより、スペ
クトル包絡強調フィルタリングで混入した不要なスペク
トルの傾きを補償する（ステップＳ６２）。The two filter coefficients μ thus obtained
By performing the filtering process with the adaptive filter of the transfer function Tpz (z) determined by _pole and μ _zero , the unnecessary spectrum inclination mixed by the spectrum envelope emphasis filtering is compensated (step S62).

【００９６】最後に、ゲイン調整によりポストフィルタ
１０３による処理後の出力音声信号が処理前の入力音声
信号と同程度のパワとなるように滑らかにゲイン調整を
施してポストフィルタの出力とする（ステップＳ６
３）。Finally, the gain is adjusted so that the output audio signal after processing by the post filter 103 has the same level of power as the input audio signal before processing, and the output is output from the post filter (step). S6
3).

【００９７】なお、本実施例で用いる適応フィルタが独
自にフィルタゲインを持つ形で処理を実現しても構わな
い。その場合、適応フィルタの伝達関数Ｔｐｚ（ｚ）はＴｐｚ（ｚ）＝Ｇｐｚ（１−μ_zeroｚ^-1）／（１−μ_poleｚ^-1）（３１）と表すことができる。フィルタゲインＧｐｚはＧｐｚ＝（１−λ_poleμ_pole）／（１−λ_zeroμ_zero）（３２）を用いることができる。ここで、λ_poleおよびλ_zeroは
０＜λ_pole，λ_zero＜１を満たす固定の調整値である。Note that the adaptive filter used in this embodiment may implement the processing in such a way that it has its own filter gain. In that case, the transfer function Tpz (z) of the adaptive filter can be expressed as Tpz (z) = Gpz (1-μ _zero z ⁻¹ ) / (1-μ _pole z ⁻¹ ) (31). As the filter gain Gpz, Gpz = (1-λ _pole μ _pole ) / (1-λ _zero μ _zero ) (32) can be used. Here, λ _pole and λ _zero are fixed adjustment values that satisfy 0 <λ _pole and λ _zero <1.

【００９８】このようにすると、Ｔｐｚ（ｚ）なる伝達
関数の適応フィルタがゲインの簡易的な自己調整機能を
持つようにできるので、スペクトルの傾きを補償する補
償フィルタをゲイン調整部の後に配置するようなポスト
フィルタの構成の場合に有効である。By doing so, the adaptive filter of the transfer function Tpz (z) can be made to have a simple gain self-adjusting function, so that a compensating filter for compensating the slope of the spectrum is arranged after the gain adjusting section. This is effective in the case of such a post filter configuration.

【００９９】このように本実施例によれば、先の実施例
の効果に加えて、Ｃ₁ ＜Ｃ₃ ＜Ｃ₀なる関係の重み係数
を用い、Ａ（ｚ）のパラメータから求めた第１の自己相
関係数ｒ１_zeroが０に近いしきい値（Ｔｈとする）より
小のときｒ１_zeroに重み係数Ｃ₀ で重み付けを行い、ｒ
１_zeroがしきい値Ｔｈより大のときｒ１_zeroに重み係数
Ｃ₁ で重み付けを行うことにより得られる値からμ_pole
を求め、Ｂ（ｚ）のパラメータから求めた第２の自己相
関係数ｒ１_poleに重み係数Ｃ₃ で重み付けを行って得ら
れる値からμ_zeroを求めることにより、ｒ１_zeroがしき
い値Ｔｈよりも小さい場合、この区間の音声は高域の強
い子音のような音声であり、逆にｒ１_zeroがしきい値Ｔ
ｈよりも大きい場合は、この区間の音声は低域の強い母
音のような音声であるから、上記のように自己相関係数
としきい値Ｔｈとの比較により重み係数を切り換えるこ
とにより、補償フィルタ１１３を子音と母音それぞれに
適合した補償特性とすることができ、音質をさらに効果
的に改善することができる。As described above, according to the present embodiment, in addition to the effect of the previous embodiment, the first coefficient obtained from the parameter of A (z) using the weighting coefficient of the relation of C ₁ <C ₃ <C ₀ . When the autocorrelation coefficient r1 _zero of r1 _zero is smaller than a threshold value close to 0 (Th), r1 _zero is weighted with a weighting coefficient C ₀ ,
When 1 _zero is larger than the threshold Th, the value obtained by weighting r1 _zero with the weighting coefficient C ₁ is μ _pole
Is obtained and μ _zero is obtained from a value obtained by weighting the second autocorrelation coefficient r1 _pole obtained from the parameter of B (z) with the weighting coefficient C ₃ , whereby r1 _zero becomes smaller than the threshold Th. Is small, the voice in this section is a voice like a strong high-frequency consonant, and conversely r1 _zero is a threshold T.
If it is larger than h, the speech in this section is a speech like a strong vowel in the low frequency range. Therefore, by comparing the autocorrelation coefficient with the threshold Th as described above, the weighting coefficient is switched, and the compensation filter It is possible to make 113 the compensation characteristic suitable for each consonant and vowel, and to improve the sound quality more effectively.

【０１００】次に、第６の実施例として、ゲイン調整部
を改良したポストフィルタについて説明する。図１３
は、本実施例に係るポストフィルタを音声復号化装置の
最終段に適用した実施例を示し、図１と同じ機能を有す
るブロックについて同じ参照番号を付してある。すなわ
ち、入力端子１００に入力された送信側の音声符号化装
置から伝送された符号化データ（パラメータ化された音
声圧縮情報）からパラメータ復号部１０１および音声信
号再生部１０２を介して再生音声信号Ｓ（ｎ）が再生さ
れ、これをポストフィルタ４０３に通過させて最終的な
出力音声信号Ｓｏ（ｎ）が生成される。以下、本実施例
におけるポストフィルタ４０３について詳細に説明す
る。Next, as a sixth embodiment, a post filter with an improved gain adjusting section will be described. FIG.
Shows an embodiment in which the post filter according to the present embodiment is applied to the final stage of a speech decoding apparatus, and blocks having the same functions as in FIG. 1 are assigned the same reference numerals. That is, the reproduced audio signal S is input from the encoded data (parameterized audio compression information) transmitted from the audio encoding device on the transmitting side input to the input terminal 100 via the parameter decoding unit 101 and the audio signal reproducing unit 102. (N) is reproduced and passed through the post filter 403 to generate the final output audio signal So (n). Hereinafter, the post filter 403 in this embodiment will be described in detail.

【０１０１】ポストフィルタ４０３は、フィルタ処理部
４１０とゲイン調整部４１４から構成される。フィルタ
処理部４１０は、ポストフィルタ４０３における様々な
フィルタリング処理を行う。具体的には音声信号再生部
１０２からのＬＰＣ係数α_i（ｉ＝１〜１０）やピッチ
周期などの情報を基に行われるスペクトル包絡強調フィ
ルタ処理、ピッチ強調フィルタ処理およびこれらととも
に使用されるスペクトルの傾き補償フィルタ処理であ
る。ただし、フィルタ処理部４１０は、ここに挙げた処
理を全て含まなくともよく、例えばピッチ強調フィルタ
処理を含めなくてもよい。The post filter 403 comprises a filter processing section 410 and a gain adjusting section 414. The filter processing unit 410 performs various filtering processes in the post filter 403. Specifically, the spectrum envelope emphasis filter processing, the pitch emphasis filter processing, and the spectrum used together with the spectrum envelope emphasis filter processing performed based on the information such as the LPC coefficient α _i (i = 1 to 10) and the pitch period from the audio signal reproducing unit 102. Is a slope compensation filter process of. However, the filter processing unit 410 may not include all of the processes described here, and may not include the pitch enhancement filter process, for example.

【０１０２】フィルタ処理部４１０は、フィルタの零入
力応答Ｚ_i （ｎ）と零状態応答Ｚｓ（ｎ）を現音声区間
に対応する長さの分だけ求め、これらをゲイン調整部４
１４に出力する。零入力応答Ｚｉ（ｎ）は、フィルタ処
理部４１０の入力側の信号が完全に零であるとしてフィ
ルタを動作させたときに、フィルタの内部状態だけに依
存して出力される応答である。一方、零状態応答Ｚｓ
（ｎ）は、フィルタの内部状態が零であるとしてフィル
タ処理部４１０に入力を与えたときに出力される応答で
ある。The filter processing section 410 finds the zero-input response Z _i (n) and the zero-state response Zs (n) of the filter by the length corresponding to the current speech section, and these are obtained.
It outputs to 14. The zero input response Zi (n) is a response that is output depending on only the internal state of the filter when the filter is operated assuming that the signal on the input side of the filter processing unit 410 is completely zero. On the other hand, the zero-state response Zs
(N) is a response output when an input is given to the filter processing unit 410 assuming that the internal state of the filter is zero.

【０１０３】ゲイン調整部４１４はゲイン算出部４１
５、ゲイン乗算部４１６および加算部４１７から構成さ
れ、フィルタ処理部４１０からの零状態応答Ｚｓ（ｎ）
に乗じるゲインをゲイン算出部４１５で算出し、このゲ
インをゲイン乗算部４１６で乗じた後、その結果を加算
部４１７で零入力応答と加算する。これにより、パワ調
整された出力音声信号Ｓｏ（ｎ）が生成され、これが音
声信号出力端子４０４に出力される。The gain adjusting unit 414 is the gain calculating unit 41.
5, the gain multiplication unit 416 and the addition unit 417, and the zero-state response Zs (n) from the filter processing unit 410.
The gain calculation unit 415 calculates the gain to be multiplied by, and the gain multiplication unit 416 multiplies the gain, and the addition unit 417 adds the result to the zero input response. As a result, the power-adjusted output audio signal So (n) is generated and output to the audio signal output terminal 404.

【０１０４】本実施例によるゲイン調整方法を用いる
と、ポストフィルタ４０３の出力音声信号Ｓｏ（ｎ）の
パワを入力音声信号Ｓ（ｎ）のパワと所定の音声区間
（例えばサブフレーム）単位で完全に一致させることが
できる。しかも、ゲインのスムージング等の処理を一切
行わなくとも、区間の境界で出力音声信号のパワが不連
続になることを回避できる。また、本実施例では正のゲ
インを用いたときにパワを一致できるかどうかを判定
し、できない場合は、ゲインは入出力におけるパワの不
一致の影響が少ないゲイン値Ｃ₄ （≧０）に設定され
る。これによりポストフィルタ４０３からの出力音声信
号Ｓｏ（ｎ）の音質を安定的に改善できる効果がある。When the gain adjusting method according to the present embodiment is used, the power of the output audio signal So (n) of the post filter 403 is completely combined with the power of the input audio signal S (n) in units of a predetermined audio section (for example, subframe). Can be matched to. Moreover, it is possible to avoid discontinuity in the power of the output audio signal at the boundary of the section without performing any processing such as gain smoothing. Further, in the present embodiment, it is determined whether or not the power can be matched when a positive gain is used, and if not, the gain is set to a gain value C ₄ (≧ 0) which is less affected by the power mismatch in input and output. To be done. This has the effect of stably improving the sound quality of the output audio signal So (n) from the post filter 403.

【０１０５】ゲイン算出部４１５は、次式に基づいてゲ
インｇを求める。ＩＦ（ｄ＞０）（３３）ｇ＝［ｓｑｒｔ（ｂ² ＋ｄ）−ｂ］／ａ（３４）ｅｌｓｅｇ＝Ｃ₄ （３５）ｅｎｄｉｆここで、ａ＝ΣＺ_s （ｎ）Ｚ_s （ｎ）（ｎ＝０〜Ｎ−１）（３６）ｂ＝ΣＺ_i （ｎ）Ｚ_s （ｎ）（ｎ＝０〜Ｎ−１）（３７）ｄ＝ａ（ΣＳ（ｎ）Ｓ（ｎ）−ΣＺ_i （ｎ）Ｚ_i （ｎ））（ｎ＝０〜Ｎ−１）（３８）である。ここで、関数ｓｑｒｔ（ｘ）はｘの平方根、Ｎ
は所定の音声区間（例えばサブフレーム）の長さを表
す。また、パラメータＣ₄ は入出力音声信号のパワを負
でないゲインで一致させることができないような悪い状
態のときにｇとして用いる値であり、０≦Ｃ₄ ＜１の範
囲の数値であることが望ましい。例えば、Ｃ₄ ＝０．５
というように固定の値にしてもよい。The gain calculator 415 calculates the gain g based on the following equation. IF (d> 0) (33) g = [sqrt (b ² + d) -b] / a (34) else g = C ₄ (35) endif where a = ΣZ _s (n) Z _s (n) (N = 0 to N−1) (36) b = ΣZ _i (n) Z _s (n) (n = 0 to N−1) (37) d = a (ΣS (n) S (n) −ΣZ a _{_{i (n) Z i (n}} )) (n = 0~N-1) (38). Where the function sqrt (x) is the square root of x, N
Represents the length of a predetermined voice section (for example, a subframe). The parameter C ₄ is a value used as g in a bad state where the powers of the input and output audio signals cannot be matched with a non-negative gain, and is a numerical value in the range of 0 ≦ C ₄ <1. desirable. For example, C ₄ = 0.5
You may make it a fixed value like this.

【０１０６】（３３）式の条件（ｄ＞０）に基づいてｇ
を求めるようにすると、ｇが負の値になることを確実に
防止できるので、安定的なゲイン調整を実現できるとい
うメリットがある。この条件は（３６），（３８）式か
ら分かるように、零状態応答のパワが正で、かつ入力音
声信号のパワが零入力応答のパワより大きいことを意味
している。もしこの条件が満足されない場合は、正（プ
ラス）のゲインで入出力のパワを一致させることはでき
ない。Based on the condition (d> 0) of the equation (33), g
When g is obtained, it is possible to reliably prevent g from becoming a negative value, and there is an advantage that stable gain adjustment can be realized. This condition means that the power of the zero-state response is positive and the power of the input voice signal is larger than the power of the zero-input response, as can be seen from the equations (36) and (38). If this condition is not satisfied, the input and output powers cannot be matched with a positive (plus) gain.

【０１０７】上記の（３４），（３６），（３７），
（３８）式は、特願平２−４１２８６（適応ポストフィ
ルタ）にも示されているが、この方法ではゲインｇを求
める条件式に問題がある。すなわち、特願平２−４１２
８６では「ｓｑｒｔの括弧内の値ｂ² ＋ｄが正ならば
（３４）式からｇを計算する」ようになっているため、
これから求めたｇの値がマイナスになる場合がある。負
のゲイン値を用いると、零状態応答Ｚｓ（ｎ）にゲイン
を乗じた後の波形が反転することになり、最終的な出力
音声波形の形状が乱れ、音質的にはプチプチという耳ざ
わりな雑音が混入する問題があった。The above (34), (36), (37),
Expression (38) is also shown in Japanese Patent Application No. 2-41286 (adaptive post filter), but this method has a problem in the conditional expression for obtaining the gain g. That is, Japanese Patent Application No. 2-412
In 86, "If the value b ² + d in the brackets of sqrt is positive, g is calculated from the equation (34)".
The value of g obtained from this may be negative. If a negative gain value is used, the waveform after the zero-state response Zs (n) is multiplied by the gain will be inverted, and the shape of the final output speech waveform will be disturbed, resulting in a jarring noise that is apt to sound in terms of sound quality. There was a problem that was mixed.

【０１０８】これについて具体的な数値例で説明する
と、（３５），（３６），（３７）式からａ＝２、ｂ＝
５、ｄ＝−２４と求められた場合、特願平２−４１２８
６に記載されている方法ではｂ² ＋ｄ＝５² −２４＞０
であるから、（３４）式のゲイン計算式を用いてｇ＝
（ｓｑｒｔ（５² −２４）−５）／２＝−２となり、マ
イナスのゲインを用いて波形を変形させることにより強
制的に入出力のパワの一致が図られるという結果にな
る。This will be explained using a specific numerical example. From equations (35), (36) and (37), a = 2 and b =
5 and d = −24, Japanese Patent Application No. 2-4128
In the method described in No. 6, b ² + d = 5 ² -24> 0.
Therefore, using the gain calculation formula (34), g =
Becomes ^{(sqrt (5 2 -24) -5} ) / 2 = -2 , and the called forced match input and output power by deforming the waveform using a negative gain is achieved results.

【０１０９】一方、本実施例では、ｄがマイナス値であ
るから、（３３）式の条件文により、（３４）式は用い
ずに（３５）式で正のゲイン値ｇ＝Ｃ₄ （１＞Ｃ₄ ≧
０）となる。このように、本実施例におけるゲイン調整
においては、負のゲインで入出力のパワを一致させるこ
とはせずに、正のゲインでパワの一致ができない場合
は、ゲインの不一致の影響をできるだけ少なくなるよう
に、ゲインｇを負でない値Ｃ₄ に置き換えるようにした
ことを特徴とする。これにより、従来よりもポストフィ
ルタの音質を安定的に改善することができる効果があ
る。On the other hand, in this embodiment, since d is a negative value, the conditional expression of the equation (33) does not use the equation (34), but the positive gain value g = C ₄ (1 > C ₄ ≧
0). As described above, in the gain adjustment in the present embodiment, if the input and output powers are not matched with a negative gain and the powers cannot be matched with a positive gain, the influence of the gain mismatch is reduced as much as possible. As described above, the gain g is replaced with a non-negative value C ₄ . As a result, there is an effect that the sound quality of the post filter can be improved more stably than before.

【０１１０】図１４に、ゲイン算出部４１５のさらに詳
細な処理のシグナルフローの一例を表す。同図におい
て、計算部４２０は入力音声信号Ｓ（ｎ）から、そのパ
ワを計算する（（３８）式右辺の括弧内の第１項に対
応）。計算部４２１は零入力応答Ｚ_i （ｎ）のパワを計
算する（（３８）式右辺の括弧内の第２項に対応）。計
算部４２２は零状態応答Ｚ_s （ｎ）のパワを計算する
（（３６）式のａに対応）。計算部４２３は零入力応答
と零状態応答の内積を計算する（（３７）式のｂに対
応）。次に、ゲイン判定部４２５は計算部４２０，４２
１，４２２からの計算値（パラメータａとｄの情報）を
基に、（３３）式に相当する条件の判定を行う。但し、
（３７）式のパラメータｂは判定に用いない。この判定
結果に基づき、ゲインの計算に（３４）式を使うべきか
（３５）式を使うべきかの判定情報をゲイン決定部４２
６へ渡す。ゲイン決定部４２６は、計算部４２０，４２
１，４２２，４２３からのそれぞれの計算値と、正ゲイ
ン出力部４２４からの正のゲイン値Ｃ₄ を入力し、ゲイ
ン判定部４２５からの判定情報に従い（３４）式または
（３５）式からゲインをｇを決定して、これをゲイン算
出部４１５の出力とする。FIG. 14 shows an example of a signal flow of a more detailed process of the gain calculation section 415. In the figure, the calculation unit 420 calculates the power from the input audio signal S (n) (corresponding to the first term in parentheses on the right side of the expression (38)). The calculation unit 421 calculates the power of the quiescent response Z _i (n) (corresponding to the second term in the parentheses on the right side of Expression (38)). The calculator 422 calculates the power of the zero-state response Z _s (n) (corresponding to a in the equation (36)). The calculator 423 calculates the inner product of the zero-input response and the zero-state response (corresponding to b in equation (37)). Next, the gain determination unit 425 determines the calculation units 420 and 42.
Based on the calculated values (information of parameters a and d) from 1, 422, the condition corresponding to the equation (33) is determined. However,
The parameter b of the equation (37) is not used for the determination. Based on this determination result, the gain determining unit 42 is provided with the determination information whether to use the equation (34) or the equation (35) for the gain calculation.
Pass to 6. The gain determination unit 426 includes calculation units 420 and 42.
The calculated values from 1, 422 and 423 and the positive gain value C ₄ from the positive gain output unit 424 are input, and the gain is calculated from the formula (34) or (35) according to the judgment information from the gain judgment unit 425. Is determined as g and is used as the output of the gain calculation unit 415.

【０１１１】図１３に戻って説明を続けると、ゲイン乗
算部４１６はゲイン算出部４１５において求められたゲ
インｇをフィルタ処理部４１０から入力される零状態応
答Ｚ_s （ｎ）に乗じる。加算部４１７は、乗算部４１６
の出力信号とフィルタ処理部４１０からの零入力応答Ｚ
_i （ｎ）を加算した信号を出力音声信号Ｓｏ（ｎ）とし
てポストフィルタの出力端子４０４に出力する。ゲイン
調整部４１４の出力、つまりポストフィルタ４０３の出
力Ｓｏ（ｎ）は、次式で表すことができる。Returning to FIG. 13 and continuing the description, the gain multiplication unit 416 multiplies the zero-state response Z _s (n) input from the filter processing unit 410 by the gain g calculated by the gain calculation unit 415. The addition unit 417 is a multiplication unit 416.
Output signal and quiescent response Z from the filter processing unit 410
_The signal obtained by adding _i (n) is output to the output terminal 404 of the post filter as the output audio signal So (n). The output of the gain adjusting unit 414, that is, the output So (n) of the post filter 403 can be expressed by the following equation.

【０１１２】Ｓｏ（ｎ）＝Ｚ_i （ｎ）＋ｇＺ_s （ｎ）（ｎ＝０〜Ｎ−１）（３９）本実施例では特願平２−４１２８６と異なり、（３９）
式のゲインｇは常に零以上の値となる。従って、Ｚ_s
（ｎ）の波形が反転することを確実に防止できるため、
出力音声信号Ｓｏ（ｎ）の音質が安定的に改善されたポ
ストフィルタを提供できる。So (n) = Z _i (n) + gZ _s (n) (n = 0 to N−1) (39) In this embodiment, unlike Japanese Patent Application No. 2-41286, (39)
The gain g of the expression is always a value of zero or more. Therefore, Z _s
Since it is possible to reliably prevent the waveform of (n) from being inverted,
It is possible to provide a post filter in which the sound quality of the output audio signal So (n) is stably improved.

【０１１３】（３９）式で求めた出力音声信号Ｓｏ
（ｎ）のうち、後半のＰ個（Ｓｏ（Ｎ−Ｐ），…，Ｓｏ
（Ｎ−１））の値は、次の音声区間での零入力応答計算
に用いるフィルタの初期内部状態として利用できるの
で、図１３に示すようにフィルタ処理部４１０に、この
Ｓｏ（ｎ）の後半のＰ個の値を示すデータ４１８を渡す
ようにする。Output audio signal So obtained by the equation (39)
Of (n), the latter half P (So (NP), ..., So)
Since the value of (N-1)) can be used as the initial internal state of the filter used for the zero-input response calculation in the next speech section, the filter processing unit 410 is notified of this So (n) as shown in FIG. Data 418 indicating the latter half P values is passed.

【０１１４】次に、図１５のフローチャートを用いて本
実施例における一つの音声区間内で行われる処理の流れ
について説明する。まず、パラメータ化された音声圧縮
情報を復号し（ステップＳ７１）、この復号情報から音
声信号Ｓ（ｎ）を再生する（ステップＳ７２）。ポスト
フィルタに音声信号Ｓ（ｎ）を入力すると共に、ポスト
フィルタ内のフィルタを構成するために必要なＬＰＣ係
数やピッチ情報をポストフィルタに入力する（ステップ
Ｓ７３）。次にポストフィルタ内の処理に入る。初め
に、ポストフィルタ４０３内部のフィルタ処理部で零入
力応答と零状態応答が求められる（ステップＳ７４）。
次に、零入力応答と零状態応答と入力音声信号を用いて
ゲイン判定に必要なパラメータａ，ｄがそれぞれ（３
６）式、（３８）式により計算される（ステップＳ７
６）。計算されたパラメータａ，ｄのうちのパラメータ
ｄは（３３）式のゲイン判定にかけられ（ステップＳ７
７）、もし条件が満たされれば（Ｙｅｓならば）、（３
７），（３４）式を用いてゲインｇが求められ（ステッ
プＳ７８、ステップＳ７９）、条件が満足されなければ
（Ｎｏならば）、（３５）式を用いてゲインがｇ＝Ｃ4
に設定される（ステップＳ８０）。零状態応答をｇ倍し
た信号と零入力応答とを加算することによって、出力音
声信号Ｓｏ（ｎ）が得られる（ステップＳ８１）。最後
に、Ｓｏ（ｎ）を用いて零入力応答計算に用いるフィル
タの初期内部状態が更新される（ステップＳ８２）。Next, the flow of processing performed in one voice section in this embodiment will be described with reference to the flowchart of FIG. First, the parameterized voice compression information is decoded (step S71), and the voice signal S (n) is reproduced from this decoded information (step S72). The audio signal S (n) is input to the post filter, and the LPC coefficient and pitch information necessary to configure the filter in the post filter are input to the post filter (step S73). Next, the processing in the post filter is started. First, a zero-input response and a zero-state response are obtained by the filter processing unit inside the post filter 403 (step S74).
Next, using the zero input response, the zero state response, and the input voice signal, the parameters a and d necessary for the gain determination are (3
It is calculated by the equations (6) and (38) (step S7).
6). The parameter d of the calculated parameters a and d is subjected to the gain determination of the equation (33) (step S7).
7) If the condition is met (Yes), then (3
The gain g is obtained by using the equations (7) and (34) (steps S78 and S79). If the condition is not satisfied (No), the gain is g = C4 by using the equation (35).
Is set (step S80). An output audio signal So (n) is obtained by adding the signal obtained by multiplying the zero-state response by g and the zero-input response (step S81). Finally, So (n) is used to update the initial internal state of the filter used for the zero-input response calculation (step S82).

【０１１５】このように本実施例によれば、音声信号の
スペクトル形状を調整すべく音声信号に対して行われる
フィルタ処理に伴う音声信号のパワー変化を補償するた
めに、音声信号に乗じるゲインを調整する際、音声信号
に乗じるべきゲインを計算した後、そのゲインの正負判
定を行い、これが負の場合は所定の方法で与えられる非
負の値、好ましくは０以上、１未満の小さな値にゲイン
を置き換えることによによって、負のゲインに起因する
音質劣化を回避することができる。As described above, according to this embodiment, in order to compensate the power change of the audio signal due to the filtering process performed on the audio signal to adjust the spectrum shape of the audio signal, the gain by which the audio signal is multiplied is set. When adjusting, the gain to be multiplied to the audio signal is calculated, and then the positive / negative judgment of the gain is performed. If this is negative, the gain is given to a non-negative value given by a predetermined method, preferably a small value of 0 or more and less than 1 By replacing, it is possible to avoid the sound quality deterioration due to the negative gain.

【０１１６】なお、本実施例では（３８）式に示したよ
うにゲイン調整部の入力音声信号Ｓ（ｎ）のパワを指標
として出力音声信号Ｓｏ（ｎ）のパワを調整することで
ゲイン調整を行ったが、ゲイン調整に用いる指標は入力
音声信号のパワに限られるものではなく、例えば音声信
号再生部１０２から得られるパワ情報や、音声の有音区
間と無音区間とによってゲインを異ならせるための情報
その他をゲイン調整の指標として用いる場合にも本発明
は有効である。In this embodiment, the gain adjustment is performed by adjusting the power of the output audio signal So (n) using the power of the input audio signal S (n) of the gain adjusting unit as an index as shown in the equation (38). However, the index used for the gain adjustment is not limited to the power of the input audio signal. For example, the gain is made different depending on the power information obtained from the audio signal reproducing unit 102 and the voiced section and the silent section of the voice. The present invention is effective when the information for the above is used as an index for gain adjustment.

【０１１７】また、以上説明した実施例では、Ｆ（ｚ）
＝Α（ｚ）／Ｂ（ｚ）なる伝達特性のスペクトル包絡強
調フィルタ１１２がもたらすスペクトルの不要な傾きを
補償する方法として、(1) 分子側の伝達関数Α（ｚ）が
もたらすスペクトルの傾きを零フィルタで補償し、分母
側の伝達関数Ｂ（ｚ）がもたらすスペクトルの傾きを極
フィルタで補償する方法（零極法）と、(2) 分子側の伝
達関数Α（ｚ）がもたらすペクトルの傾きを極フィルタ
で補償し、分母側の伝達関数Ｂ（ｚ）がもたらすスペク
トルの傾きを零フイルタで補償する方法（極零法）の２
つの方法について述べたが、これら(1)(2)の方法を組み
合わせた方法として、(3) 分子側の伝達関数Α（ｚ）と
分母側の伝達関数Ｂ（ｚ）がそれぞれもたらすスペクト
ルの傾きを零フイルタと零フイルタの組み合わせの適応
フィルタで補償する方法（零零法）や、(4) 極フィルタ
と極フイルタの組み合わせで補償する方法（極極法）な
どが考えられるが、ここでは詳細な説明を省略する。In the embodiment described above, F (z)
= (A (z) / B (z)) As a method of compensating for the unnecessary slope of the spectrum produced by the spectrum envelope emphasizing filter 112, (1) the spectrum slope produced by the transfer function Α (z) on the numerator side is The zero filter is used to compensate for the spectral slope caused by the transfer function B (z) on the denominator side by the polar filter (zero pole method), and (2) The spectrum of the numerator side transfer function Α (z) A method (pole-zero method) of compensating the gradient with a pole filter and compensating the gradient of the spectrum caused by the transfer function B (z) on the denominator side with a zero filter
Two methods have been described. As a method combining these methods (1) and (2), (3) the slope of the spectrum caused by the transfer function Α (z) on the numerator side and the transfer function B (z) on the denominator side respectively. There are possible methods such as a method of compensating with an adaptive filter that combines zero filters and zero filters (zero-zero method), and a method of compensating (4) using a combination of pole filters and pole filters (pole-pole method). The description is omitted.

【０１１８】また、以上の実施例では適応フィルタ１２
１やピッチ傾き補償フィルタ２０５のフィルタ係数は、
それぞれスペクトル包絡強調フィルタ１１２やピッチ成
分強調フィルタ２０４のフィルタ係数と共に更新され
る。これに対して、時間的により滑らかにフィルタ係数
が更新されるようにするために、適応フィルタ１２１や
ピッチ傾き補償フィルタ２０５において前音声区間に使
用したフィルタ係数と、現音声区間でスペクトル包絡強
調フィルタ１１２やピッチ成分強調フィルタ２０４のフ
ィルタ係数から求められたフィルタ係数を用いて補間し
たフィルタ係数を現音声区間において適応フィルタ１２
１とピッチ傾き補償フィルタ２０５に使用する方法も有
効である。このようにすると、適応フィルタ１２１やピ
ッチ傾き補償フィルタ２０５の伝達関数変化がゆるやか
に変動するようになるため、最終的な音声信号が背景雑
音によって細かく変動する現象を防止できる効果があ
る。In the above embodiment, the adaptive filter 12 is used.
1 and the filter coefficient of the pitch slope compensation filter 205 are
They are updated together with the filter coefficients of the spectrum envelope emphasis filter 112 and the pitch component emphasis filter 204. On the other hand, in order to update the filter coefficient more smoothly in time, the filter coefficient used in the previous speech section in the adaptive filter 121 and the pitch gradient compensation filter 205 and the spectrum envelope emphasis filter in the current speech section. 112 and the filter coefficient interpolated using the filter coefficient obtained from the filter coefficient of the pitch component enhancement filter 204 in the current speech section.
1 and the method used for the pitch slope compensation filter 205 is also effective. In this way, changes in the transfer function of the adaptive filter 121 and the pitch slope compensation filter 205 gradually change, so that it is possible to prevent a phenomenon in which the final audio signal slightly changes due to background noise.

【０１１９】次に、図１６及び図１７を参照して第７の
実施例を説明する。第１から第６の実施例は、本発明を
復号化側のポストフィルタに適用する例について述べて
きたが、本実施例では、この発明のスペクトル形状調整
方法が符号化側で用いられる重みフイルタの不要なスペ
クトルの傾きを補償するために用いられる例が示されて
いる。Next, a seventh embodiment will be described with reference to FIGS. Although the first to sixth embodiments have described the examples in which the present invention is applied to the post filter on the decoding side, in the present embodiment, the weight filter used in the encoding side is the spectral shape adjusting method of the present invention. An example is shown used to compensate for the unwanted spectral tilt of the.

【０１２０】本実施例によると、スペクトルの傾きが補
償された重みフイルタを用いることにより、符号選択の
指標となる歪み尺度の重み付けが適正化されるため、よ
り忠実に原音を表現できる符号が選択でき、結果として
同じビットレート、同じ符号化方式でも音質を改善する
ことができるようになるという効果がある。According to the present embodiment, by using the weight filter in which the slope of the spectrum is compensated, the weighting of the distortion measure, which is an index of code selection, is optimized, so that the code that can more faithfully express the original sound is selected. As a result, there is an effect that the sound quality can be improved even with the same bit rate and the same encoding method.

【０１２１】図１６は、本実施例に係る重みフイルタを
音声符号化装置に適用した例を示す。入力端子７０に入
力された音声信号はフレーム単位に分析・符号化され、
最終的に符号化された情報は端子８４〜８７から出力さ
れる。ここで行う符号化は、大きく分けて、合成フイル
タの情報と駆動信号の情報の２つの情報を符号化するた
めに行われる。合成フィルタの情報は１０〜３０ｍｓ程
度の長さのフレーム単位毎に音声から抽出して符号化さ
れ、駆動信号の情報はフレームをさらに小さく分割した
サブフレーム単位に符号化されることが多いが、ここで
は簡単化のため、駆動信号の符号化もフレーム単位で行
う例で説明する。FIG. 16 shows an example in which the weight filter according to this embodiment is applied to a speech coder. The audio signal input to the input terminal 70 is analyzed and coded in frame units,
The finally encoded information is output from terminals 84-87. The encoding performed here is roughly performed to encode two pieces of information, that is, information of the synthesis filter and information of the drive signal. The information of the synthesis filter is often extracted from the voice and encoded for each frame unit having a length of about 10 to 30 ms, and the information of the drive signal is often encoded in subframe units obtained by dividing a frame into smaller parts. Here, for the sake of simplicity, an example will be described in which the drive signal is encoded in frame units.

【０１２２】駆動信号が入力される合成フイルタから得
られる出力信号が再生された音声信号となることはすで
に述べた通りである。以下に図１６の音声符号化装置に
ついて詳細に説明する。As described above, the output signal obtained from the synthesis filter to which the drive signal is input becomes the reproduced audio signal. The speech coding apparatus of FIG. 16 will be described in detail below.

【０１２３】図１６の音声符号化装置は、合成フイルタ
情報分析部７１、重みフィルタ情報計算部７２、重みフ
イルタ７３、目標信号生成部７４、適応コードブック７
５、雑音コードブック７６、ゲインコードブック７７、
ゲイン付与部７８、７９、加算部８０、重み付き合成フ
イルタ８１、歪み評価部８２、符号選択部８３から構成
される。重みフィルタ情報計算部７２はさらにＷＡ計算
部８８、ＷＢ計算部８９、μ_p 計算部９０、μ_z 計算部
９１から構成される。本実施例ではμ_p 計算部９０、μ
_z 計算部９１で求められた情報を基に重みフィルタの特
性を補償するようにしたことが従来法の音声符号化と異
なる。The speech coding apparatus shown in FIG. 16 includes a synthesis filter information analysis unit 71, a weight filter information calculation unit 72, a weight filter 73, a target signal generation unit 74, and an adaptive codebook 7.
5, noise codebook 76, gain codebook 77,
The gain adding units 78 and 79, the addition unit 80, the weighted synthesis filter 81, the distortion evaluation unit 82, and the code selection unit 83 are included. The weighting filter information calculation unit 72 further includes a WA calculation unit 88, a WB calculation unit 89, a μ _p calculation unit 90, and a μ _z calculation unit 91. In this embodiment, the μ _p calculator 90, μ
_The fact that the characteristic of the weighting filter is compensated based on the information obtained by the _z calculation unit 91 is different from the conventional speech coding.

【０１２４】以下に図１７のフローチャートを参照して
第７の実施例の動作を説明する。合成フイルタ情報分析
部７１は端子７０から入力される音声信号をフレーム単
位で分析し、対応する音声信号のスペクトル包絡の形状
を表現できるような合成フィルタのパラメータを抽出す
る。合成フイルタのパラメータの抽出方法としてはＬＰ
Ｃ分析を用いてＬＰＣ係数を抽出する方法を用いること
ができる。抽出されたパラメータは合成フイルタ情報分
析部７１において量子化に適したパラメータに変換さ
れ、符号化される。符号化された合成フイルタの情報は
端子８４に出力される。The operation of the seventh embodiment will be described below with reference to the flowchart of FIG. The synthesis filter information analysis unit 71 analyzes the voice signal input from the terminal 70 on a frame-by-frame basis, and extracts the parameters of the synthesis filter capable of expressing the shape of the spectral envelope of the corresponding voice signal. LP is used as the method for extracting the parameters of the synthesis filter.
A method of extracting the LPC coefficient using C analysis can be used. The extracted parameters are converted into parameters suitable for quantization in the synthesis filter information analysis unit 71 and encoded. Information of the encoded synthesis filter is output to the terminal 84.

【０１２５】合成フイルタ情報分析部７１において量子
化された合成フイルタの情報は重み付き合成フイルタ８
１で用いられる。また、量子化されない合成フィルタの
情報は重みフイルタ情報計算部７２で用いられる。重み
フイルタ情報計算部７２は、量子化されない合成フイル
タの情報を基に重みフィルタ７３と重み付き合成フイル
タ８１で用いる重みフィルタのパラメータを計算する。
ここでは、この計算において量子化しない合成フイルタ
の情報を用いる例で説明するが、応用によっては量子化
した合成フイルタの情報を用いる方法も利用可能であ
る。また、この実施例で用いる重みフイルタＷ（ｚ）は
次式で表せるものを用いる。The information on the synthesizing filter quantized by the synthesizing filter information analyzing unit 71 is the weighted synthesizing filter 8
Used in 1. The information of the synthesis filter that is not quantized is used by the weight filter information calculation unit 72. The weight filter information calculation unit 72 calculates the parameters of the weight filter used in the weight filter 73 and the weighted synthesis filter 81 based on the information of the unquantized synthesis filter.
Here, an example will be described in which the information of the synthesis filter that is not quantized is used in this calculation, but a method of using the information of the quantized synthesis filter can also be used depending on the application. The weight filter W (z) used in this embodiment is represented by the following equation.

【０１２６】[0126]

【数１】ここでＷＡ（ｚ）／ＷＢ（ｚ）は従来の構成の重みフィ
ルタであるが、これには不要なスペクトルの傾きが含ま
れているので、これを補償するために本発明の極零フィ
ルタ（１−μ_z ｚ^-1）／（１−μ_p ｚ^-1）を用いるのが
本実施例の主旨である。ここでは１次の極零フィルタを
用いて説明するが、これに限られるものではない。ま
た、重みフィルタのフイルタリングに要する計算量を削
減するために（４０）式のＷ（ｚ）の特性をある程度維
持できるようにして別の構成のフイルタに変換したもの
をＷ（ｚ）として用いる方法も有効であり、本発明にて
実現できる。具体的な例としては、（４０）式の右辺の
インパルス応答に時間窓をかけて所定の短いＫ＋１サン
プルで計算は打ち切る。このようなフィルタを重みフィ
ルタＷ（ｚ）とすることで、本発明のスペクトル形状の
傾き補償処理を含めて計算量が少ない重みフィルタを構
成することができる。このとき傾き補償を含めた重みフ
イルタＷ（ｚ）は[Equation 1] Here, WA (z) / WB (z) is a weighting filter of a conventional configuration, but since this includes an unnecessary spectrum slope, the pole-zero filter ( _{^{1-μ z z -1) /}} (1-μ p z -1) is used to take a gist of the present embodiment. Here, a first-order pole-zero filter will be used for explanation, but the present invention is not limited to this. Further, in order to reduce the amount of calculation required for filtering the weight filter, a filter having a different configuration so that the characteristic of W (z) in the expression (40) can be maintained to some extent is used as W (z). The method is also effective and can be realized by the present invention. As a specific example, a time window is applied to the impulse response on the right side of the equation (40) to terminate the calculation with a predetermined short K + 1 sample. By using such a filter as the weighting filter W (z), it is possible to configure a weighting filter having a small calculation amount including the slope compensation processing of the spectrum shape of the present invention. At this time, the weight filter W (z) including the tilt compensation is

【０１２７】[0127]

【数２】ここで、window(i) は時間窓を表し、w(i)は（４０）式
右辺の伝達関数のインパルス応答を表す。時間窓として
は方形窓やハミング窓などを用いることができる。[Equation 2] Here, window (i) represents a time window, and w (i) represents the impulse response of the transfer function on the right side of Expression (40). A square window or a Hamming window can be used as the time window.

【０１２８】重みフィルタ情報計算部７２のＷＡ計算部
８８とＷＢ計算部８９は、それぞれ、重みフィルタを構
成するＷＡ（ｚ）とＷＢ（ｚ）のパラメータを求める。
具体的な計算方法の一例を以下に示す。いまＬＰＣ分析
の次数をＰとすると、量子化しないＬＰＣ係数α_i （ｉ
＝１〜Ｐ）を用いて、ＷＡ（ｚ）の係数φ_i とＷＢ
（ｚ）のψ_i は φ_i ＝（υ₁ ）ⁱ α_i （ｉ＝１〜Ｐ）（４２） ψ_i ＝（υ₂ ）ⁱ α_i （ｉ＝１〜Ｐ）（４３）により計算する。Ｐは音声符号化では１０程度の値を用
いることが多い。ここでThe WA calculation unit 88 and the WB calculation unit 89 of the weight filter information calculation unit 72 obtain the parameters of WA (z) and WB (z), respectively, which form the weight filter.
An example of a specific calculation method is shown below. Assuming that the order of the LPC analysis is P, the LPC coefficient α _i (i
= 1 to P), the coefficient φ _i of WA (z) and WB
Ψ _{i of} (z) is calculated by φ _i = (υ ₁ ) ⁱ α _i (i = 1 to P) (42) ψ _i = (υ ₂ ) ⁱ α _i (i = 1 to P) (43) . A value of about 10 is often used for P in speech coding. here

【０１２９】[0129]

【数３】である。式（４２）と（４３）で用いられるυ₁ とυ₂
は重み付けの程度を調整するパラメータで、０＜υ₂ ＜
υ₁ ＜１（ポストフィルタの場合と調整値が異なること
に注意）の関係がある。代表的な値はυ₁ ＝０．９、υ
₂ ＝０．４である。μ_p 計算部９０は、ＷＡ計算部から
のＷＡ（ｚ）の情報（ここでは（φ_i ）を用いて、ＷＡ
（ｚ）の持つ不要なスペクトルの傾きを補償する極フイ
ルタの係数μ_p を求める。すなわち、第２の実施例で説
明した方法と同様に、Ｄｕｒｂｉｎ法の逆のアルゴリズ
ムを利用することにより、φ_i から１次のＰＡＲＣＯＲ
係数を求め、これをμ_p とする。[Equation 3] Is. Υ ₁ and υ ₂ used in equations (42) and (43)
Is a parameter for adjusting the degree of weighting, and 0 <υ ₂ <
There is a relation of υ ₁ <1 (note that the adjustment value is different from that of the post filter). Typical values are υ ₁ = 0.9, υ
₂ = 0.4. The μ _p calculation unit 90 uses the information of WA (z) from the WA calculation unit (here, (φ _i ) to calculate WA
The coefficient μ _{p of the} polar filter for compensating for the unnecessary spectral tilt of (z) is obtained. That is, similar to the method described in the second embodiment, by using the inverse algorithm of the Durbin method, the first-order PARCOR from φ _i is used.
Find the coefficient and call it μ _p .

【０１３０】μ_z 計算部９１は、ＷＢ計算部からのＷＢ
（ｚ）の情報（ここではψ_i ）を用いて、ＷＢ（ｚ）の
持つ不要なスペクトルの傾きを補償する零フイルタの係
数μ_z を求める。すなわち、第２の実施例で説明した方
法と同様に、Ｄｕｒｂｉｎ法の逆のアルゴリズムを利用
することにより、ψ_i から１次のＰＡＲＣＯＲ係数を求
め、これをμ_z とする。この他、重みフイルタに用いる
係数μ_p とμ_z の値をさらに変形し、より詳細な最適化
の調整をする方法も有効である。一例としては、次のよ
うな係数の変形を行う。The μ _z calculation unit 91 uses the WB from the WB calculation unit.
Using the information of (z) (here, ψ _i ), the zero filter coefficient μ _z for compensating for the unnecessary spectrum inclination of WB (z) is obtained. That is, similar to the method described in the second embodiment, the inverse algorithm of the Durbin method is used to obtain the first-order PARCOR coefficient from ψ _i and set it as μ _z . In addition to this, a method of further modifying the values of the coefficients μ _p and μ _z used for the weight filter to perform more detailed optimization adjustment is also effective. As an example, the following coefficient transformation is performed.

【０１３１】 μ_p ←ｙ_p μ_p （４６） μ_z ←ｙ_z μ_z （４７）ここでｙ_p とｙ_z は調整用の係数であり、０＜ｙ_p ＜＝
１、０＜ｙ_z ＜＝１であることが望ましい。さらなる最
適化の方法として、ＷＡ（ｚ）またはＷＢ（ｚ）または
合成フイルタの特性に応じて補償用の極零フイルタの調
整方法を変えることも最適化の方法のーつである。例え
ば、上記のフィルタのなかのいずれかのフィルタの特性
に着目し、これが高域通過型であるか低域通過型である
かに応じて調整用の係数の値を適応化させる方法も有効
と考えられる。Μ _p ← y _p μ _p (46) μ _z ← y _z μ _z (47) Here, y _p and y _z are coefficients for adjustment, and 0 <y _p <=
It is desirable that 1 and 0 < _yz <= 1. As a further optimization method, it is one of the optimization methods that the adjustment method of the pole / zero filter for compensation is changed according to the characteristics of WA (z) or WB (z) or the composite filter. For example, focusing on the characteristics of any one of the above filters, a method of adapting the value of the adjustment coefficient depending on whether this is a high-pass type or a low-pass type is also effective. Conceivable.

【０１３２】再度、図１６を参照して説明すると、重み
フイルタ情報計算部７２で求められた情報は重みフイル
タ７３と重み付き合成フイルタ８１で用いられる。重み
フイルタ７３は重みフイルタ情報計算部７２で求められ
た情報に基づいて入力音声信号を重み付けし、重み付け
られた音声信号を目標信号生成部７４に出力する。この
重み付け音声信号に基づき、目標信号生成部７４は前の
フレームの符号化による影響を重み付けられた音声信号
のレベルによって除去し、現フレームの駆動信号の符号
化に用いる目標信号を生成する。Referring again to FIG. 16, the information obtained by the weight filter information calculation unit 72 is used by the weight filter 73 and the weighted synthesis filter 81. The weight filter 73 weights the input voice signal based on the information obtained by the weight filter information calculation unit 72, and outputs the weighted voice signal to the target signal generation unit 74. Based on this weighted audio signal, the target signal generation unit 74 removes the influence of the encoding of the previous frame by the level of the weighted audio signal, and generates the target signal used for encoding the drive signal of the current frame.

【０１３３】次に、駆動信号の符号化が行われる。駆動
信号の符号化には、適応コードブック７５、雑音コード
ブック７６、ゲインコードブック７７の３つのコードブ
ックを用いる。適応コードブックは過去の駆動信号を格
納し、ピッチ周期に対応したピッチ周期を符号に持つピ
ッチベクトルにより駆動信号のピッチ周期成分を表す。
雑音コードブックは符号である雑音コードに対応した雑
音ベクトルにより駆動信号の雑音成分を表す。また、ゲ
インコードブックはピッチベクトルと雑音ベクトルのそ
れぞれの大きさを制御する目的で用いられ、ゲインコー
ドに対応したゲイン候補をゲイン付与部７８およびゲイ
ン付与部７９にて与え、ゲイン付与後のピッチベクトル
と雑音ベクトルを加算部８０で加算して駆動信号の候補
を生成する。こうして生成される駆動信号の候補は重み
付き合成フイルタ８１を通過させた後、目標信号との歪
みに対応する歪み評価値を計算するため歪み評価部８２
に入力される。符号選択部８３は計算された歪み評価値
が小さくなるような符号をそれぞれのコードブックから
探索する。Next, the drive signal is encoded. Three codebooks, an adaptive codebook 75, a noise codebook 76, and a gain codebook 77, are used to encode the drive signal. The adaptive codebook stores past drive signals, and represents a pitch period component of the drive signal by a pitch vector having a pitch period corresponding to the pitch period as a code.
The noise codebook represents the noise component of the drive signal by the noise vector corresponding to the noise code that is the code. Further, the gain code book is used for the purpose of controlling the magnitudes of the pitch vector and the noise vector, and the gain candidates corresponding to the gain code are given by the gain giving unit 78 and the gain giving unit 79, and the pitch after the gain is given is given. The addition unit 80 adds the vector and the noise vector to generate drive signal candidates. The drive signal candidates thus generated are passed through the weighted synthesis filter 81, and then the distortion evaluation unit 82 for calculating the distortion evaluation value corresponding to the distortion with the target signal.
Entered in. The code selection unit 83 searches each codebook for a code such that the calculated distortion evaluation value is small.

【０１３４】以上が駆動信号の符号探索の原理である
が、通常は探索に要する計算量を削減するため、適応コ
ードブック、雑音コードブック、ゲインコードブックの
順に段階的にコードを探索する方法が多く用いられる。
最終的に探索された駆動信号を表すコードのうち、ピッ
チ周期（適応コードブックの符号）、雑音コード（雑音
コードブックの符号）、ゲインコード（ゲインコードブ
ックの符号）はそれぞれ端子８５、８６、８７に出力さ
れる。The above is the principle of the code search of the drive signal, but in order to reduce the calculation amount normally required for the search, there is a method of searching the code stepwise in the order of the adaptive codebook, the noise codebook, and the gain codebook. Mostly used.
Of the codes representing the drive signals finally searched, the pitch period (the code of the adaptive codebook), the noise code (the code of the noise codebook), and the gain code (the code of the gain codebook) are terminals 85 and 86, respectively. It is output to 87.

【０１３５】次に、本実施例における音声符号化装置の
処理を図１７のフローチャートを用いて説明する。始め
に初期設定が行われる（ステップＳ１８０）。次に、フ
レーム毎の処理ができるように必要な分の音声信号が合
成フィルタ情報分析部７１に入力される（ステップＳ１
８１）。合成フィルタ情報分析部７１はこの音声信号を
分析し、音声信号に対応する合成フィルタのパラメータ
を抽出し、これを符号化する（ステッブＳ１８２）。次
に、重みフイルタを構成するための重みフィルタ情報が
生成される（ステップＳ１８３）。このステップはさら
に詳細にはＷＡ（ｚ）の係数の計算（ステップＳ１８
４）、ＷＡ（ｚ）の係数を用いたμ_p の計算（ステップ
Ｓ１８５）、ＷＢ（ｚ）の係数の計算（ステッブＳ１８
６）、ＷＢ（ｚ）の係数を用いたμ_z の計算（ステップ
Ｓ１８７）により行われる。Next, the processing of the speech coding apparatus according to this embodiment will be described with reference to the flowchart of FIG. Initially, initial setting is performed (step S180). Next, the necessary amount of audio signals so that the processing for each frame can be performed is input to the synthesis filter information analysis unit 71 (step S1).
81). The synthesis filter information analysis unit 71 analyzes this voice signal, extracts the parameter of the synthesis filter corresponding to the voice signal, and encodes this (step S182). Next, weight filter information for forming the weight filter is generated (step S183). More specifically, this step calculates the coefficient of WA (z) (step S18).
4), calculation of μ _p using the coefficient of WA (z) (step S185), calculation of coefficient of WB (z) (step S18)
6), calculation of μ _z using the coefficient of WB (z) (step S187).

【０１３６】次に、求められた重みフイルタの情報を用
いて重み付き音声信号が生成され（ステップＳ１８
８）、前のフレームの符号化による影響が重み付けられ
た音声信号のレベルで除かれ、現フレームの駆動信号の
符号化に用いる目標信号が生成される（ステップＳ１８
９）。この目標信号を用いて、適応コードブック探索
（Ｓ１９１）、雑音コードブック探索（Ｓ１９１）、ゲ
インコードブック探索（Ｓ１９２）を順に行うことによ
り駆動信号が符号化される。この際に用いる重み付き合
成フィルタの重みフィルタはステップＳ１８３で求めら
れたものを用いる。最後に符号化によって得られた現フ
レームの符号データを出力する。Next, a weighted voice signal is generated using the information of the obtained weight filter (step S18).
8) The influence of the encoding of the previous frame is removed by the weighted level of the audio signal, and the target signal used for encoding the drive signal of the current frame is generated (step S18).
9). The drive signal is encoded by sequentially performing adaptive codebook search (S191), noise codebook search (S191), and gain codebook search (S192) using this target signal. As the weight filter of the weighted synthesis filter used at this time, the one obtained in step S183 is used. Finally, the coded data of the current frame obtained by the coding is output.

【０１３７】上述では、ＷＡ（ｚ）からμ_p を求め、Ｗ
Ｂ（ｚ）からμ_z を求めるような例で説明したが、第１
の実施例で示したような方法により、ＷＢ（ｚ）からμ
_p を求め、ＷＡ（ｚ）からμ_z を求めるような実施例も
有効であることは言うまでもない。また、第３の実施例
で示したような２次以上の次数の極零フィルタを用いる
ことも可能である。In the above, μ _p is calculated from WA (z) and W
Although the example of obtaining μ _z from B (z) has been described,
From WB (z) by the method as shown in the embodiment of
It goes without saying that an embodiment in which _p is obtained and μ _z is obtained from WA (z) is also effective. It is also possible to use a pole-zero filter of the second or higher order as shown in the third embodiment.

【０１３８】また、以上の実施例においてピッチ成分強
調フィルタ、スペクトル包絡強調フィルタ、適応フィル
タ、ピッチ傾き補償フィルタなどの各種フィルタの順序
は任意に変えることができ、要するにこれらの各フィル
タが縦続接続されていればよい。また、重み付けフィル
タで用いる各種のフィルタの順序も任意である。Further, in the above embodiment, the order of various filters such as the pitch component emphasis filter, the spectrum envelope emphasis filter, the adaptive filter, and the pitch slope compensation filter can be arbitrarily changed, that is, these respective filters are connected in cascade. If you have. Further, the order of various filters used in the weighting filter is arbitrary.

【０１３９】さらに、以上の実施例では本発明を音声復
号化装置の最終段に適用した例について説明したが、主
観品質の向上を目的に、音声符号化／復号化システムに
おける復号音声信号以外にも、例えば音声合成装置で得
られた合成音声信号など、種々の音声信号に本発明を適
用することができる。Further, in the above embodiments, the example in which the present invention is applied to the final stage of the speech decoding apparatus has been described. However, in order to improve the subjective quality, other than the decoded speech signal in the speech encoding / decoding system. Also, the present invention can be applied to various voice signals such as a synthesized voice signal obtained by a voice synthesizer.

【０１４０】[0140]

【発明の効果】以上説明したように、本発明によれば音
声信号をＡ（ｚ）／Ｂ（ｚ）で表される極零型の伝達関
数を持つ第１のフィルタと、この第１のフィルタの特性
を補償するための第２のフィルタに通すことによって、
音声信号のスペクトル形状を調整する際、第２のフィル
タの２つのパラメータをＡ（ｚ）とＢ（ｚ）とから別々
に求めることによって、復号音声や合成音声などの音声
信号の品質を少ない計算量で効果的に改善することがで
きる。As described above, according to the present invention, the first filter having the pole-zero type transfer function of the audio signal represented by A (z) / B (z) and the first filter By passing it through a second filter to compensate the characteristics of the filter,
When adjusting the spectral shape of the speech signal, by calculating the two parameters of the second filter separately from A (z) and B (z), it is possible to reduce the quality of the speech signal such as decoded speech or synthesized speech. The amount can be effectively improved.

【０１４１】また、本発明によれば第２のフィルタにお
いてそれぞれ異なるパラメータを持つ極型フィルタと零
型フィルタによるフィルタリング処理を行うことによ
り、そのパラメータが従来の１次の零型フィルタにより
構成されたフィルタに比較して増えるため、そのフィル
タの伝達関数の表現の自由度が高くなり、柔軟にスペク
トルの傾き補償を行うことができ、音質の改善効果がさ
らに高くなる。この場合において、Ａ（ｚ）からμ_p を
求め、Ｂ（ｚ）からμ_z を求めるようにすれば、より次
数の低いフィルタ係数でスペクトルの傾きを補償するこ
とができる。According to the present invention, the second filter performs the filtering process with the polar filter and the zero filter having different parameters, so that the parameter is configured by the conventional first-order zero filter. Since the number of filters is increased as compared with the number of filters, the degree of freedom in expressing the transfer function of the filter is increased, the slope compensation of the spectrum can be flexibly performed, and the effect of improving sound quality is further enhanced. In this case, we seek mu _p from A (z), if the B (z) to determine the mu _z, it is possible to compensate for the tilt of the spectrum at lower filter coefficients of orders.

【０１４２】さらに、Ｃ₁ ＜Ｃ₃ ＜Ｃ₀ なる関係の重み
係数を用い、Ａ（ｚ）のパラメータから求めた第１の自
己相関係数が０に近いしきい値（Ｔｈとする）より小の
とき第１の自己相関係数に重み係数Ｃ₀ で重み付けを行
い、第１の自己相関係数がしきい値Ｔｈより大のとき第
１の自己相関係数に重み係数Ｃ₁ で重み付けを行うこと
により得られる値からμ_p を求め、Ｂ（ｚ）のパラメー
タから求めた第２の自己相関係数に重み係数Ｃ₃ で重み
付けを行って得られる値からμ_z を求めることとすれ
ば、第１の自己相関係数がしきい値Ｔｈよりも小さい場
合、この区間の音声は高域の強い子音のような音声であ
り、逆に第１の自己相関係数がしきい値Ｔｈよりも大き
い場合は、この区間の音声は低域の強い母音のような音
声であるから、上記のように自己相関係数としきい値Ｔ
ｈとの比較により重み係数を切り換えることにより、第
２のフィルタを子音と母音それぞれに適合した補償特性
とすることができ、音質が効果的に改善される。Further, using the weighting factors of the relationship of C ₁ <C ₃ <C ₀ , the first autocorrelation coefficient obtained from the parameter of A (z) is calculated from a threshold value (Th) which is close to 0. When it is small, the first autocorrelation coefficient is weighted with the weighting coefficient C ₀ , and when the first autocorrelation coefficient is larger than the threshold Th, the first autocorrelation coefficient is weighted with the weighting coefficient C ₁ . It is possible to obtain μ _p from the value obtained by performing the above, and to obtain μ _z from the value obtained by weighting the second autocorrelation coefficient obtained from the parameter of B (z) with the weighting coefficient C _3. For example, when the first autocorrelation coefficient is smaller than the threshold Th, the voice in this section is a strong consonant voice in the high frequency range, and conversely, the first autocorrelation coefficient has the threshold Th. If it is larger than the above, the sound in this section is a strong vowel sound in the low frequency range. Autocorrelation coefficient and threshold T
By switching the weighting coefficient by comparing with h, the second filter can have the compensation characteristics adapted to the consonant and the vowel respectively, and the sound quality is effectively improved.

【０１４３】また、本発明によれば音声信号のスペクト
ル形状を調整するためのフィルタ処理に伴う音声信号の
パワー変化を補償するゲインを調整において、音声信号
に乗じるべきゲインの正負判定を行い、これが負の場合
は所定の方法で与えられる非負の値、特に０以上、１未
満といった小さな値にゲインを置き換えることによっ
て、負のゲインを用いることに起因するゲイン調整での
音質劣化を回避することができる。Further, according to the present invention, in adjusting the gain for compensating the power change of the audio signal due to the filter processing for adjusting the spectrum shape of the audio signal, the positive / negative judgment of the gain to be multiplied by the audio signal is performed. When the gain is negative, by replacing the gain with a non-negative value given by a predetermined method, in particular, a small value such as 0 or more and less than 1, it is possible to avoid the sound quality deterioration due to the gain adjustment caused by using the negative gain. it can.

[Brief description of drawings]

【図１】第１〜第３の実施例に係るポストフィルタを組
み込んだ音声復号化装置のブロック図。FIG. 1 is a block diagram of a speech decoding apparatus incorporating post filters according to first to third embodiments.

【図２】第１の実施例に係るポストフィルタでの処理の
流れを示すフローチャート図。FIG. 2 is a flowchart showing a processing flow of a post filter according to the first embodiment.

【図３】第２の実施例に係るポストフィルタでの処理の
流れを示すフローチャート図。FIG. 3 is a flowchart showing a processing flow of a post filter according to the second embodiment.

【図４】本発明で用いる適応フィルタのブロック図。FIG. 4 is a block diagram of an adaptive filter used in the present invention.

【図５】本発明で用いる他の適応フィルタのブロック
図。FIG. 5 is a block diagram of another adaptive filter used in the present invention.

【図６】ピッチ成分強調フィルタの基本的な働きとピッ
チ成分強調処理によるスペクトルの傾き補償の原理を説
明するための図。FIG. 6 is a diagram for explaining the basic function of a pitch component emphasis filter and the principle of spectrum tilt compensation by pitch component emphasis processing.

【図７】第４の実施例に係るポストフィルタを組み込ん
だ音声復号化装置のブロック図。FIG. 7 is a block diagram of a speech decoding apparatus incorporating a post filter according to a fourth embodiment.

【図８】図７における音声信号再生部のブロック図。8 is a block diagram of an audio signal reproducing unit in FIG.

【図９】第４の実施例におけるピッチ成分強調フィルタ
の働きとピッチ成分強調処理によるスペクトルの傾き補
償の作用を説明するための図。9A and 9B are views for explaining the function of a pitch component emphasis filter and the effect of spectrum tilt compensation by the pitch component emphasis process in the fourth embodiment.

【図１０】第４の実施例における処理の流れを示すフロ
ーチャート。FIG. 10 is a flowchart showing the flow of processing in the fourth embodiment.

【図１１】第５の実施例に係るポストフィルタを組み込
んだ音声復号化装置のブロック図。FIG. 11 is a block diagram of a speech decoding apparatus incorporating a post filter according to a fifth embodiment.

【図１２】第５の実施例に係るポストフィルタでの処理
の流れを示すフローチャート。FIG. 12 is a flowchart showing the flow of processing in a post filter according to the fifth embodiment.

【図１３】第６の実施例に係るポストフィルタを組み込
んだ音声復号化装置のブロック図。FIG. 13 is a block diagram of a speech decoding apparatus incorporating a post filter according to a sixth embodiment.

【図１４】図１３におけるゲイン算出部の構成を示すブ
ロック図。FIG. 14 is a block diagram showing the configuration of a gain calculation unit in FIG.

【図１５】第６の実施例に係るポストフィルタでの処理
の流れを示すフローチャート。FIG. 15 is a flowchart showing a processing flow of a post filter according to the sixth embodiment.

【図１６】第７の実施例であり、符号化装置に適用する
音声信号のスペクトル形状調整装置FIG. 16 is a seventh embodiment and is a spectrum shape adjusting device for a voice signal applied to an encoding device.

【図１７】図１６の装置の動作を説明するためのフロー
チャート。17 is a flowchart for explaining the operation of the apparatus of FIG.

[Explanation of symbols]

１０１…パラメータ復号部１０２、１０２’…音声信号再生部１０３、１０３’…ポストフィルタ１１１…ピッチ成分強調フィルタ１１２…スペクトル包絡強調フィルタ１１３…補償フィルタ１１４…ゲイン調整部１２１…適応フィルタ１２２…フィルタ係数計算部１２３…第１のパラメータ計算部１２４…第２のパラメータ計算部２０１…合成フィルタ情報生成部２０２…駆動信号生成部２０３…第１の合成フィルタ２０４…ピッチ成分強調フィルタ２０５…ピッチ傾き補償フィルタ２０６…第１のＬＰＣ分析部２０７…第２のＬＰＣ分析部２０８…第２の合成フィルタ４０３…ポストフィルタ４１０…フィルタ処理部４１４…ゲイン調整部４１５…ゲイン算出部４１６…ゲイン乗算部４１７…加算部４２０〜４２３…計算部４２４…正ゲイン出力部４２５…ゲイン判定部４２６…ゲイン決定部２１０３…ポストフィルタ２１２１…適応フィルタ２１２２…第１のフィルタ係数計算部２１２３…第２のフィルタ係数計算部２２００…パラメータ計算部２２０１…パラメータ計算部２２０２…零型フィルタ２２０３…極型フィルタ 101 ... Parameter decoding unit 102, 102 '... Audio signal reproducing unit 103, 103 '... Post filter 111 ... Pitch component emphasis filter 112 ... Spectral envelope enhancement filter 113 ... Compensation filter 114 ... Gain adjusting section 121 ... Adaptive filter 122 ... Filter coefficient calculation unit 123 ... First parameter calculation unit 124 ... Second parameter calculator 201 ... Synthesis filter information generation unit 202 ... Drive signal generator 203 ... First synthesis filter 204 ... Pitch component emphasis filter 205 ... Pitch tilt compensation filter 206 ... First LPC analysis unit 207 ... Second LPC analysis unit 208 ... Second synthesis filter 403 ... Post filter 410 ... Filter processing unit 414 ... Gain adjusting unit 415 ... Gain calculation unit 416 ... Gain multiplication unit 417 ... Addition unit 420-423 ... Calculation unit 424 ... Positive gain output section 425 ... Gain determination unit 426 ... Gain determination unit 2103 ... Post filter 2121 ... Adaptive filter 2122 ... First filter coefficient calculation unit 2123 ... Second filter coefficient calculation unit 2200 ... Parameter calculation unit 2201 ... Parameter calculation unit 2202 ... Zero filter 2203 ... Polar filter

フロントページの続き (72)発明者赤嶺政巳神奈川県川崎市幸区小向東芝町１番地株式会社東芝研究開発センター内 (72)発明者天田皇神奈川県川崎市幸区小向東芝町１番地株式会社東芝研究開発センター内 (56)参考文献特開平３−245195（ＪＰ，Ａ) 特開平２−82710（ＪＰ，Ａ) 特表平２−502135（ＪＰ，Ａ) (58)調査した分野(Int.Cl.⁷，ＤＢ名) G10L 19/00 G10L 19/06 Front page continued (72) Inventor Masami Akamine 1 Komukai Toshiba-cho, Saiwai-ku, Kawasaki-shi, Kanagawa Toshiba Research & Development Center Co., Ltd. (72) Inventor Emperor Amata 1 Komukai-shiba-cho, Kawasaki-shi, Kanagawa Corporate Toshiba Research and Development Center (56) Reference JP-A-3-245195 (JP, A) JP-A-2-82710 (JP, A) Special Table 2-502135 (JP, A) (58) Fields investigated (Int.Cl. ⁷ , DB name) G10L 19/00 G10L 19/06

Claims

(57) [Claims]

1. A first filter having a pole-zero type transfer function represented by A (z) / B (z) for an audio signal, and a second filter for compensating for the characteristics of the first filter. In the method of adjusting the spectral shape of an audio signal by passing it through a filter, the second filter has a transfer function in the z-transform domain of (1-
μ _z z ⁻¹ ) / (1−μ _p z ⁻¹ ) (where μ _z , μ
_p is at least a first-order pole-zero type transfer function represented by mutually independent filter coefficients each having an absolute value smaller than 1. _p .

2. A first filter having a pole-zero type transfer function represented by A (z) / B (z) for an audio signal, and a second filter for compensating for the characteristic of the first filter. In the method of adjusting the spectral shape of an audio signal by passing it through a filter, two parameters used for the second filter are A
(Z) and B (z) are calculated separately, and the second filter has a transfer function of (1-μ _z
z ⁻¹ ) / (1−μ _p z ⁻¹ ) (where μ _z and μ _p are mutually independent filter coefficients having absolute values smaller than 1) and the first-order pole-zero type transmission A spectrum shape adjusting method for an audio signal, comprising at least a function.

3. The first parameter generating means for predicting a characteristic of A (z) or 1 / A (z) and generating a prediction coefficient thereof as the first parameter; and B ( z) or 1 / B (z), and a second predicting coefficient is generated as the second parameter.
3. The method of adjusting the spectrum shape of an audio signal according to claim 2, further comprising:

4. The μ _p is obtained from the A (z), and the B
2. The μ _z is obtained from (z).
Alternatively, the method of adjusting the spectrum shape of an audio signal according to the item 2.

5. When the first autocorrelation coefficient obtained from the parameter of A (z) is smaller than a threshold value close to 0 using the weighting coefficient of the relation of C1 <C3 <C0, the first self The correlation coefficient is weighted with the weighting coefficient C0, and when the first autocorrelation coefficient is larger than the threshold value, the first autocorrelation coefficient is weighted with the weighting coefficient C1. 5. The μ _p is obtained, and the μ _z is obtained from a value obtained by weighting the second autocorrelation coefficient obtained from the parameter of B (z) with a weighting coefficient C3. Shape adjustment method of the voice signal of the.

6. A method of adjusting a spectral shape of an audio signal by performing a predetermined filtering process on the audio signal, wherein the audio signal is multiplied to compensate for a power change of the audio signal due to the filtering process. When adjusting the gain
The gain for compensating the power change of the audio signal due to the filter processing is calculated by a predetermined method, the positive / negative judgment of the gain is performed before multiplying the audio signal, and if the determination result is negative, the predetermined method is used. A method for adjusting a spectral shape of an audio signal, characterized by replacing a gain with a given non-negative value.

7. A method of adjusting a spectral shape of an audio signal by performing a predetermined filtering process on the audio signal, wherein the audio signal is multiplied to compensate for a power change of the audio signal due to the filtering process. When adjusting the gain
A gain for compensating the power change of the audio signal due to the filtering process is calculated by a predetermined method, and whether the gain is positive or negative is determined before being multiplied to the audio signal. If the determination result is negative, 0 or more, 1 A method for adjusting the spectral shape of an audio signal, characterized in that the gain is replaced with a non-negative value less than.

8. A first filter having a pole-zero type transfer function for enhancing a spectrum envelope of an audio signal and a second filter for compensating for a spectrum inclination by the first filter are cascade-connected. A step of independently obtaining two filter coefficients used in the second filter for compensating the slope of the spectrum from the pole-zero type transfer function, and the pole-zero type according to the obtained filter coefficient. Compensating the slope of the spectrum corresponding to each transfer function, the second filter has a transfer function in the z-transform domain of (1-μ
_z z ^-1 ) / (1-μ _p z ^-1 ), where μ _z and μ _p are first-order pole-zero type transmissions represented by mutually independent filter coefficients having absolute values smaller than 1. A method for adjusting a spectral shape of an audio signal, the method including at least a function.

9. A first filter having a pole-zero type transfer function for emphasizing a spectrum envelope of an audio signal, and an input from the first filter for compensating for a slope of a spectrum by the first filter. From the pole-zero type transfer function, 2
A filter for compensating for the slope of the spectrum corresponding to each of the pole-zero type transfer functions by filtering the audio signal output from the first filter in accordance with a calculation section for independently calculating two filter coefficients and the calculated filter coefficient. And a second filter having a section, the second filter has a transfer function of (1-μ _z
z ⁻¹ ) / (1−μ _p z ⁻¹ ), where μ _z and μ _p are first-order pole-zero type transfer functions represented by mutually independent filter coefficients having absolute values smaller than 1. A spectrum shape adjusting device for an audio signal, comprising at least:

10. A synthesis filter analyzer for analyzing an input voice signal and outputting synthesis filter data, a filter data calculator for calculating weighting filter data based on the synthesis filter data from the synthesis filter analyzer, and a weighting filter. A weighting filter for filtering an input audio signal based on the data, the weighting filter having a pole-zero transfer function constituted by using the weighting filter data, and compensating for the slope of the spelltle by the first filter. A second filter having a transfer function for
The transfer function of the conversion region is (1-μ _z z ^-1 ) / (1-μ _p z
^-1 ), where μ _z and μ _p include at least a first-order pole-zero type transfer function represented by mutually independent filter coefficients having absolute values smaller than 1, .