JP2000250597A

JP2000250597A - Lsp correcting device, voice encoding device, and voice decoding device

Info

Publication number: JP2000250597A
Application number: JP11047079A
Authority: JP
Inventors: Hirohisa Tazaki; 裕久田崎
Original assignee: Mitsubishi Electric Corp
Current assignee: Mitsubishi Electric Corp
Priority date: 1999-02-24
Filing date: 1999-02-24
Publication date: 2000-09-14

Abstract

PROBLEM TO BE SOLVED: To obtain auditory equal effects over the entire frequency band by correcting an LSP(line spectrum pair) converted into the frequency range corresponding to auditory characteristics and putting the frequency range of the LSP back to a linear frequency range. SOLUTION: A bark conversion part 1 when inputting the LSP converts the respective dimensional values of the LSP from the linear frequency range into a bark frequency range and outputs a bark-LSP as the conversion result to an LSP deformation part 2. An inter-dimension distance calculation part 3 of the LSP deformation part 2 once receiving the bark-LSP from the bark conversion part 1 calculates inter-adjacent-dimension distances of the bark-LSP in order and outputs the inter-adjacent-dimension distances to an inter-dimension distance expansion part 4 of the LSP conversion part 2. The expansion part 4 once receiving the inter-adjacent-dimension distance of the bark-LSP from the inter-dimension distance calculation part 3 compares it with a specific threshold, and corrects a dimension value when the threshold is not exceeded and performs a process for widening the inter-adjacent-dimension distance, thereby outputting the corrected bark-LSP to a bark reconversion part 5.

Description

DETAILED DESCRIPTION OF THE INVENTION

【０００１】[0001]

【発明の属する技術分野】この発明は、音声符号化装置
や音声復号化装置が使用するスペクトルパラメータであ
るＬＳＰ（線スペクトル対）を補正するＬＳＰ補正装
置、ディジタル音声信号を少ない情報量に圧縮する音声
符号化装置、及び音声符号を復号化してディジタル音声
信号を再生する音声復号化装置に関するものである。BACKGROUND OF THE INVENTION 1. Field of the Invention The present invention relates to an LSP correction device for correcting an LSP (line spectrum pair), which is a spectrum parameter used by a voice encoding device and a voice decoding device, and to compress a digital voice signal into a small amount of information. The present invention relates to an audio encoding device and an audio decoding device that decodes an audio code to reproduce a digital audio signal.

【０００２】[0002]

【従来の技術】ＬＳＰをスペクトルパラメータとして使
用する従来の音声符号化装置と音声復号化装置として
は、特開平４−５７００号公報及び特開平５−２７３９
９７号公報に開示されているものがある。特開平４−５
７００号公報（以下、従来例１と称する）は、ＬＳＰ符
号化誤差や伝送路誤りによる復号音の品質劣化を抑制す
ることを目的として、音声符号化装置内と音声復号化装
置内に同様なＬＳＰ補正手段を備えるようにしたもので
ある。2. Description of the Related Art Conventional speech coding apparatuses and speech decoding apparatuses using an LSP as a spectrum parameter are disclosed in JP-A-4-5700 and JP-A-5-2739.
No. 97 is disclosed. JP-A-4-5
Japanese Patent Application Publication No. 700 (hereinafter referred to as Conventional Example 1) has the same structure in a speech encoding device and a speech decoding device for the purpose of suppressing quality degradation of decoded sound due to LSP encoding errors and transmission path errors. An LSP correction means is provided.

【０００３】ここにおけるＬＳＰ補正方法としては、隣
接次元間の距離を算出して、これが閾値を下回るとき
に、その閾値まで間隔を広げる方法が開示されている。
具体的には、補正対象のＬＳＰをω（ｋ）、ｋ＝１〜Ｍ
とすると、隣接次元間距離ｄ（ｋ）は下記に示すように
なる。ｄ（ｋ）＝ω（ｋ＋１）−ω（ｋ）As the LSP correction method, a method is disclosed in which a distance between adjacent dimensions is calculated, and when the distance is smaller than a threshold, the interval is increased to the threshold.
Specifically, the LSP to be corrected is ω (k), k = 1 to M
Then, the distance d (k) between adjacent dimensions is as shown below. d (k) = ω (k + 1) −ω (k)

【０００４】この隣接次元間距離ｄ（ｋ）が閾値Ｄを下
回ると、下記に示すようにＬＳＰの補正処理を実行す
る。ただし、ω’が補正後のＬＳＰである。 ω’（ｋ）＝｛ω（ｋ）＋ω（ｋ＋１）｝／２−Ｄ／２ ω’（ｋ＋１）＝｛ω（ｋ）＋ω（ｋ＋１）｝／２＋Ｄ／２When the distance d (k) between adjacent dimensions falls below the threshold value D, an LSP correction process is executed as described below. Here, ω ′ is the corrected LSP. ω ′ (k) = {ω (k) + ω (k + 1)} / 2−D / 2 ω ′ (k + 1) = {ω (k) + ω (k + 1)} / 2 + D / 2

【０００５】特開平５−２７３９９７号公報（以下、従
来例２と称する）は、品質の劣化（量子化ノイズ）した
復号音を再分析してスペクトルパラメータを算出したと
きに起こる分析時の不安定化や量子化ノイズの影響を軽
減することを目的として、バックワード型のＣＥＬＰ系
の音声符号化装置と音声復号化装置内において、線形予
測分析手段によって得られたスペクトルパラメータに対
してＬＳＰ上での制御を行うＬＳＰ制御手段を備えるよ
うにしたものである。Japanese Unexamined Patent Publication No. Hei 5-273997 (hereinafter referred to as Conventional Example 2) discloses an instability at the time of analysis which occurs when a decoded sound having deteriorated quality (quantization noise) is re-analyzed to calculate a spectrum parameter. For the purpose of reducing the influence of quantization and quantization noise, in the backward type CELP-based speech coding apparatus and speech decoding apparatus, the spectral parameters obtained by the linear prediction analysis means are converted on the LSP. LSP control means for performing the above control is provided.

【０００６】ＬＳＰ制御手段としては、算出されたＬＳ
Ｐと予め固定的に与えておいたＬＳＰとを結合係数を用
いて線形加算する方法が開示されている。具体的には、
結合係数β、固定ＬＳＰをω_０とすれば、線形加算によ
って得られるＬＳＰは、下記の通りとなる。 ω’（ｋ）＝ω（ｋ）・β＋ω_０（ｋ）（１−β）[0006] As the LSP control means, the calculated LS
A method of linearly adding P and an LSP fixedly given in advance using a coupling coefficient is disclosed. In particular,
Coupling coefficient beta, if the fixed LSP and omega _0, LSP obtained by linear addition becomes as follows. ω ′ (k) = ω (k) · β + ω ₀ (k) (1−β)

【０００７】また、従来例２には、上記結合係数βの値
を制御するために用いるＬＳＰの隣接次元間距離の許容
限界値Ｄ（ｋ）を各次元毎に与えることが開示されてい
る。この許容限界値Ｄ（ｋ）の設定は、量子化ノイズが
含まれない入力音声を予め分析した線形予測係数より求
めたＬＳＰの隣接次元間距離に基づいて行われる。従来
例２では、この固定値であるＤ（ｋ）と、隣接次元間距
離ｄ（ｋ）を比較して、全ての次数でｄ（ｋ）＞Ｄ
（ｋ）となるように結合係数βを制御する。ｄ（ｋ）＝ω（ｋ＋１）−ω（ｋ）Further, the second conventional example discloses that an allowable limit value D (k) of the distance between adjacent dimensions of the LSP used for controlling the value of the coupling coefficient β is given for each dimension. The setting of the permissible limit value D (k) is performed based on the distance between adjacent dimensions of the LSP obtained from a linear prediction coefficient obtained by previously analyzing an input speech that does not include quantization noise. In Conventional Example 2, the fixed value D (k) is compared with the distance d (k) between adjacent dimensions, and d (k)> D for all orders.
The coupling coefficient β is controlled so as to satisfy (k). d (k) = ω (k + 1) −ω (k)

【０００８】ＬＳＰの補正処理を使用した従来のポスト
フィルタ(音声復号化装置における後処理フィルタ)とし
ては、特開平８−３０５３９７号公報に開示されている
ものがある。特開平８−３０５３９７号公報（以下、従
来例３と称する）は、ポストフィルタの設計自由度を高
めるために、ＬＳＰ補正処理によって算出した補正ＬＳ
Ｐによって音声強調処理を行うようにしたものである。A conventional post-filter (post-processing filter in an audio decoding device) using the LSP correction process is disclosed in Japanese Patent Application Laid-Open No. 8-305397. Japanese Patent Application Laid-Open No. 8-305397 (hereinafter referred to as Conventional Example 3) discloses a correction LS calculated by an LSP correction process in order to increase the degree of freedom in designing a post filter.
The voice emphasis processing is performed by P.

【０００９】ここにおけるＬＳＰ補正処理としては、従
来例２と同様の式で表されるＬＳＰ上の内分処理、従来
例１と類似する隣接次元間距離を広げる処理が開示され
ている。但し、ここでは隣接次元間距離が閾値未満の場
合に、その部分より高次のＬＳＰを一括して上にずらす
ことで隣接次元間距離を閾値まで広げ、（低次から順
に）全ての隣接次元に対する処理を行った結果、上にず
らした合計距離分だけ、均等に全隣接次元間距離を縮め
るという方法を開示している。As the LSP correction process, there are disclosed an internal division process on the LSP represented by the same formula as that of the conventional example 2 and a process of increasing the distance between adjacent dimensions similar to the conventional example 1. However, here, when the distance between adjacent dimensions is smaller than the threshold value, the distance between adjacent dimensions is increased to the threshold value by shifting the higher-order LSP collectively above that portion, and all the adjacent dimensions (in order from the lower order) As a result of performing the processing for, the method of uniformly reducing the distance between all adjacent dimensions by the total distance shifted upward is disclosed.

【００１０】[0010]

【発明が解決しようとする課題】従来の音声符号化装置
及び音声復号化装置は以上のように構成されているの
で、従来例１及び従来例３の場合、隣接次元間距離を広
げるＬＳＰの補正処理を実行する際、入力ＬＳＰの値に
依らず、固定的な閾値を使用するようにしている。しか
し、本来低域と高域では、線形周波数領域で同じ帯域幅
であっても聴覚器官内部の基底膜状の帯域幅は大きく異
なっており、線形周波数領域で固定的な閾値で行われる
補正の効果は低域と高域で大きく異なっている。このた
め全帯域に渡って適切な補正が行われない課題があっ
た。Since the conventional speech coding apparatus and speech decoding apparatus are constructed as described above, in the case of the prior art examples 1 and 3, the correction of the LSP for increasing the distance between adjacent dimensions is performed. When executing the process, a fixed threshold is used regardless of the value of the input LSP. However, in the low frequency band and the high frequency band, even though the bandwidth is the same in the linear frequency domain, the bandwidth of the basilar membrane inside the auditory organ is greatly different, and the correction performed with a fixed threshold in the linear frequency domain The effect is very different between low and high frequencies. For this reason, there has been a problem that appropriate correction is not performed over the entire band.

【００１１】例えば、音声符号化装置におけるＬＳＰ量
子化誤差の影響を軽減するためにＬＳＰ補正処理を導入
した場合を考える。閾値Ｄを使用することによって、低
域では振幅不安定を抑制できているとしても、同じ閾値
Ｄでは高域に聴覚的に気になる程度に急峻な極を抑制す
ることができない場合がある。逆に高域の急峻な極を抑
制できる程度まで閾値Ｄを大きく設定すると、低域の極
が広がりすぎて平均的な符号化歪が大きく劣化してしま
う。この様に、従来の周波数帯域に依らない閾値を使用
した次元間距離拡張処理には、適用した音声符号化装置
及び音声復号化装置の符号化復号化品質の劣化を解消で
きなかったり、逆に劣化をもたらしてしまう課題があっ
た。For example, consider a case where an LSP correction process is introduced to reduce the effect of LSP quantization error in a speech encoding device. Even if the amplitude instability can be suppressed in the low frequency range by using the threshold value D, there may be a case where the same threshold value D cannot suppress the steep pole to such an extent as to be auditoryly noticeable in the high frequency range. Conversely, if the threshold D is set large enough to suppress the steep poles in the high band, the poles in the low band will be too wide and the average coding distortion will be greatly degraded. As described above, in the conventional inter-dimension distance extension processing using a threshold independent of the frequency band, deterioration of the coding / decoding quality of the applied speech coding apparatus and speech decoding apparatus cannot be eliminated, or conversely, There was a problem of causing deterioration.

【００１２】さらに、音声符号化装置、音声復号化装
置、音声符号化装置、音声復号化装置、というように多
段階に接続される場合を考える。この場合、最初の音声
符号化装置と音声復号化装置によって、量子化ノイズや
ポストフィルタ等によってある程度の劣化やスペクトル
変形が導入される。このため、２つ目の音声符号化装置
と音声復号化装置内では、通常の入力音声には存在しな
いような急峻な極を持つＬＳＰが生成されることがあ
り、これを適切に補正しないと歪感の大きい符号化復号
化音が生成されてしまう。背景雑音が重畳した場合にも
同様の問題が発生する。従来のＬＳＰ補正手段では、高
域にこの様な歪感を生じる急峻な極がある場合でも、上
述の通り十分に補正できない課題があった。繰り返す
と、高域の急峻な極を抑制できる程度までＬＳＰの補正
を強く設定すると、低域の極が広がりすぎて平均的な符
号化歪が大きく劣化してしまう課題があった。[0012] Further, consider a case where connections are made in multiple stages, such as a speech coding apparatus, a speech decoding apparatus, a speech coding apparatus, and a speech decoding apparatus. In this case, the first speech coding apparatus and the first speech decoding apparatus introduce a certain degree of deterioration or spectrum deformation due to quantization noise, post-filter, or the like. For this reason, in the second speech encoding device and the speech decoding device, an LSP having a steep pole that does not exist in the normal input speech may be generated, and unless this is appropriately corrected. An encoded / decoded sound with a large sense of distortion is generated. A similar problem occurs when background noise is superimposed. As described above, the conventional LSP correction means has a problem in that even when there is a steep pole that causes such a sense of distortion in a high frequency band, it cannot be sufficiently corrected as described above. To repeat, if the LSP correction is set strong enough to suppress the steep pole in the high band, there is a problem that the pole in the low band is too wide and the average coding distortion is greatly deteriorated.

【００１３】従来例２及び従来例３に開示された内分処
理を用いてＬＳＰの補正処理を実行する場合、結合係数
βを適応制御したり次元毎に値を設定することも可能で
あるが、やはり補正対象のＬＳＰの周波数上における値
と関係なく補正が行われるため、この補正による聴覚的
な影響が周波数毎に差異が出て、適用した音声符号化装
置及び音声復号化装置の符号化復号化品質の劣化を解消
できなかったり、逆に劣化をもたらしてしまう課題があ
った。When the LSP correction processing is executed using the internal division processing disclosed in Conventional Examples 2 and 3, it is possible to adaptively control the coupling coefficient β or set a value for each dimension. Also, since the correction is performed irrespective of the value on the frequency of the LSP to be corrected, the perceptual effect of this correction differs for each frequency, and the coding of the applied voice coding apparatus and voice decoding apparatus is performed. There has been a problem that the deterioration of the decoding quality cannot be eliminated, or on the contrary, the deterioration occurs.

【００１４】従来例２では、各次元毎に隣接次元間距離
の許容限界値Ｄ（ｋ）を予め与えて結合係数βの制御に
使用しているが、同じ次元内でもＬＳＰの値は大きな分
散を持っているため、次元毎に１つの許容限界値Ｄ
（ｋ）を設定すると誤差が大きくなる。In the second conventional example, the allowable limit value D (k) of the distance between adjacent dimensions is given in advance for each dimension and used for controlling the coupling coefficient β. , One permissible limit value D per dimension
Setting (k) increases the error.

【００１５】この発明は上記のような課題を解決するた
めになされたもので、全周波数帯域に渡って聴覚的に均
等な効果をもたらすことができるＬＳＰ補正装置を得る
ことを目的とする。また、この発明は、符号化品質を高
めることができる音声符号化装置を得ることを目的とす
る。さらに、この発明は、復号化品質を高めることがで
きる音声復号化装置を得ることを目的とする。SUMMARY OF THE INVENTION The present invention has been made to solve the above-described problems, and has as its object to provide an LSP correction device capable of providing an acoustically uniform effect over the entire frequency band. Another object of the present invention is to provide a speech coding apparatus capable of improving coding quality. Another object of the present invention is to provide a speech decoding device capable of improving the decoding quality.

【００１６】[0016]

【課題を解決するための手段】この発明に係るＬＳＰ補
正装置は、聴覚的特性に対応する周波数領域に変換され
たＬＳＰを補正し、そのＬＳＰの周波数領域を線形周波
数領域に戻すようにしたものである。An LSP correction apparatus according to the present invention corrects an LSP converted to a frequency domain corresponding to auditory characteristics, and returns the LSP frequency domain to a linear frequency domain. It is.

【００１７】この発明に係るＬＳＰ補正装置は、ＬＳＰ
の周波数領域を線形周波数領域からバーク周波数領域に
変換して補正するようにしたものである。[0017] The LSP correction apparatus according to the present invention comprises an LSP
Is converted from the linear frequency domain to the Bark frequency domain to make correction.

【００１８】この発明に係るＬＳＰ補正装置は、ＬＳＰ
の周波数領域を線形周波数領域からメル周波数領域に変
換して補正するようにしたものである。The LSP correction device according to the present invention is an LSP correction device.
Is converted from the linear frequency domain to the mel frequency domain for correction.

【００１９】この発明に係るＬＳＰ補正装置は、ＬＳＰ
の周波数領域を線形周波数領域から対数周波数領域に変
換して補正するようにしたものである。An LSP correction device according to the present invention is an LSP correction device.
Is converted from the linear frequency domain to the logarithmic frequency domain for correction.

【００２０】この発明に係るＬＳＰ補正装置は、ＬＳＰ
の各次元値を基準にして、隣接次元間距離に関する閾値
を各次元毎に算出し、各次元毎の閾値に基づいてＬＳＰ
を補正するようにしたものである。An LSP correction device according to the present invention is an LSP correction device.
The threshold value for the distance between adjacent dimensions is calculated for each dimension on the basis of each dimension value of LSP, and the LSP is calculated based on the threshold value for each dimension.
Is corrected.

【００２１】この発明に係るＬＳＰ補正装置は、ＬＳＰ
の周波数領域を線形周波数領域から聴覚的特性に対応す
る周波数領域に変換し、その聴覚的特性に対応する周波
数領域で閾値を算出するようにしたものである。An LSP correction apparatus according to the present invention
Is converted from the linear frequency domain to the frequency domain corresponding to the auditory characteristics, and the threshold value is calculated in the frequency domain corresponding to the auditory characteristics.

【００２２】この発明に係るＬＳＰ補正装置は、聴覚的
特性に対応する周波数領域で算出した閾値と、線形周波
数領域で定義された閾値を比較し、大きい方の閾値を補
正手段に出力するようにしたものである。An LSP correction apparatus according to the present invention compares a threshold calculated in a frequency domain corresponding to an auditory characteristic with a threshold defined in a linear frequency domain, and outputs a larger threshold to the correction means. It was done.

【００２３】この発明に係るＬＳＰ補正装置は、ＬＳＰ
の周波数領域を線形周波数領域からバーク周波数領域に
変換して閾値を算出するようにしたものである。The LSP correction device according to the present invention is an LSP correction device.
Is converted from the linear frequency domain to the Bark frequency domain to calculate the threshold.

【００２４】この発明に係るＬＳＰ補正装置は、ＬＳＰ
の周波数領域を線形周波数領域からメル周波数領域に変
換して閾値を算出するようにしたものである。The LSP correction device according to the present invention is an LSP correction device.
Is converted from the linear frequency domain to the mel frequency domain to calculate the threshold value.

【００２５】この発明に係るＬＳＰ補正装置は、ＬＳＰ
の周波数領域を線形周波数領域から対数周波数領域に変
換して閾値を算出するようにしたものである。The LSP correction device according to the present invention is an LSP correction device.
Is converted from the linear frequency domain to the logarithmic frequency domain to calculate the threshold.

【００２６】この発明に係る音声符号化装置は、聴覚特
性に対応する周波数領域で補正されたＬＳＰを符号化し
てＬＳＰ符号と量子化ＬＳＰを出力するようにしたもの
である。A speech encoding apparatus according to the present invention encodes an LSP corrected in a frequency domain corresponding to auditory characteristics and outputs an LSP code and a quantized LSP.

【００２７】この発明に係る音声符号化装置は、ＬＳＰ
の各次元値を基準にして算出した各次元毎の閾値を用い
て補正されたＬＳＰを符号化してＬＳＰ符号と量子化Ｌ
ＳＰを出力するようにしたものである。[0027] The speech encoding apparatus according to the present invention comprises an LSP
The corrected LSP is encoded using a threshold value for each dimension calculated based on each dimension value of
The SP is output.

【００２８】この発明に係る音声符号化装置は、聴覚特
性に対応する周波数領域で補正された復号ＬＳＰと入力
音声から符号化音源を算出するようにしたものである。[0028] A speech encoding apparatus according to the present invention calculates an encoded sound source from a decoded LSP corrected in a frequency domain corresponding to auditory characteristics and an input speech.

【００２９】この発明に係る音声符号化装置は、ＬＳＰ
の各次元値を基準にして算出した各次元毎の閾値を用い
て補正された復号ＬＳＰと入力音声から符号化音源を算
出するようにしたものである。[0029] The speech encoding apparatus according to the present invention comprises an LSP
The encoded excitation is calculated from the decoded LSP and the input speech corrected using the threshold value for each dimension calculated on the basis of each dimension value.

【００３０】この発明に係る音声復号化装置は、聴覚特
性に対応する周波数領域で補正された復号ＬＳＰと音源
復号化手段により生成された音源信号から合成音を生成
するようにしたものである。The speech decoding apparatus according to the present invention is configured to generate a synthesized sound from a decoded LSP corrected in a frequency domain corresponding to the auditory characteristics and a sound source signal generated by a sound source decoding unit.

【００３１】この発明に係る音声復号化装置は、ＬＳＰ
の各次元値を基準にして算出した各次元毎の閾値を用い
て補正された復号ＬＳＰと音源復号化手段により生成さ
れた音源信号から合成音を生成するようにしたものであ
る。[0031] The speech decoding apparatus according to the present invention comprises an LSP.
The synthesized sound is generated from the decoded LSP corrected by using the threshold value for each dimension calculated based on each dimension value and the sound source signal generated by the sound source decoding means.

【００３２】この発明に係る音声復号化装置は、聴覚特
性に対応する周波数領域で補正された復号ＬＳＰを用い
て合成音に対するスペクトル強調処理を実行するように
したものである。The speech decoding apparatus according to the present invention executes a spectrum emphasis process on a synthesized sound using a decoded LSP corrected in a frequency domain corresponding to auditory characteristics.

【００３３】この発明に係る音声復号化装置は、ＬＳＰ
の各次元値を基準にして算出した各次元毎の閾値を用い
て補正された復号ＬＳＰを用いて合成音に対するスペク
トル強調処理を実行するようにしたものである。[0033] The speech decoding apparatus according to the present invention comprises an LSP
The spectrum emphasizing process is performed on the synthesized sound using the decoded LSP corrected using the threshold value for each dimension calculated based on each dimension value.

【００３４】[0034]

【発明の実施の形態】以下、この発明の実施の一形態を
説明する。実施の形態１．図１はこの発明の実施の形態
１によるＬＳＰ補正装置を示す構成図であり、図におい
て、１はスペクトルパラメータであるＬＳＰを入力する
と、そのＬＳＰの周波数領域を線形周波数領域からｂａ
ｒｋ周波数領域（聴覚的特性に対応する周波数領域）に
変換するｂａｒｋ変換部（変換手段）、２はｂａｒｋ変
換部１により周波数領域が変換されたＬＳＰを補正する
ＬＳＰ変形部（補正手段）、３はＬＳＰの隣接次元間距
離を算出する次元間距離算出部、４は次元間距離算出部
３により算出された隣接次元間距離が所定の閾値を下回
るとき、その隣接次元間距離を広げる次元間距離拡張
部、５はＬＳＰ変形部２により補正されたＬＳＰの周波
数領域を線形周波数領域に戻すｂａｒｋ逆変換部（逆変
換手段）である。DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENTS One embodiment of the present invention will be described below. Embodiment 1 FIG. FIG. 1 is a block diagram showing an LSP correction apparatus according to Embodiment 1 of the present invention. In FIG. 1, when an LSP which is a spectrum parameter is input, the frequency domain of the LSP is shifted from the linear frequency domain to ba.
a bark transformation unit (conversion unit) 2 for converting to an rk frequency region (a frequency region corresponding to auditory characteristics); an LSP transformation unit (correction unit) 3 for correcting the LSP whose frequency domain has been transformed by the bark transformation unit 1; Is an inter-dimensional distance calculating unit that calculates the distance between adjacent dimensions of the LSP, and 4 is an inter-dimensional distance that increases the distance between adjacent dimensions when the distance between adjacent dimensions calculated by the inter-dimensional distance calculating unit 3 falls below a predetermined threshold. The extension unit 5 is a bark inverse transform unit (inverse transform unit) that returns the frequency domain of the LSP corrected by the LSP transforming unit 2 to the linear frequency domain.

【００３５】次に動作について説明する。まず、ＬＳＰ
はｂａｒｋ変換部１に入力されるが、ＬＳＰは音声符号
化装置や音声復号化装置等で算出されたり、復号化され
たりして得られたものであり、ＬＳＰパラメータの安定
化やスペクトルの平坦化のために、ＬＳＰ補正装置に入
力されるものである。Next, the operation will be described. First, LSP
Is input to the bark transform unit 1, and the LSP is obtained by being calculated or decoded by a speech coding device or a speech decoding device, and is used to stabilize LSP parameters and to flatten a spectrum. Is input to the LSP correction device for the purpose of conversion.

【００３６】ｂａｒｋ変換部１は、ＬＳＰを入力する
と、そのＬＳＰの各次元値を線形周波数領域からｂａｒ
ｋ周波数領域に変換し、その変換結果であるｂａｒｋ−
ＬＳＰをＬＳＰ変形部２に出力する。なお、線形周波数
領域からｂａｒｋ周波数領域への変換は、テーブル引き
に基づいて計算する方法と、以下に示すような近似式に
従って変換する方法がある。When the LSP is input, the bark transform unit 1 converts each dimension value of the LSP from the linear frequency domain into a bar.
k-frequency domain, and the result of the conversion, bark-
The LSP is output to the LSP transformation unit 2. The conversion from the linear frequency domain to the bark frequency domain includes a method of calculating based on a table lookup and a method of performing conversion according to an approximate expression as shown below.

【００３７】[0037]

【数１】 (Equation 1)

【００３８】そして、ＬＳＰ変形部２の次元間距離算出
部３は、ｂａｒｋ変換部１からｂａｒｋ−ＬＳＰを受け
ると、ｂａｒｋ−ＬＳＰの隣接次元間距離を順番に算出
して、その隣接次元間距離をＬＳＰ変形部２の次元間距
離拡張部４に出力する。When the distance-to-dimension calculating unit 3 of the LSP transforming unit 2 receives the bark-LSP from the barrier transforming unit 1, it calculates the distance between adjacent dimensions of the bark-LSP in order, and calculates the distance between adjacent dimensions. Is output to the inter-dimensional distance extension unit 4 of the LSP transformation unit 2.

【００３９】そして、次元間距離拡張部４は、次元間距
離算出部３からｂａｒｋ−ＬＳＰの隣接次元間距離を受
けると、その隣接次元間距離を所定の閾値Ｄと比較し、
その隣接次元間距離が閾値Ｄを下回る場合には、ｂａｒ
ｋ−ＬＳＰの対応する次元値を補正して、その隣接次元
間距離を広げる処理を実行し、その補正ｂａｒｋ−ＬＳ
Ｐをｂａｒｋ逆変換部５に出力する。Upon receiving the distance between adjacent dimensions of the bark-LSP from the interdimensional distance calculation section 3, the interdimensional distance extension section 4 compares the distance between adjacent dimensions with a predetermined threshold value D.
If the distance between adjacent dimensions is less than the threshold D, bar
A process for correcting the corresponding dimension value of the k-LSP to increase the distance between adjacent dimensions is performed, and the corrected bark-LS
P is output to the inverse inverse transform unit 5.

【００４０】具体的には、補正対象のｂａｒｋ−ＬＳＰ
をｂ（ｋ）、ｋ＝１〜Ｍとすると、隣接次元間距離ｄ
（ｋ）は下記に示すようになるので、この隣接次元間距
離ｄ（ｋ）が閾値Ｄを下回ると、下記に示すようにｂａ
ｒｋ−ＬＳＰの補正処理を実行する。ただし、ｂ’が補
正後のｂａｒｋ−ＬＳＰである。ｄ（ｋ）＝ｂ（ｋ＋１）−ｂ（ｋ）ｂ’（ｋ）＝｛ｂ（ｋ）＋ｂ（ｋ＋１）｝／２−Ｄ／２ｂ’（ｋ＋１）＝｛ｂ（ｋ）＋ｂ（ｋ＋１）｝／２＋Ｄ／２Specifically, the bark-LSP to be corrected is
Where b (k) and k = 1 to M, the distance d between adjacent dimensions
(K) is as shown below, and when the distance d (k) between adjacent dimensions falls below the threshold value D, as shown below, ba
The rk-LSP correction process is performed. Here, b ′ is the corrected bark-LSP. d (k) = b (k + 1) -b (k) b '(k) = {b (k) + b (k + 1)} / 2-D / 2b' (k + 1) = {b (k) + b (k + 1) )｝ / 2 + D / 2

【００４１】なお、次元間距離拡張部４における拡張処
理は、これに限られるものではなく、特開平８−３０５
３９７号公報に開示されている方法等様々なものを用い
ることができる。さらに、ＬＳＰ変形部２の構成につい
ても、この実施の形態１の構成に限定されるものではな
い。The extension processing in the inter-dimension distance extension unit 4 is not limited to this, but is described in Japanese Patent Laid-Open No. 8-305.
Various methods such as the method disclosed in Japanese Patent No. 397 can be used. Further, the configuration of the LSP deforming unit 2 is not limited to the configuration of the first embodiment.

【００４２】そして、ｂａｒｋ逆変換部５は、ＬＳＰ変
形部２から補正ｂａｒｋ−ＬＳＰを受けると、その補正
ｂａｒｋ−ＬＳＰの各次元値に対して、ｂａｒｋ周波数
領域から線形周波数領域に戻す逆変換処理を実行し、そ
の逆変換結果である補正ＬＳＰを出力する。なお、ｂａ
ｒｋ周波数領域から線形周波数領域への変換は、テーブ
ル引きに基づいて計算する方法と、以下に示すような近
似式に従って変換する方法がある。この式はｂａｒｋ変
換部１における近似式の逆変換に相当する。When receiving the corrected bark-LSP from the LSP transforming unit 2, the bark inverse transform unit 5 performs an inverse transform process for returning each dimension value of the corrected bark-LSP from the bark frequency domain to the linear frequency domain. And outputs a corrected LSP as a result of the inverse conversion. In addition, ba
The conversion from the rk frequency domain to the linear frequency domain includes a method of calculating based on a table lookup and a method of performing conversion according to an approximate expression as shown below. This expression corresponds to the inverse conversion of the approximate expression in the bark conversion unit 1.

【００４３】[0043]

【数２】 (Equation 2)

【００４４】図２はこの実施の形態１によるＬＳＰ補正
装置の補正結果を説明する説明図である。図２（ａ）は
入力ＬＳＰであり、この入力ＬＳＰは、説明の簡単のた
めに６次としているが、一般にＬＳＰは８〜１４次程度
が使用される。図２（ｂ）は入力ＬＳＰをｂａｒｋ変換
部１により変換したｂａｒｋ−ＬＳＰの例である。ｂａ
ｒｋ周波数領域に変換すると、低周波数帯域の隣接次元
間距離が広がり、高周波数帯域の隣接次元間距離が狭く
なる。これは人間の聴覚的な感覚に良く対応する。図２
（ｃ）はｂａｒｋ−ＬＳＰに対してＬＳＰ変形部２によ
る変形を行った結果である補正ｂａｒｋ−ＬＳＰの例で
ある。ｂａｒｋ周波数領域における隣接次元間距離に関
する閾値Ｄに基づいてω５とω６の次元間距離が広げら
れている。FIG. 2 is an explanatory diagram for explaining a correction result of the LSP correction device according to the first embodiment. FIG. 2A shows an input LSP. The input LSP has a sixth order for the sake of simplicity, but generally an LSP of the order of 8 to 14 is used. FIG. 2B is an example of a bark-LSP obtained by converting the input LSP by the bark converting unit 1. ba
When converted to the rk frequency domain, the distance between adjacent dimensions in the low frequency band increases, and the distance between adjacent dimensions in the high frequency band decreases. This corresponds well to the human auditory sensation. FIG.
(C) is an example of a corrected bark-LSP obtained as a result of performing deformation of the bark-LSP by the LSP deforming unit 2. The inter-dimensional distance between ω5 and ω6 is widened based on the threshold value D regarding the distance between adjacent dimensions in the bark frequency domain.

【００４５】従来のように線形周波数領域において同様
なＬＳＰ変形処理を行った場合、図２（ａ）における次
元間距離が狭いω１とω２、ω２とω３の間が広げられ
ることになる。しかし、これは聴覚的には、ω５とω６
の間に存在する極が最も急峻に感じられることと対応が
悪い。この実施の形態１の方法では良好にω５とω６の
間隔を広げることができる。When the same LSP deformation processing is performed in the linear frequency domain as in the conventional case, the distance between ω1 and ω2 and the distance between ω2 and ω3 in FIG. However, this is aurally ω5 and ω6
The pole that exists between the two is felt most steeply, and the response is poor. According to the method of the first embodiment, the interval between ω5 and ω6 can be favorably increased.

【００４６】以上で明らかなように、この実施の形態１
によれば、入力ＬＳＰの周波数領域を線形周波数領域か
らｂａｒｋ周波数領域に変換してから補正処理を行うよ
うにしたので、全周波数帯域に渡って聴覚的に均等な効
果をもたらすことができる効果を奏する。また、音声符
号化装置と音声復号化装置が多段階に接続されたり、背
景雑音が多い場合でも、低域の劣化を引き起こすことな
く、高域に生じる急峻な極を良好に補正することができ
る効果を奏する。As is clear from the above, the first embodiment
According to the method described above, since the correction process is performed after converting the frequency domain of the input LSP from the linear frequency domain to the bark frequency domain, an effect that an auditory equal effect can be obtained over the entire frequency band. Play. Further, even when the speech encoding device and the speech decoding device are connected in multiple stages or when there is a lot of background noise, it is possible to satisfactorily correct a steep pole generated in a high frequency band without causing deterioration in a low frequency region. It works.

【００４７】なお、ｂａｒｋ変換部１の近似式を用いた
ＬＳＰ補正装置を、音声符号化装置や音声復号化装置に
適用した時に、例えば、高域の符号化特性の劣化が大き
い傾向があったり、逆に低域の補正が弱くて復号音の振
幅が不安定になるような場合には、線形周波数領域とｂ
ａｒｋ周波数領域の中間的な周波数領域に変換するよう
に近似式を調整することもできる。この場合には、当然
ｂａｒｋ逆変換部５もこれに対応して調整することが必
要である。When the LSP correction device using the approximation formula of the bark conversion unit 1 is applied to a speech coding device or a speech decoding device, for example, there is a tendency that the degradation of the high-frequency coding characteristic is large. On the contrary, when the low-frequency correction is weak and the amplitude of the decoded sound becomes unstable, the linear frequency domain and b
The approximation formula can be adjusted so as to convert to an intermediate frequency domain of the ark frequency domain. In this case, the bark inverse transform unit 5 also needs to be adjusted accordingly.

【００４８】実施の形態２．図３はこの発明の実施の形
態２によるＬＳＰ補正装置を示す構成図であり、図にお
いて、６はスペクトルパラメータであるＬＳＰを入力す
ると、そのＬＳＰの周波数領域を線形周波数領域からメ
ル周波数領域（聴覚的特性に対応する周波数領域）に変
換するメル変換部（変換手段）、７はメル変換部６によ
り周波数領域が変換されたＬＳＰを補正するＬＳＰ変形
部（補正手段）、８はＬＳＰの隣接次元間距離を算出す
る次元間距離算出部、９は次元間距離算出部８により算
出された隣接次元間距離が所定の閾値を下回るとき、そ
の隣接次元間距離を広げる次元間距離拡張部、１０はＬ
ＳＰ変形部７により補正されたＬＳＰの周波数領域を線
形周波数領域に戻すメル逆変換部（逆変換手段）であ
る。Embodiment 2 FIG. 3 is a block diagram showing an LSP correction apparatus according to Embodiment 2 of the present invention. In FIG. 3, when an LSP which is a spectrum parameter is input, the frequency domain of the LSP is changed from a linear frequency domain to a mel frequency domain (audio Transform unit (conversion means) for converting the LSP into a frequency domain corresponding to the dynamic characteristic, 7 is an LSP transformation unit (correction means) for correcting the LSP whose frequency domain has been transformed by the mel transform unit 6, and 8 is an adjacent dimension of the LSP. When the distance between adjacent dimensions calculated by the distance calculating unit 8 is smaller than a predetermined threshold, the distance between dimension calculating unit 9 for calculating the distance between dimensions is expanded by the dimension distance expanding unit 9 for expanding the distance between adjacent dimensions. L
This is a mel inverse transform unit (inverse transform unit) that returns the frequency domain of the LSP corrected by the SP transforming unit 7 to the linear frequency domain.

【００４９】次に動作について説明する。まず、ＬＳＰ
はメル変換部６に入力されるが、ＬＳＰは音声符号化装
置や音声復号化装置等で算出されたり、復号化されたり
して得られたものであり、ＬＳＰパラメータの安定化や
スペクトルの平坦化のために、ＬＳＰ補正装置に入力さ
れるものである。Next, the operation will be described. First, LSP
Is input to the mel transform unit 6, where the LSP is obtained by being calculated or decoded by a speech coding device or a speech decoding device, etc., and is used to stabilize the LSP parameters and flatten the spectrum. Is input to the LSP correction device for the purpose of conversion.

【００５０】メル変換部６は、ＬＳＰを入力すると、そ
のＬＳＰの各次元値を線形周波数領域からメル周波数領
域に変換し、その変換結果であるメルＬＳＰをＬＳＰ変
形部７に出力する。なお、線形周波数領域からメル周波
数領域への変換は、テーブル引きに基づいて計算する方
法と、以下に示すような近似式に従って変換する方法が
ある。Upon receiving the LSP, the mel transform unit 6 converts each dimension value of the LSP from the linear frequency domain to the mel frequency domain, and outputs the mel LSP, which is the result of the conversion, to the LSP transforming unit 7. The conversion from the linear frequency domain to the mel frequency domain includes a method of calculating based on a table lookup and a method of performing conversion according to an approximate expression as shown below.

【００５１】[0051]

【数３】 (Equation 3)

【００５２】そして、ＬＳＰ変形部７の次元間距離算出
部８は、メル変換部６からメルＬＳＰを受けると、メル
ＬＳＰの隣接次元間距離を順番に算出して、その隣接次
元間距離をＬＳＰ変形部７の次元間距離拡張部９に出力
する。Upon receiving the mel LSP from the mel conversion unit 6, the inter-dimensional distance calculation unit 8 of the LSP transformation unit 7 calculates the distance between adjacent dimensions of the mel LSP in order, and calculates the distance between adjacent dimensions to the LSP. Output to the inter-dimension distance extension unit 9 of the deformation unit 7.

【００５３】そして、次元間距離拡張部９は、次元間距
離算出部８からメルＬＳＰの隣接次元間距離を受ける
と、その隣接次元間距離を所定の閾値Ｄと比較し、その
隣接次元間距離が閾値Ｄを下回る場合には、メルＬＳＰ
の対応する次元値を補正して、その隣接次元間距離を広
げる処理を実行し、その補正メルＬＳＰをメル逆変換部
１０に出力する。Upon receiving the distance between adjacent dimensions of the mel LSP from the distance calculating unit 8, the distance expanding unit 9 compares the distance between adjacent dimensions with a predetermined threshold value D, and determines the distance between adjacent dimensions. Is less than the threshold D, the mel LSP
Is performed to increase the distance between adjacent dimensions, and outputs the corrected mel LSP to the mel inverse transform unit 10.

【００５４】具体的には、補正対象のメルＬＳＰをｍ
（ｋ）、ｋ＝１〜Ｍとすると、隣接次元間距離ｄ（ｋ）
は下記に示すようになるので、この隣接次元間距離ｄ
（ｋ）が閾値Ｄを下回ると、下記に示すようにメルＬＳ
Ｐの補正処理を実行する。ただし、ｍ’が補正後のメル
ＬＳＰである。ｄ（ｋ）＝ｍ（ｋ＋１）−ｍ（ｋ）ｍ’（ｋ）＝｛ｍ（ｋ）＋ｍ（ｋ＋１）｝／２−Ｄ／２ｍ’（ｋ＋１）＝｛ｍ（ｋ）＋ｍ（ｋ＋１）｝／２＋Ｄ／２Specifically, the mel LSP to be corrected is m
(K), where k = 1 to M, distance d (k) between adjacent dimensions
Is as shown below, and this distance d between adjacent dimensions is
When (k) falls below the threshold value D, as shown below, the mel LS
The P correction process is performed. Here, m ′ is the corrected mel LSP. d (k) = m (k + 1) -m (k) m '(k) = {m (k) + m (k + 1)} / 2-D / 2 m' (k + 1) = {m (k) + m (k + 1) )｝ / 2 + D / 2

【００５５】なお、次元間距離拡張部９における拡張処
理は、これに限られるものではなく、特開平８−３０５
３９７号公報に開示されている方法等様々なものを用い
ることができる。さらに、ＬＳＰ変形部７の構成につい
ても、この実施の形態２の構成に限定されるものではな
い。The extension processing in the inter-dimension distance extension unit 9 is not limited to this, but is described in Japanese Patent Application Laid-Open No. 8-305.
Various methods such as the method disclosed in Japanese Patent No. 397 can be used. Further, the configuration of the LSP deforming unit 7 is not limited to the configuration of the second embodiment.

【００５６】そして、メル逆変換部１０は、ＬＳＰ変形
部７から補正メルＬＳＰを受けると、その補正メルＬＳ
Ｐの各次元値に対して、メル周波数領域から線形周波数
領域に戻す逆変換処理を実行し、その逆変換結果である
補正ＬＳＰを出力する。なお、メル周波数領域から線形
周波数領域への変換は、テーブル引きに基づいて計算す
る方法と、メル変換部６における近似式の逆変換に相当
する計算式によって算出する方法がある。Then, upon receiving the corrected mel LSP from the LSP transforming section 7, the mel inverse transform section 10 receives the corrected mel LS
For each dimension value of P, an inverse transformation process for returning from the mel frequency domain to the linear frequency domain is performed, and a corrected LSP as a result of the inverse transformation is output. The conversion from the mel frequency domain to the linear frequency domain includes a method of calculating based on a table lookup and a method of calculating using a calculation formula corresponding to the inverse conversion of the approximate expression in the mel conversion unit 6.

【００５７】以上で明らかなように、この実施の形態２
によれば、入力ＬＳＰの周波数領域を線形周波数領域か
らメル周波数領域に変換してから補正処理を行うように
したので、全周波数帯域に渡って聴覚的に均等な効果を
もたらすことができる効果を奏する。また、音声符号化
装置と音声復号化装置が多段階に接続されたり、背景雑
音が多い場合でも、低域の劣化を引き起こすことなく、
高域に生じる急峻な極を良好に補正することができる効
果を奏する。As is clear from the above, this embodiment 2
According to the method described above, since the correction process is performed after converting the frequency domain of the input LSP from the linear frequency domain to the mel frequency domain, an effect that can provide an auditory equal effect over the entire frequency band is obtained. Play. Also, even when the audio encoding device and the audio decoding device are connected in multiple stages, or when there is a lot of background noise, without causing deterioration of the low frequency band,
There is an effect that a steep pole generated in a high frequency range can be satisfactorily corrected.

【００５８】なお、メル変換部６の近似式を用いたＬＳ
Ｐ補正装置を、音声符号化装置や音声復号化装置に適用
した時に、例えば、高域の符号化特性の劣化が大きい傾
向があったり、逆に低域の補正が弱くて復号音の振幅が
不安定になるような場合には、線形周波数領域とメル周
波数領域の中間的な周波数領域に変換するように近似式
を調整することもできる。この場合には、当然メル逆変
換部１０もこれに対応して調整することが必要である。Note that LS using the approximate expression of the mel conversion unit 6
When the P correction device is applied to an audio encoding device or an audio decoding device, for example, the degradation of the high-frequency encoding characteristics tends to be large, or the low-frequency correction is weak and the amplitude of the decoded sound is low. In the case where the frequency becomes unstable, the approximation formula can be adjusted so as to convert to an intermediate frequency region between the linear frequency region and the mel frequency region. In this case, it is, of course, necessary to adjust the mel inverse transform unit 10 accordingly.

【００５９】実施の形態３．上記実施の形態１，２で
は、ｂａｒｋ周波数領域又はメル周波数領域でＬＳＰを
補正するものについて示したが、図１におけるｂａｒｋ
変換部１を対数周波数変換部（ＬＳＰの周波数領域を線
形周波数領域から対数周波数領域に変換する変換部）に
変更するとともに、ｂａｒｋ逆変換部５を対数周波数逆
変換部(ＬＳＰの周波数領域を線形周波数領域に戻す逆
変換部)に変更し、対数周波数領域でＬＳＰを補正する
ようにしてもよい。また、ｂａｒｋ周波数領域，メル周
波数領域又は対数周波数領域以外にも、聴覚的特性と比
較的対応が良い周波数領域に変換してＬＳＰを補正する
ようにしてもよい。Embodiment 3 In the first and second embodiments, the LSP is corrected in the bar frequency domain or the mel frequency domain.
The conversion unit 1 is changed to a logarithmic frequency conversion unit (a conversion unit that converts the LSP frequency domain from a linear frequency domain to a logarithmic frequency domain), and the bark inverse conversion unit 5 is replaced with a logarithmic frequency inverse conversion unit (the LSP frequency domain (Inverse transform unit that returns to the frequency domain), and the LSP may be corrected in the logarithmic frequency domain. Further, in addition to the bark frequency domain, the mel frequency domain, or the logarithmic frequency domain, the LSP may be corrected by converting to a frequency domain having relatively good correspondence with auditory characteristics.

【００６０】以上で明らかなように、この実施の形態３
によれば、入力ＬＳＰの周波数領域を線形周波数領域か
ら対数周波数領域に変換してから補正処理を行うように
したので、全周波数帯域に渡って聴覚的に均等な効果を
もたらすことができる効果を奏する。また、音声符号化
装置と音声復号化装置が多段階に接続されたり、背景雑
音が多い場合でも、低域の劣化を引き起こすことなく、
高域に生じる急峻な極を良好に補正することができる効
果を奏する。As is apparent from the above, the third embodiment
According to the method described above, since the correction process is performed after the frequency domain of the input LSP is converted from the linear frequency domain to the logarithmic frequency domain, an effect that can provide an auditory equal effect over the entire frequency band is obtained. Play. Also, even when the audio encoding device and the audio decoding device are connected in multiple stages, or when there is a lot of background noise, without causing deterioration of the low frequency band,
There is an effect that a steep pole generated in a high frequency range can be satisfactorily corrected.

【００６１】実施の形態４．図４はこの発明の実施の形
態４によるＬＳＰ補正装置を示す構成図であり、図にお
いて、１１はスペクトルパラメータであるＬＳＰを入力
すると、そのＬＳＰの各次元値を基準にして、隣接次元
間距離に関する閾値を各次元毎に算出する閾値算出部
（閾値算出手段）、１２はＬＳＰの周波数領域を線形周
波数領域からｂａｒｋ周波数領域（聴覚的特性に対応す
る周波数領域）に変換するｂａｒｋ変換部、１３はｂａ
ｒｋ変換部１２により周波数領域が変換されたｂａｒｋ
−ＬＳＰの各次元値に所定の閾値Ｄの半分値を加減算し
て、その加算結果と減算結果を仮ｂａｒｋ値として出力
する仮ｂａｒｋ値算出部、１４は仮ｂａｒｋ値算出部１
３から出力された仮ｂａｒｋ値の周波数領域をｂａｒｋ
周波数領域から線形周波数領域に逆変換して、その逆変
換結果を仮周波数値として出力するｂａｒｋ逆変換部、
１５はｂａｒｋ逆変換部１４から出力された各次元毎の
仮周波数値の差分を算出し、その算出結果を次元間距離
閾値として出力する差分算出部である。Embodiment 4 FIG. 4 is a block diagram showing an LSP correction apparatus according to Embodiment 4 of the present invention. In FIG. 4, when an LSP which is a spectrum parameter is input, the distance between adjacent dimensions is determined based on each dimension value of the LSP. A threshold calculating unit (threshold calculating means) for calculating a threshold value for each dimension, a bark converting unit for converting an LSP frequency domain from a linear frequency domain to a bark frequency domain (frequency domain corresponding to auditory characteristics), 13 Is ba
bark whose frequency domain has been transformed by the rk transform unit 12
A temporary bank value calculating unit that adds and subtracts a half value of a predetermined threshold value D to each dimension value of the LSP, and outputs the addition result and the subtraction result as a temporary bark value;
3 is the frequency domain of the provisional bark value output from bark
A bark inverse transform unit that performs inverse transform from the frequency domain to the linear frequency domain and outputs the inverse transform result as a provisional frequency value;
Reference numeral 15 denotes a difference calculation unit that calculates a difference between provisional frequency values for each dimension output from the bark inverse transform unit 14 and outputs the calculation result as an inter-dimension distance threshold.

【００６２】１６は閾値算出部１１により算出された各
次元毎の閾値に基づいてＬＳＰを補正するＬＳＰ変形部
（補正手段）、１７はＬＳＰの隣接次元間距離を算出す
る次元間距離算出部、１８は次元間距離算出部１７によ
り算出された隣接次元間距離が閾値算出部１１から出力
された次元間距離閾値より算出される閾値を下回ると
き、その隣接次元間距離を広げる次元間距離拡張部であ
る。Reference numeral 16 denotes an LSP deformation unit (correction means) for correcting the LSP based on the threshold value for each dimension calculated by the threshold value calculation unit 11, 17 an inter-dimensional distance calculation unit for calculating the distance between adjacent dimensions of the LSP, Reference numeral 18 denotes an inter-dimension distance extension unit that increases the inter-dimension distance when the inter-dimension distance calculated by the inter-dimension calculation unit 17 is smaller than a threshold calculated from the inter-dimension distance threshold output from the threshold calculation unit 11. It is.

【００６３】次に動作について説明する。まず、入力Ｌ
ＳＰは閾値算出部１１のｂａｒｋ変換部１２と、ＬＳＰ
変形部１６の次元間距離算出部１７と次元間距離拡張部
１８に入力される。ｂａｒｋ変換部１２は、ＬＳＰを入
力すると、そのＬＳＰの各次元値をｂａｒｋ周波数領域
に変換し、その変換結果であるｂａｒｋ−ＬＳＰを仮ｂ
ａｒｋ値算出部１３に出力する。Next, the operation will be described. First, input L
The SP includes a bark converter 12 of the threshold calculator 11 and an LSP
It is input to the inter-dimension distance calculation unit 17 and the inter-dimension distance expansion unit 18 of the deformation unit 16. When the LSP is input, the bark conversion unit 12 converts each dimension value of the LSP into a bark frequency domain, and outputs a bark-LSP, which is a result of the conversion, as a temporary b.
Output to the ark value calculation unit 13.

【００６４】そして、仮ｂａｒｋ値算出部１３は、ｂａ
ｒｋ変換部１２からｂａｒｋ−ＬＳＰを受けると、下記
に示すように、そのｂａｒｋ−ＬＳＰの各次元値に所定
の閾値Ｄの半分値を加減算して、その加算結果と減算結
果を仮ｂａｒｋ値としてｂａｒｋ逆変換部１４に出力す
る。即ち、各次元毎に、次元値を基準にして２個の仮ｂ
ａｒｋ値を算出して出力する。仮ｂａｒｋ値＝（ｂａｒｋ−ＬＳＰの各次元値）＋Ｄ／
２仮ｂａｒｋ値＝（ｂａｒｋ−ＬＳＰの各次元値）−Ｄ／
２Then, the provisional bark value calculating unit 13
When the bark-LSP is received from the rk conversion unit 12, a half value of a predetermined threshold value D is added to and subtracted from each dimension value of the bark-LSP as described below, and the addition result and the subtraction result are used as a temporary bark value. Output to the bark inverse transform unit 14. That is, for each dimension, two temporary b
Calculate and output the ark value. Temporary bark value = (each dimension value of bark-LSP) + D /
2 Temporary bark value = (bark-each dimension value of LSP)-D /
2

【００６５】ｂａｒｋ逆変換部１４は、仮ｂａｒｋ値算
出部１３から各次元毎に２個の仮ｂａｒｋ値を受ける
と、仮ｂａｒｋ値の周波数領域をｂａｒｋ周波数領域か
ら線形周波数領域に逆変換し、その逆変換結果を仮周波
数値として差分算出部１５に出力する。When the bark inverse transform unit 14 receives two temporary bark values for each dimension from the temporary bark value calculation unit 13, the bark inverse transform unit 14 inversely transforms the frequency domain of the temporary bark value from the bark frequency domain to the linear frequency domain. The result of the inverse conversion is output to the difference calculator 15 as a temporary frequency value.

【００６６】そして、差分算出部１５は、ｂａｒｋ逆変
換部１４から各次元毎に２個の仮周波数値を受けると、
各次元毎に２個の仮周波数値の差分を算出し、その算出
結果を各次元毎の次元間距離閾値としてＬＳＰ変形部１
６に出力する。When the difference calculating unit 15 receives two provisional frequency values for each dimension from the bark inverse transform unit 14,
A difference between two provisional frequency values is calculated for each dimension, and the calculation result is used as an inter-dimension distance threshold value for each dimension.
6 is output.

【００６７】一方、ＬＳＰ変形部１６の次元間距離算出
部１７は、ＬＳＰを入力すると、そのＬＳＰの隣接次元
間距離を算出する。そして、ＬＳＰ変形部１６の次元間
距離拡張部１８は、次元間距離算出部１７がＬＳＰの隣
接次元間距離を算出すると、そのＬＳＰの隣接次元間距
離と、閾値算出部１１の差分算出部１５から出力された
次元間距離閾値より算出される閾値と比較する。そし
て、ＬＳＰの隣接次元間距離が閾値を下回る場合、その
入力ＬＳＰの対応する次元値を補正して、その隣接次元
間距離を広げる処理を実行する。On the other hand, when the LSP transformation unit 16 receives the LSP, the interdimensional distance calculation unit 17 calculates the distance between adjacent dimensions of the LSP. Then, when the inter-dimensional distance calculating unit 17 calculates the distance between adjacent dimensions of the LSP, the inter-dimensional distance extending unit 18 of the LSP deforming unit 16 compares the distance between the adjacent dimensions of the LSP with the difference calculating unit 15 of the threshold calculating unit 11. Is compared with the threshold value calculated from the inter-dimensional distance threshold value output from. Then, when the distance between adjacent dimensions of the LSP is smaller than the threshold value, a process of correcting the corresponding dimension value of the input LSP and increasing the distance between adjacent dimensions is executed.

【００６８】具体的には、補正対象の入力ＬＳＰをｆ
（ｋ）、ｋ＝１〜Ｍとすると、隣接次元間距離ｄ（ｋ）
は下記に示すようになるので、この隣接次元間距離ｄ
（ｋ）が各次元毎の次元間距離閾値Ｄｆ（ｋ）から算出
される閾値Ｄ’（ｋ）を下回ると、下記に示すように入
力ＬＳＰの補正処理を実行する。ただし、ｆ’が補正後
の入力ＬＳＰである。ｄ（ｋ）＝ｆ（ｋ＋１）−ｆ（ｋ）Ｄ’ｆ（ｋ）＝（Ｄｆ（ｋ）＋Ｄｆ（ｋ＋１））／２ｆ’（ｋ）＝｛ｆ（ｋ）＋ｆ（ｋ＋１）｝／２−Ｄ’ｆ（ｋ）ｆ’（ｋ＋１）＝｛ｆ（ｋ）＋ｆ（ｋ＋１）｝／２＋Ｄ’ｆ（ｋ）Specifically, the input LSP to be corrected is represented by f
(K), where k = 1 to M, distance d (k) between adjacent dimensions
Is as shown below, and this distance d between adjacent dimensions is
When (k) falls below a threshold value D '(k) calculated from the inter-dimension distance threshold value Df (k) for each dimension, the input LSP is corrected as described below. Here, f ′ is the corrected input LSP. d (k) = f (k + 1) -f (k) D'f (k) = (Df (k) + Df (k + 1)) / 2 f '(k) = {f (k) + f (k + 1)} / 2-D'f (k) f '(k + 1) = {f (k) + f (k + 1)} / 2 + D'f (k)

【００６９】なお、次元間距離拡張部１８における拡張
処理は、これに限られるものではなく、特開平８−３０
５３９７号公報に開示されている方法等様々なものを用
いることができる。さらに、ＬＳＰ変形部１６の構成に
ついても、この実施の形態４の構成に限定されるもので
はない。The extension processing in the inter-dimension distance extension unit 18 is not limited to this, but is described in
Various methods such as the method disclosed in Japanese Patent No. 5397 can be used. Further, the configuration of LSP deforming section 16 is not limited to the configuration of the fourth embodiment.

【００７０】図５はこの実施の形態４によるＬＳＰ補正
装置の補正結果を説明する説明図である。図５（ａ）は
ｂａｒｋ変換部１２が出力したｂａｒｋ−ＬＳＰの第ｋ
次の値ｂ（ｋ）と、これに対応して仮ｂａｒｋ値算出部
１３が出力した仮ｂａｒｋ値ｂ_t1（ｋ）とｂ_t2（ｋ）を
示している。仮ｂａｒｋ値は、ｂａｒｋ−ＬＳＰの各次
元値に所定の閾値Ｄの半分値を加減算して算出される。
図１５（ｂ）は図１５（ａ）の仮ｂａｒｋ値をｂａｒｋ
逆変換部１４が逆変換して出力した仮周波数値ｆ
_t1（ｋ）とｆ_t2（ｋ）を示している。差分算出部１５
は、この２つの仮周波数値の差分をとることで、第ｋ次
の次元間距離閾値Ｄｆ（ｋ）を算出する。FIG. 5 is an explanatory diagram for explaining a correction result of the LSP correction device according to the fourth embodiment. FIG. 5A shows the k-th bar-LSP output from the bark conversion unit 12.
The following values b (k) and corresponding temporary bark values b _t1 (k) and b _t2 (k) output by the temporary bark value calculation unit 13 are shown. The temporary bark value is calculated by adding and subtracting a half value of a predetermined threshold D to each dimension value of the bark-LSP.
FIG. 15B shows the provisional bark value of FIG.
Temporary frequency value f output by inverse conversion by inverse conversion unit 14
_t1 (k) and _ft2 (k) are shown. Difference calculator 15
Calculates the k-th dimension distance threshold Df (k) by taking the difference between these two provisional frequency values.

【００７１】以上で明らかなように、この実施の形態４
によれば、入力ＬＳＰの各次元毎の値に応じて、隣接次
元間距離に関する閾値を各次元毎に算出し、この閾値に
基づいて入力ＬＳＰの補正を行うようにしたので、各次
元の周波数に最適な閾値に基づいて入力ＬＳＰが補正さ
れるようになり、その結果、補正による影響が周波数毎
に差異が出て、適用した音声符号化装置及び音声復号化
装置の符号化復号化品質の劣化を解消できなかったり、
逆に劣化をもたらしてしまう課題を解消することができ
る効果を奏する。As is apparent from the above, the fourth embodiment
According to the method, the threshold value for the distance between adjacent dimensions is calculated for each dimension according to the value of each dimension of the input LSP, and the input LSP is corrected based on the threshold value. The input LSP is corrected based on the optimal threshold value, and as a result, the influence of the correction is different for each frequency, and the coding / decoding quality of the applied voice coding apparatus and voice decoding apparatus is reduced. Deterioration cannot be eliminated,
On the contrary, there is an effect that the problem of causing deterioration can be solved.

【００７２】また、聴覚的特性に対応が良いｂａｒｋ周
波数領域で閾値を算出するようにしたので、全周波数帯
域に渡って聴覚的に均等な効果をもたらすことができる
効果を奏する。さらに、音声符号化装置と音声復号化装
置が多段階に接続されたり、背景雑音が多い場合でも、
低域の劣化を引き起こすことなく、高域に生じる急峻な
極を良好に補正することができる効果を奏する。Further, since the threshold value is calculated in the bark frequency region having a good correspondence to the auditory characteristics, an effect that an auditory equal effect can be obtained over the entire frequency band is obtained. Furthermore, even when the speech encoding device and speech decoding device are connected in multiple stages, or when there is much background noise,
There is an effect that a steep pole generated in a high frequency can be favorably corrected without causing deterioration in a low frequency.

【００７３】なお、ｂａｒｋ変換部１２の近似式を用い
たＬＳＰ補正装置を、音声符号化装置や音声復号化装置
に適用した時に、例えば、高域の符号化特性の劣化が大
きい傾向があったり、逆に低域の補正が弱くて復号音の
振幅が不安定になるような場合には、線形周波数領域と
ｂａｒｋ周波数領域の中間的な周波数領域に変換するよ
うに近似式を調整することもできる。この場合には、当
然ｂａｒｋ逆変換部１４もこれに対応して調整すること
が必要である。When an LSP correction device using the approximation formula of the bark conversion unit 12 is applied to a speech coding device or a speech decoding device, for example, the degradation of high-frequency coding characteristics tends to be large. On the other hand, if the low-frequency correction is weak and the amplitude of the decoded sound becomes unstable, the approximation formula may be adjusted to convert to an intermediate frequency domain between the linear frequency domain and the bark frequency domain. it can. In this case, the bark inverse transform unit 14 also needs to be adjusted accordingly.

【００７４】実施の形態５．図６はこの発明の実施の形
態５によるＬＳＰ補正装置を示す構成図であり、図にお
いて、図４と同一符号は同一または相当部分を示すので
説明を省略する。２１はスペクトルパラメータであるＬ
ＳＰを入力すると、そのＬＳＰの各次元値を基準にし
て、隣接次元間距離に関する閾値を各次元毎に算出する
閾値算出部（閾値算出手段）、２２はＬＳＰの周波数領
域を線形周波数領域からメル周波数領域（聴覚的特性に
対応する周波数領域）に変換するメル変換部、２３はメ
ル変換部２２により周波数領域が変換されたメルＬＳＰ
の各次元値に所定の閾値Ｄの半分値を加減算して、その
加算結果と減算結果を仮メル値として出力する仮メル値
算出部、２４は仮メル値算出部２３から出力された仮メ
ル値の周波数領域をメル周波数領域から線形周波数領域
に逆変換して、その逆変換結果を仮周波数値として出力
するメル逆変換部、２５はメル逆変換部２４から出力さ
れた各次元毎の仮周波数値の差分を算出し、その算出結
果を次元間距離閾値として出力する差分算出部である。Embodiment 5 FIG. 6 is a block diagram showing an LSP correction apparatus according to Embodiment 5 of the present invention. In the figure, the same reference numerals as those in FIG. 4 denote the same or corresponding parts, and a description thereof will be omitted. 21 is a spectrum parameter L
When an SP is input, a threshold calculator (threshold calculating means) for calculating a threshold for the distance between adjacent dimensions for each dimension based on each dimension value of the LSP. A mel transform unit 23 for transforming into a frequency domain (frequency domain corresponding to auditory characteristics), and a mel LSP 23 whose frequency domain is transformed by the mel transform unit 22
A temporary mel value calculator that adds and subtracts a half value of a predetermined threshold value D to and from each dimension value and outputs the addition result and the subtraction result as a temporary mel value. A mel inverse transform unit that inversely transforms the frequency domain of the value from the mel frequency domain to the linear frequency domain and outputs the result of the inverse transform as a temporary frequency value. This is a difference calculation unit that calculates the difference between the frequency values and outputs the calculation result as a distance threshold between dimensions.

【００７５】次に動作について説明する。まず、入力Ｌ
ＳＰは閾値算出部２１のメル変換部２２と、ＬＳＰ変形
部１６の次元間距離算出部１７と次元間距離拡張部１８
に入力される。メル変換部２２は、ＬＳＰを入力する
と、そのＬＳＰの各次元値をメル周波数領域に変換し、
その変換結果であるメルＬＳＰを仮メル値算出部２３に
出力する。Next, the operation will be described. First, input L
The SP is the mel conversion unit 22 of the threshold value calculation unit 21, the inter-dimension distance calculation unit 17 of the LSP deformation unit 16, and the inter-dimension distance expansion unit 18
Is input to Upon input of the LSP, the mel transform unit 22 converts each dimension value of the LSP into a mel frequency domain,
The mel LSP as a result of the conversion is output to the provisional mel value calculation unit 23.

【００７６】そして、仮メル値算出部２３は、メル変換
部２２からメルＬＳＰを受けると、下記に示すように、
そのメルＬＳＰの各次元値に所定の閾値Ｄの半分値を加
減算して、その加算結果と減算結果を仮メル値としてメ
ル逆変換部２４に出力する。即ち、各次元毎に、次元値
を基準にして２個の仮メル値を算出して出力する。仮メル値＝メルＬＳＰの各次元値＋Ｄ／２仮メル値＝メルＬＳＰの各次元値−Ｄ／２Upon receiving the mel LSP from the mel conversion unit 22, the temporary mel value calculation unit 23 calculates
A half value of a predetermined threshold value D is added to and subtracted from each dimension value of the mel LSP, and the addition result and the subtraction result are output to the mel inverse conversion unit 24 as a temporary mel value. That is, for each dimension, two temporary mel values are calculated and output based on the dimension value. Temporary Mel Value = Each Dimension Value of Mel LSP + D / 2 Temporary Mel Value = Each Dimension Value of Mel LSP−D / 2

【００７７】メル逆変換部２４は、仮メル値算出部２３
から各次元毎に２個の仮メル値を受けると、仮メル値の
周波数領域をメル周波数領域から線形周波数領域に逆変
換し、その逆変換結果を仮周波数値として差分算出部２
５に出力する。The inverse mel conversion unit 24 is provided with a temporary mel value calculation unit 23.
When two temporary mel values are received for each dimension from, the frequency domain of the tentative mel value is inversely transformed from the mel frequency domain to the linear frequency domain, and the result of the inverse transformation is used as a tentative frequency value in the difference calculation unit 2.
5 is output.

【００７８】そして、差分算出部２５は、メル逆変換部
２４から各次元毎に２個の仮周波数値を受けると、各次
元毎に２個の仮周波数値の差分を算出し、その算出結果
を各次元毎の次元間距離閾値としてＬＳＰ変形部１６に
出力する。以降のＬＳＰ変形部１６の動作は上記実施の
形態４と同様であるため説明を省略する。Upon receiving the two provisional frequency values for each dimension from the mel inverse transform unit 24, the difference calculation unit 25 calculates the difference between the two provisional frequency values for each dimension, and calculates the result. Is output to the LSP transformation unit 16 as an inter-dimensional distance threshold for each dimension. Subsequent operations of the LSP deforming unit 16 are the same as those of the fourth embodiment, and thus description thereof is omitted.

【００７９】以上で明らかなように、この実施の形態５
によれば、入力ＬＳＰの各次元毎の値に応じて、隣接次
元間距離に関する閾値を各次元毎に算出し、この閾値に
基づいて入力ＬＳＰの補正を行うようにしたので、各次
元の周波数に最適な閾値に基づいて入力ＬＳＰが補正さ
れるようになり、その結果、補正による影響が周波数毎
に差異が出て、適用した音声符号化装置及び音声復号化
装置の符号化復号化品質の劣化を解消できなかったり、
逆に劣化をもたらしてしまう課題を解消することができ
る効果を奏する。As is apparent from the above, the fifth embodiment
According to the method, the threshold value for the distance between adjacent dimensions is calculated for each dimension according to the value of each dimension of the input LSP, and the input LSP is corrected based on the threshold value. The input LSP is corrected based on the optimal threshold value, and as a result, the influence of the correction is different for each frequency, and the coding / decoding quality of the applied voice coding apparatus and voice decoding apparatus is reduced. Deterioration cannot be eliminated,
On the contrary, there is an effect that the problem of causing deterioration can be solved.

【００８０】また、聴覚的特性に対応が良いメル周波数
領域で閾値を算出するようにしたので、全周波数帯域に
渡って聴覚的に均等な効果をもたらすことができる効果
を奏する。さらに、音声符号化装置と音声復号化装置が
多段階に接続されたり、背景雑音が多い場合でも、低域
の劣化を引き起こすことなく、高域に生じる急峻な極を
良好に補正することができる効果を奏する。Further, since the threshold value is calculated in the mel frequency region having a good response to the auditory characteristics, an effect that an auditory equal effect can be obtained over the entire frequency band is achieved. Furthermore, even when the speech encoding device and the speech decoding device are connected in multiple stages or there is a lot of background noise, it is possible to satisfactorily correct a steep pole generated in a high frequency band without causing deterioration in a low frequency region. It works.

【００８１】なお、メル変換部２２の近似式を用いたＬ
ＳＰ補正装置を、音声符号化装置や音声復号化装置に適
用した時に、例えば、高域の符号化特性の劣化が大きい
傾向があったり、逆に低域の補正が弱くて復号音の振幅
が不安定になるような場合には、線形周波数領域とメル
周波数領域の中間的な周波数領域に変換するように近似
式を調整することもできる。この場合には、当然メル逆
変換部２４もこれに対応して調整することが必要であ
る。Note that L using the approximate expression of the mel conversion unit 22
When the SP correction device is applied to a voice coding device or a voice decoding device, for example, the degradation of the high-frequency coding characteristics tends to be large, or the low-frequency correction is weak and the amplitude of the decoded sound is low. In the case where the frequency becomes unstable, the approximation formula can be adjusted so as to convert to an intermediate frequency region between the linear frequency region and the mel frequency region. In this case, it is necessary to adjust the mel inverse transform unit 24 accordingly.

【００８２】実施の形態６．上記実施の形態４，５で
は、ｂａｒｋ周波数領域又はメル周波数領域での計算に
より閾値を算出するものについて示したが、図４におけ
るｂａｒｋ変換部１２を対数周波数変換部（ＬＳＰの周
波数領域を線形周波数領域から対数周波数領域に変換す
る変換部）に変更し、仮ｂａｒｋ値算出部１３を仮対数
周波数算出部（仮対数周波数値を算出する算出部）に変
更し、さらに、ｂａｒｋ逆変換部１４を対数周波数逆変
換部(ＬＳＰの周波数領域を線形周波数領域に戻す逆変
換部)に変更し、対数周波数領域での計算により閾値を
算出するようにしてもよい。また、ｂａｒｋ周波数領
域，メル周波数領域又は対数周波数領域以外にも、聴覚
的特性と比較的対応が良い周波数領域での計算により閾
値を算出するようにしてもよい。Embodiment 6 FIG. In the above fourth and fifth embodiments, the calculation of the threshold by the calculation in the bark frequency domain or the mel frequency domain has been described. However, the bark transform unit 12 in FIG. To a logarithmic frequency domain), the temporary bark value calculator 13 is changed to a temporary logarithmic frequency calculator (calculator for calculating a temporary logarithmic frequency value), and the bark inverse converter 14 The threshold may be calculated by changing to a logarithmic frequency inverse transform unit (an inverse transform unit for returning the frequency domain of the LSP to the linear frequency domain) and calculating in the logarithmic frequency domain. In addition, the threshold may be calculated by calculation in a frequency domain having relatively good correspondence with auditory characteristics, in addition to the bark frequency domain, the mel frequency domain, or the logarithmic frequency domain.

【００８３】実施の形態７．図７はこの発明の実施の形
態７によるＬＳＰ補正装置を示す構成図であり、図にお
いて、図４と同一符号は同一または相当部分を示すので
説明を省略する。３１は差分算出部１５により算出され
た各次元毎の次元間距離閾値と、線形周波数領域で定義
された固定閾値を比較し、大きい方の閾値をＬＳＰ変形
部１６に出力する最大値選択部である。Embodiment 7 FIG. FIG. 7 is a block diagram showing an LSP correction apparatus according to Embodiment 7 of the present invention. In the figure, the same reference numerals as those in FIG. 4 denote the same or corresponding parts, and a description thereof will be omitted. Reference numeral 31 denotes a maximum value selection unit that compares the inter-dimension distance threshold calculated for each dimension by the difference calculation unit 15 with a fixed threshold defined in the linear frequency domain, and outputs the larger threshold to the LSP transformation unit 16. is there.

【００８４】上記実施の形態４では、差分算出部１５に
より算出された各次元毎の次元間距離閾値を常にＬＳＰ
変形部１６に出力するものについて示したが、最大値選
択部３１が差分算出部１５により算出された各次元毎の
次元間距離閾値と、線形周波数領域で定義された固定閾
値を比較し、大きい方の閾値をＬＳＰ変形部１６に出力
するようにしてもよい。In the fourth embodiment, the inter-dimension distance threshold for each dimension calculated by the difference calculator 15 is always set to the LSP.
Although the output to the transformation unit 16 is shown, the maximum value selection unit 31 compares the inter-dimension distance threshold calculated for each dimension by the difference calculation unit 15 with the fixed threshold defined in the linear frequency domain, and The other threshold may be output to the LSP transformation unit 16.

【００８５】この実施の形態７によれば、入力ＬＳＰの
各次元毎の値に応じて、隣接次元間の距離に関する閾値
を聴覚的特性に対応が良い周波数領域で算出し、これと
線形周波数領域で定義される閾値の内の最大値を最終的
な閾値とするようにしたので、上記実施の形態４が奏す
る効果に加えて、全周波数帯域に渡って聴覚的に均等な
効果をもたらし、かつ、不安定なＬＳＰ(低域で次元間
距離が近づき過ぎてＬＳＰのフィルタゲインが大きくな
り過ぎる状態)を出力することがないＬＳＰ補正装置が
提供できる効果を奏する。According to the seventh embodiment, in accordance with the value of each dimension of the input LSP, the threshold value for the distance between adjacent dimensions is calculated in the frequency domain having a good correspondence to the auditory characteristics. Since the maximum value among the threshold values defined in is set as the final threshold value, in addition to the effect of the fourth embodiment, an acoustically uniform effect is provided over the entire frequency band, and And an LSP correction device that does not output an unstable LSP (a state in which the inter-dimensional distance is too small in a low frequency range and the filter gain of the LSP is too large) is provided.

【００８６】なお、この実施の形態７では、上記実施の
形態４のＬＳＰ補正装置に最大値選択部３１を適用する
ものについて示したが、上記実施の形態５，６のＬＳＰ
補正装置に最大値選択部３１を適用するようにしてもよ
い。In the seventh embodiment, the case where the maximum value selector 31 is applied to the LSP correction device of the fourth embodiment has been described.
The maximum value selection unit 31 may be applied to the correction device.

【００８７】実施の形態８．図８はこの発明の実施の形
態８によるＬＳＰ補正装置を示す構成図であり、図にお
いて、図４と同一符号は同一または相当部分を示すので
説明を省略する。３２はスペクトルパラメータであるＬ
ＳＰを入力すると、そのＬＳＰの各次元値(周波数値)に
近い２つの周波数値を周波数対閾値テーブル３３から探
索し、２つの周波数値に基づいて各次元の次元間距離閾
値を算出する閾値算出部（閾値算出手段）、３３は代表
的な周波数値と代表的な周波数値に対応する閾値を格納
する周波数対閾値テーブル、３４は代表的な周波数値に
対応する２個の閾値を検索するとともに、２個の閾値を
補間する補間部である。Embodiment 8 FIG. FIG. 8 is a configuration diagram showing an LSP correction apparatus according to Embodiment 8 of the present invention. In the figure, the same reference numerals as those in FIG. 4 denote the same or corresponding parts, and a description thereof will not be repeated. 32 is a spectrum parameter L
When an SP is input, two frequency values close to each dimension value (frequency value) of the LSP are searched from the frequency-to-threshold table 33, and a threshold calculation for calculating an inter-dimensional distance threshold of each dimension based on the two frequency values is performed. Unit (threshold value calculating means), 33 is a frequency-to-threshold table for storing a representative frequency value and a threshold value corresponding to the representative frequency value, and 34 is for searching for two threshold values corresponding to the representative frequency value. And an interpolation unit for interpolating two threshold values.

【００８８】次に動作について説明する。上記実施の形
態４〜７では、ｂａｒｋ周波数領域又はメル周波数領域
での計算により閾値を算出するものについて示したが、
入力ＬＳＰの各次元値に近い２つの周波数値を周波数対
閾値テーブル３３から探索し、２つの周波数値に基づい
て各次元の次元間距離閾値を算出するようにしてもよ
い。Next, the operation will be described. In the above fourth to seventh embodiments, the calculation of the threshold by the calculation in the bark frequency domain or the mel frequency domain has been described.
Two frequency values close to each dimension value of the input LSP may be searched from the frequency-threshold table 33, and the inter-dimension distance threshold of each dimension may be calculated based on the two frequency values.

【００８９】具体的には、まず、閾値算出部３２の補間
部３４が、入力ＬＳＰの各次元値(周波数値)に近い２つ
の周波数値を周波数対閾値テーブル３３から探索する。
そして、補間部３４は、２つの周波数値を検索すると、
その２つの周波数値に対応する２個の閾値を周波数対閾
値テーブル３３から取得する。More specifically, first, the interpolator 34 of the threshold calculator 32 searches the frequency-threshold table 33 for two frequency values close to each dimension value (frequency value) of the input LSP.
Then, when the interpolation unit 34 searches for two frequency values,
Two threshold values corresponding to the two frequency values are obtained from the frequency-threshold table 33.

【００９０】そして、補間部３４は、２つの周波数値と
入力ＬＳＰにおける当該次数値の差に応じて２個の閾値
の重み付け加算を実行し、当該次数の次元間距離閾値を
算出する。この算出処理を各次数に対して実行し、次元
毎の次元間距離閾値をＬＳＰ変形部１６の次元間距離拡
張部１８に出力する。以降のＬＳＰ変形部１６の動作は
上記実施の形態４と同様であるため説明を省略する。Then, the interpolation unit 34 performs weighted addition of the two thresholds according to the difference between the two frequency values and the order value in the input LSP, and calculates the dimension distance threshold of the order. This calculation process is executed for each order, and the dimension distance threshold for each dimension is output to the dimension extension section 18 of the LSP deforming section 16. Subsequent operations of the LSP deforming unit 16 are the same as those of the fourth embodiment, and thus description thereof is omitted.

【００９１】以上で明らかなように、この実施の形態８
によれば、入力ＬＳＰの各次元毎の値に応じて、隣接次
元間距離に関する閾値を各次元毎に算出し、この閾値に
基づいて入力ＬＳＰの補正を行うようにしたので、各次
元の周波数に最適な閾値に基づいて入力ＬＳＰが補正さ
れるようになり、その結果、補正による影響が周波数毎
に差異が出て、適用した音声符号化装置及び音声復号化
装置の符号化復号化品質の劣化を解消できなかったり、
逆に劣化をもたらしてしまう課題を解消することができ
る効果を奏する。As is clear from the above, this embodiment 8
According to the method, the threshold value for the distance between adjacent dimensions is calculated for each dimension according to the value of each dimension of the input LSP, and the input LSP is corrected based on the threshold value. The input LSP is corrected based on the optimal threshold value, and as a result, the influence of the correction is different for each frequency, and the coding / decoding quality of the applied voice coding apparatus and voice decoding apparatus is reduced. Deterioration cannot be eliminated,
On the contrary, there is an effect that the problem of causing deterioration can be solved.

【００９２】また、音声符号化装置と音声復号化装置が
多段階に接続されたり、背景雑音が多い場合でも、低域
の劣化を引き起こすことなく、高域に生じる急峻な極を
良好に補正することができる効果を奏する。なお、上記
実施の形態４と比べると、テーブルのためのメモリが新
たに必要になるが、処理が簡単になる効果を奏する。Further, even when the speech encoding device and the speech decoding device are connected in multiple stages or when there is a lot of background noise, the steep pole generated in the high frequency range is corrected well without causing deterioration in the low frequency range. The effect that can be achieved. Although a new memory for the table is required as compared with the fourth embodiment, the effect of simplifying the processing is obtained.

【００９３】実施の形態９．図９はこの発明の実施の形
態９による音声符号化装置を示す構成図であり、図にお
いて、図１と同一符号は同一または相当部分を示すので
説明を省略する。４１は入力音声を分析してＬＳＰを算
出するＬＳＰ分析部（ＬＳＰ分析手段）、４２はＬＳＰ
分析部４１により算出されたＬＳＰを補正するＬＳＰ補
正装置、４３はＬＳＰ補正装置４２が出力する補正ＬＳ
Ｐを符号化してＬＳＰ符号と量子化ＬＳＰを出力するＬ
ＳＰ符号化部（ＬＳＰ符号化手段）、４４はＬＳＰ符号
化部４３から出力された量子化ＬＳＰと入力音声から符
号化音源（音源符号）を算出する音源符号化部（音源符
号化手段）である。Embodiment 9 FIG. FIG. 9 is a block diagram showing a speech encoding apparatus according to Embodiment 9 of the present invention. In the figure, the same reference numerals as those in FIG. 1 denote the same or corresponding parts, and a description thereof will be omitted. 41 is an LSP analysis unit (LSP analysis means) for analyzing an input voice to calculate an LSP, and 42 is an LSP
An LSP correction device for correcting the LSP calculated by the analysis unit 41, and a correction LS 43 output by the LSP correction device 42
L that encodes P and outputs an LSP code and a quantized LSP
An SP encoding unit (LSP encoding unit) 44 is an excitation encoding unit (excitation encoding unit) that calculates an encoded excitation (excitation code) from the quantized LSP output from the LSP encoding unit 43 and the input speech. is there.

【００９４】次に動作について説明する。まず、ＬＳＰ
分析部４１は、入力音声を入力すると、その入力音声を
分析してＬＳＰを算出し、そのＬＳＰをＬＳＰ補正装置
４２に出力する。なお、ＬＳＰの算出は、線形予測分析
を実行して線形予測係数を計算し、これを変換して求め
るのが一般的である。Next, the operation will be described. First, LSP
When the input voice is input, the analysis unit 41 analyzes the input voice, calculates an LSP, and outputs the LSP to the LSP correction device 42. In general, the LSP is calculated by performing a linear prediction analysis, calculating a linear prediction coefficient, and converting the coefficient.

【００９５】そして、ＬＳＰ補正装置４２は、ＬＳＰ分
析部４１からＬＳＰを受けると、上記実施の形態１と同
様の手順でＬＳＰの補正を実施し、その補正ＬＳＰをＬ
ＳＰ符号化部４３に出力する。そして、ＬＳＰ符号化部
４３は、ＬＳＰ補正装置４２が補正ＬＳＰを出力する
と、その補正ＬＳＰを符号化してＬＳＰ符号を出力する
とともに、例えば、符号化処理の途中の段階で得られる
量子化ＬＳＰ（量子化ＬＳＰは符号化復号化結果に相当
する）を音源符号化部４４に出力する。When receiving the LSP from the LSP analysis unit 41, the LSP correction unit 42 corrects the LSP in the same procedure as in the first embodiment, and converts the corrected LSP to the LSP.
Output to the SP encoder 43. Then, when the LSP correction device 42 outputs the corrected LSP, the LSP encoding unit 43 encodes the corrected LSP and outputs an LSP code. For example, the LSP encoding unit 43 obtains a quantized LSP ( (The quantized LSP is equivalent to the result of encoding and decoding.)

【００９６】音源符号化部４４は、ＬＳＰ符号化部４３
から量子化ＬＳＰを受けると、その量子化ＬＳＰを用い
て、入力音声の音源情報を符号化し、その符号化結果を
音源符号として出力する。[0096] Excitation encoding section 44 includes LSP encoding section 43.
Receives the quantized LSP from the source codec, it encodes the sound source information of the input voice using the quantized LSP, and outputs the coding result as a sound source code.

【００９７】なお、この実施の形態９では、上記実施の
形態１のＬＳＰ補正装置を適用するものについて示した
が、上記実施の形態２〜８のＬＳＰ補正装置を適用する
ようにしてもよい。また、ＬＳＰ分析部４１の入力を復
号音声に変更して、特開平５−２７３９９７号公報のよ
うにバックワード型のＣＥＬＰ系構成にしてもかまわな
い。In the ninth embodiment, the LSP correction apparatus of the first embodiment is applied. However, the LSP correction apparatuses of the second to eighth embodiments may be applied. Alternatively, the input of the LSP analysis unit 41 may be changed to a decoded speech, and a backward CELP system configuration as disclosed in Japanese Patent Application Laid-Open No. Hei 5-273997 may be employed.

【００９８】以上で明らかなように、この実施の形態９
によれば、入力音声を分析して算出したＬＳＰをＬＳＰ
補正装置４２が補正し、その補正ＬＳＰを符号化するよ
うにしたので、入力音声の歪や分析誤差に伴う品質劣化
を良好に抑制することができる効果を奏する。As is apparent from the above, the ninth embodiment
According to the LSP calculated by analyzing the input voice,
Since the correction device 42 corrects and encodes the corrected LSP, there is an effect that the quality degradation due to the distortion of the input voice and the analysis error can be suppressed well.

【００９９】特に、ＬＳＰ補正装置４２がＬＳＰを補正
しているので、ＬＳＰの各次元の周波数に最適な閾値が
算出することができるようになり、その結果、補正によ
る影響が周波数毎に差異が出て符号化復号化品質の劣化
を解消できなかったり、逆に劣化をもたらしてしまう課
題を解消することができる効果を奏する。また、聴覚的
特性に対応が良い周波数領域に変換してからＬＳＰの補
正処理を行うようにした場合には、全周波数帯域に渡っ
て聴覚的に均等な効果をもたらすＬＳＰ補正が実現され
るため、良好な符号化特性が得られる効果を奏する。さ
らに、この音声符号化装置と、対応する音声復号化装置
が多段階に接続されたり、背景雑音が多い場合でも、低
域の劣化を引き起こすことなく、高域に生じる急峻な極
を良好に補正して、良好な符号化特性が得られる効果を
奏する。In particular, since the LSP correction device 42 corrects the LSP, it becomes possible to calculate the optimum threshold value for the frequency of each dimension of the LSP, and as a result, the effect of the correction is different for each frequency. There is an effect that it is possible to solve the problem that the degradation of the encoding / decoding quality cannot be eliminated or the degradation is caused. In addition, if the LSP correction process is performed after converting to a frequency region having good correspondence to the auditory characteristics, LSP correction that provides an auditory equal effect over the entire frequency band is realized. This provides an effect that good coding characteristics can be obtained. Furthermore, even when the speech encoding device and the corresponding speech decoding device are connected in multiple stages or when there is a lot of background noise, the steep poles generated in the high frequency range can be satisfactorily corrected without causing deterioration in the low frequency range. As a result, there is an effect that a good encoding characteristic can be obtained.

【０１００】実施の形態１０．図１０はこの発明の実施
の形態１０による音声符号化装置を示す構成図であり、
図において、図９と同一符号は同一または相当部分を示
すので説明を省略する。４５はＬＳＰ分析部４１により
算出されたＬＳＰを符号化してＬＳＰ符号を出力するＬ
ＳＰ符号化部（ＬＳＰ符号化手段）、４６はＬＳＰ符号
化部４５から出力されたＬＳＰ符号を復号化して復号Ｌ
ＳＰを出力するＬＳＰ復号化部（ＬＳＰ復号化手段）、
４７はＬＳＰ復号化部４６が出力する復号ＬＳＰを補正
するＬＳＰ補正装置、４８はＬＳＰ補正装置４７により
補正された復号ＬＳＰと入力音声から符号化音源（音源
符号）を算出する音源符号化部（音源符号化手段）であ
る。Embodiment 10 FIG. FIG. 10 is a configuration diagram showing a speech encoding apparatus according to Embodiment 10 of the present invention.
9, the same reference numerals as those in FIG. 9 denote the same or corresponding parts, and a description thereof will not be repeated. Reference numeral 45 denotes L for encoding the LSP calculated by the LSP analysis unit 41 and outputting an LSP code
The SP encoder (LSP encoder) 46 decodes the LSP code output from the LSP encoder 45 to decode LSP code.
An LSP decoding unit that outputs SP (LSP decoding means);
Reference numeral 47 denotes an LSP correction device for correcting the decoded LSP output from the LSP decoding unit 46, and reference numeral 48 denotes a sound source coding unit (calculated from the decoded LSP corrected by the LSP correction device 47 and the input voice, which calculates an encoded sound source (sound source code)). (Excitation coding means).

【０１０１】次に動作について説明する。まず、ＬＳＰ
分析部４１は、入力音声を入力すると、上記実施の形態
９と同様に、その入力音声を分析してＬＳＰを算出し、
そのＬＳＰをＬＳＰ符号化部４５に出力する。Next, the operation will be described. First, LSP
When inputting the input voice, the analysis unit 41 analyzes the input voice and calculates the LSP, as in the ninth embodiment.
The LSP is output to the LSP encoder 45.

【０１０２】そして、ＬＳＰ符号化部４５は、ＬＳＰ分
析部４１からＬＳＰを受けると、そのＬＳＰを符号化し
てＬＳＰ符号を出力する。そして、ＬＳＰ復号化部４６
は、ＬＳＰ符号化部４５がＬＳＰ符号を出力すると、そ
のＬＳＰ符号を復号化して復号ＬＳＰをＬＳＰ補正装置
４７に出力する。Then, when receiving the LSP from the LSP analyzer 41, the LSP encoder 45 encodes the LSP and outputs an LSP code. Then, the LSP decoding unit 46
When the LSP encoding unit 45 outputs the LSP code, the LSP encoding unit 45 decodes the LSP code and outputs the decoded LSP to the LSP correction device 47.

【０１０３】ＬＳＰ補正装置４７は、ＬＳＰ復号化部４
６から復号ＬＳＰを受けると、上記実施の形態１と同様
の手順で復号ＬＳＰの補正を実施し、その補正ＬＳＰを
音源符号化部４８に出力する。そして、音源符号化部４
８は、ＬＳＰ補正装置４７から補正ＬＳＰを受けると、
その補正ＬＳＰを用いて、入力音声の音源情報を符号化
し、その符号化結果を音源符号として出力する。The LSP correction unit 47 includes an LSP decoding unit 4
When the decoding LSP is received from 6, the decoding LSP is corrected in the same procedure as in the first embodiment, and the corrected LSP is output to excitation coding section 48. Then, excitation coding section 4
8 receives the correction LSP from the LSP correction device 47,
The sound source information of the input voice is encoded using the corrected LSP, and the encoded result is output as a sound source code.

【０１０４】なお、この実施の形態１０では、上記実施
の形態１のＬＳＰ補正装置を適用するものについて示し
たが、上記実施の形態２〜８のＬＳＰ補正装置を適用す
るようにしてもよい。In the tenth embodiment, the LSP correction apparatus of the first embodiment is applied. However, the LSP correction apparatuses of the second to eighth embodiments may be applied.

【０１０５】以上で明らかなように、この実施の形態１
０によれば、ＬＳＰ補正装置４７が復号ＬＳＰを補正
し、その補正ＬＳＰを使って音源情報の符号化を行うよ
うにしたので、ＬＳＰの符号化歪(量子化歪)による復号
音の不安定化を良好に抑制することができる効果を奏す
る。As is apparent from the above, the first embodiment
According to 0, the LSP correction device 47 corrects the decoded LSP and encodes the excitation information using the corrected LSP, so that the decoded sound becomes unstable due to LSP coding distortion (quantization distortion). This has the effect of successfully suppressing the formation.

【０１０６】特に、ＬＳＰ補正装置４７がＬＳＰを補正
しているので、ＬＳＰの各次元の周波数に最適な閾値が
算出することができるようになり、その結果、補正によ
る影響が周波数毎に差異が出て符号化復号化品質の劣化
を解消できなかったり、逆に劣化をもたらしてしまう課
題を解消することができる効果を奏する。また、聴覚的
特性に対応が良い周波数領域に変換してからＬＳＰの補
正処理を行うようにした場合には、全周波数帯域に渡っ
て聴覚的に均等な効果をもたらすＬＳＰ補正が実現され
るため、良好な符号化特性が得られる効果を奏する。さ
らに、この音声符号化装置と、対応する音声復号化装置
が多段階に接続されたり、背景雑音が多い場合でも、低
域の劣化を引き起こすことなく、高域に生じる急峻な極
を良好に補正して、良好な符号化特性が得られる効果を
奏する。In particular, since the LSP correction device 47 corrects the LSP, it becomes possible to calculate the optimum threshold value for the frequency of each dimension of the LSP, and as a result, the influence of the correction is different for each frequency. There is an effect that it is possible to solve the problem that the degradation of the encoding / decoding quality cannot be eliminated or the degradation is caused. In addition, if the LSP correction process is performed after converting to a frequency region having good correspondence to the auditory characteristics, LSP correction that provides an auditory equal effect over the entire frequency band is realized. This provides an effect that good coding characteristics can be obtained. Furthermore, even when the speech encoding device and the corresponding speech decoding device are connected in multiple stages or when there is a lot of background noise, the steep poles generated in the high frequency range can be satisfactorily corrected without causing deterioration in the low frequency range. As a result, there is an effect that a good encoding characteristic can be obtained.

【０１０７】実施の形態１１．図１１はこの発明の実施
の形態１１による音声復号化装置を示す構成図であり、
図において、図１と同一符号は同一または相当部分を示
すので説明を省略する。５１はＬＳＰ符号を復号化して
復号ＬＳＰを出力するＬＳＰ復号化部（ＬＳＰ復号化手
段）、５２はＬＳＰ復号化部５１が出力する復号ＬＳＰ
を補正するＬＳＰ補正装置、５３は音源符号を復号化し
て音源信号を生成する音源復号化部（音源復号化手
段）、５４はＬＳＰ補正装置５２が出力する補正ＬＳＰ
と音源復号化部５３により生成された音源信号から合成
音を生成する合成フィルタ（合成手段）である。Embodiment 11 FIG. FIG. 11 is a configuration diagram showing a speech decoding apparatus according to Embodiment 11 of the present invention.
In the figure, the same reference numerals as those in FIG. 51 is an LSP decoding unit (LSP decoding means) for decoding the LSP code and outputting a decoded LSP, 52 is a decoded LSP output from the LSP decoding unit 51
A sound source decoding unit (sound source decoding means) for decoding a sound source code to generate a sound source signal, and a correction LSP output from the LSP correction device 52.
And a synthesis filter (synthesis unit) for generating a synthesized sound from the sound source signal generated by the sound source decoding unit 53.

【０１０８】次に動作について説明する。まず、ＬＳＰ
復号化部５１は、ＬＳＰ符号を入力すると、そのＬＳＰ
符号を復号化して復号ＬＳＰをＬＳＰ補正装置５２に出
力する。このＬＳＰ符号は、上記実施の形態１０で説明
した音声符号化装置等から出力されたものである。Next, the operation will be described. First, LSP
When the decoding unit 51 receives the LSP code, the LSP code
The code is decoded and the decoded LSP is output to the LSP correction device 52. This LSP code is output from the speech coding device or the like described in the tenth embodiment.

【０１０９】ＬＳＰ補正装置５２は、ＬＳＰ復号化部５
１から復号ＬＳＰを受けると、上記実施の形態１と同様
の手順で復号ＬＳＰの補正を実施し、その補正ＬＳＰを
合成フィルタ５４に出力する。一方、音源復号化部５３
は、音源符号を入力すると、その音源符号を復号化して
音源信号を生成し、その音源信号を合成フィルタ５４に
出力する。The LSP correction device 52 includes an LSP decoding unit 5
When receiving the decoded LSP from the first LSP, the decoding LSP is corrected in the same procedure as in the first embodiment, and the corrected LSP is output to the synthesis filter 54. On the other hand, the sound source decoding unit 53
Receives the excitation code, generates the excitation signal by decoding the excitation code, and outputs the excitation signal to the synthesis filter 54.

【０１１０】そして、合成フィルタ５４は、ＬＳＰ補正
装置５２から補正ＬＳＰを受けると、その補正ＬＳＰを
用いて、その音源信号に対する合成フィルタリング処理
を実行し、その結果得られた合成音を出力音声として出
力する。なお、この実施の形態１１では、上記実施の形
態１のＬＳＰ補正装置を適用するものについて示した
が、上記実施の形態２〜８のＬＳＰ補正装置を適用する
ようにしてもよい。When receiving the corrected LSP from the LSP corrector 52, the synthesis filter 54 performs a synthesis filtering process on the sound source signal using the corrected LSP, and uses the resulting synthesized sound as an output voice. Output. In the eleventh embodiment, the LSP correction device of the first embodiment is applied. However, the LSP correction device of the second to eighth embodiments may be applied.

【０１１１】以上で明らかなように、この実施の形態１
１によれば、ＬＳＰ補正装置５２が復号ＬＳＰを補正
し、その補正ＬＳＰを使って合成フィルタリング処理を
行うようにしたので、ＬＳＰ符号の伝送誤りによる復号
音の不安定化を良好に抑制することができる効果を奏す
る。また、上記実施の形態１０の音声符号化装置等が出
力するＬＳＰ符号を入力として、音声復号化装置のＬＳ
Ｐ補正装置５２を音声符号化装置のＬＳＰ補正装置と同
様のものにする場合には、ＬＳＰの符号化歪(量子化歪)
による復号音の不安定化を良好に抑制した音声復号化装
置が提供できる効果を奏する。As is clear from the above, the first embodiment
According to No. 1, since the LSP correction device 52 corrects the decoded LSP and performs the synthesis filtering process using the corrected LSP, it is possible to appropriately suppress the instability of the decoded sound due to the transmission error of the LSP code. It has the effect of being able to. Further, the LSP code output from the speech encoding device or the like according to the tenth embodiment is input and the LS
When the P correction device 52 is the same as the LSP correction device of the speech coding device, the LSP coding distortion (quantization distortion)
Thus, it is possible to provide an audio decoding device that can appropriately suppress the instability of the decoded sound due to.

【０１１２】特に、ＬＳＰ補正装置５２が復号ＬＳＰを
補正しているので、復号ＬＳＰの各次元の周波数に最適
な閾値が算出することができるようになり、その結果、
補正による影響が周波数毎に差異が出て符号化復号化品
質の劣化を解消できなかったり、逆に劣化をもたらして
しまう課題を解消することができる効果を奏する。ま
た、聴覚的特性に対応が良い周波数領域に変換してから
ＬＳＰの補正処理を行うようにした場合には、全周波数
帯域に渡って聴覚的に均等な効果をもたらすＬＳＰ補正
が実現されるため、良好な復号化特性が得られる効果が
ある。さらに、この音声復号化装置に対応する音声符号
化装置と、この音声復号化装置が多段階に接続された
り、音声符号化装置に入力される背景雑音が多い場合で
も、低域の劣化を引き起こすことなく、高域に生じる急
峻な極を良好に補正して、良好な復号化特性が得られる
効果を奏する。In particular, since the LSP correction device 52 corrects the decoded LSP, it becomes possible to calculate the optimum threshold value for the frequency of each dimension of the decoded LSP.
There is an effect that it is possible to solve the problem that the influence of the correction is different for each frequency and the degradation of the encoding / decoding quality cannot be eliminated, or the degradation of the quality can be solved. In addition, if the LSP correction process is performed after converting to a frequency region having good correspondence to the auditory characteristics, LSP correction that provides an auditory equal effect over the entire frequency band is realized. Thus, there is an effect that good decoding characteristics can be obtained. Further, even if the speech encoding device corresponding to the speech decoding device and the speech decoding device are connected in multiple stages, or if there is much background noise input to the speech encoding device, low-frequency degradation is caused. Without this, it is possible to satisfactorily correct a steep pole generated in a high frequency range and obtain an effect of obtaining a good decoding characteristic.

【０１１３】実施の形態１２．図１２はこの発明の実施
の形態１２による音声復号化装置を示す構成図であり、
図において、図１１と同一符号は同一または相当部分を
示すので説明を省略する。５５はＬＳＰ復号化部５１か
ら出力された復号ＬＳＰと音源復号化部５３により生成
された音源信号から合成音を生成する合成フィルタ（合
成手段）、５６はＬＳＰ補正装置５２から出力された補
正ＬＳＰを用いて合成音に対するスペクトル強調処理を
実行するポストフィルタ（ポストフィルタ手段）であ
る。Embodiment 12 FIG. FIG. 12 is a configuration diagram showing a speech decoding apparatus according to Embodiment 12 of the present invention.
In the figure, the same reference numerals as those in FIG. 11 denote the same or corresponding parts, and a description thereof will not be repeated. Reference numeral 55 denotes a synthesis filter (synthesis unit) that generates a synthesized sound from the decoded LSP output from the LSP decoding unit 51 and the sound source signal generated by the sound source decoding unit 53, and reference numeral 56 denotes a corrected LSP output from the LSP correction device 52. Is a post-filter (post-filter means) for executing a spectrum emphasis process on a synthesized sound by using.

【０１１４】次に動作について説明する。まず、ＬＳＰ
復号化部５１は、ＬＳＰ符号を入力すると、そのＬＳＰ
符号を復号化して復号ＬＳＰをＬＳＰ補正装置５２及び
合成フィルタ５５に出力する。このＬＳＰ符号は、上記
実施の形態１０で説明した音声符号化装置等から出力さ
れたものである。Next, the operation will be described. First, LSP
When the decoding unit 51 receives the LSP code, the LSP code
The code is decoded and the decoded LSP is output to the LSP correction device 52 and the synthesis filter 55. This LSP code is output from the speech coding device or the like described in the tenth embodiment.

【０１１５】一方、音源復号化部５３は、音源符号を入
力すると、その音源符号を復号化して音源信号を生成
し、その音源信号を合成フィルタ５５に出力する。そし
て、ＬＳＰ補正装置５２は、ＬＳＰ復号化部５１から復
号ＬＳＰを受けると、上記実施の形態１と同様の手順で
復号ＬＳＰの補正を実施し、その補正ＬＳＰをポストフ
ィルタ５６に出力する。On the other hand, when excitation codec 53 receives the excitation code, it decodes the excitation code to generate an excitation signal and outputs the excitation signal to synthesis filter 55. Then, when receiving the decoded LSP from the LSP decoding unit 51, the LSP correction device 52 corrects the decoded LSP in the same procedure as in the first embodiment, and outputs the corrected LSP to the post filter 56.

【０１１６】そして、合成フィルタ５５は、ＬＳＰ復号
化部５１から復号ＬＳＰを受けると、その復号ＬＳＰを
用いて、その音源信号に対する合成フィルタリング処理
を実行し、その結果得られた合成音をポストフィルタ５
６に出力する。そして、ポストフィルタ５６は、ＬＳＰ
補正装置５２から補正ＬＳＰを受けると、その補正ＬＳ
Ｐを用いて合成音に対する音声強調処理を実行し、その
結果得られた加工合成音を出力音声として出力する。When receiving the decoded LSP from the LSP decoding unit 51, the synthesis filter 55 executes synthesis filtering processing on the sound source signal using the decoded LSP, and converts the resultant synthesized sound into a post-filter. 5
6 is output. And the post filter 56 is an LSP
When receiving the correction LSP from the correction device 52, the correction LS
A voice emphasizing process is performed on the synthesized sound using P, and the processed synthesized sound obtained as a result is output as an output sound.

【０１１７】なお、この実施の形態１２では、上記実施
の形態１のＬＳＰ補正装置を適用するものについて示し
たが、上記実施の形態２〜８のＬＳＰ補正装置を適用す
るようにしてもよい。また、上記実施の形態１１と組み
合わせて、合成フィルタ５５にも補正ＬＳＰが入力され
るようにしてもよい。In the twelfth embodiment, the LSP correction apparatus of the first embodiment is applied. However, the LSP correction apparatuses of the second to eighth embodiments may be applied. Further, in combination with the eleventh embodiment, the correction LSP may also be input to the synthesis filter 55.

【０１１８】以上で明らかなように、この実施の形態１
２によれば、補正ＬＳＰを使ってポストフィルタ処理を
行うようにしたので、ポストフィルタの極が急峻過ぎて
過強調を引き起こすことを抑制することができる効果を
奏する。また、ＬＳＰ補正装置５２が補正を行うように
したので、復号ＬＳＰの各次元の周波数に最適な閾値が
算出することができるようになり、その結果、良好な音
声強調が得られる効果を奏する。As is clear from the above, the first embodiment
According to 2, since the post-filter processing is performed using the correction LSP, it is possible to prevent the pole of the post-filter from being too steep to cause over-emphasis. Further, since the LSP correction device 52 performs the correction, it is possible to calculate the optimum threshold value for the frequency of each dimension of the decoded LSP, and as a result, it is possible to obtain an effect of obtaining good voice enhancement.

【０１１９】また、聴覚的特性に対応が良い周波数領域
に変換してからＬＳＰの補正処理を行うようにした場合
には、全周波数帯域に渡って聴覚的に均等な効果をもた
らすＬＳＰ補正が実現されるため、良好な音声強調特性
が得られる効果を奏する。さらに、この音声復号化装置
に対応する音声符号化装置と、この音声復号化装置が多
段階に接続されたり、音声符号化装置に入力される背景
雑音が多い場合でも、低域の劣化を引き起こすことな
く、高域に生じる急峻な極を良好に補正して、良好な音
声強調特性が得られる効果を奏する。Further, when the LSP correction process is performed after converting to a frequency region having a good correspondence to the auditory characteristics, the LSP correction that provides an auditory equal effect over the entire frequency band is realized. Therefore, there is an effect that a good voice emphasis characteristic can be obtained. Further, even if the speech encoding device corresponding to the speech decoding device and the speech decoding device are connected in multiple stages, or if there is much background noise input to the speech encoding device, low-frequency degradation is caused. Without this, it is possible to satisfactorily correct a steep pole generated in a high frequency range and obtain an effect of obtaining a good voice emphasis characteristic.

【０１２０】実施の形態１３．図１３はこの発明の実施
の形態１３による音声復号化装置におけるポストフィル
タを示す構成図であり、図において、６１，６２は復号
ＬＳＰを補正するＬＳＰ補正装置、６３，６４はＬＳＰ
補正装置６１，６２が出力する補正ＬＳＰをＬＰＣ領域
に変換するＬＰＣ変換部、６５はＬＰＣ変換部６３が出
力するＬＰＣを用いて、音声復号化装置内で別途生成さ
れた合成音に対する合成フィルタ処理を実行するＬＰＣ
合成フィルタ、６６はＬＰＣ変換部６４が出力するＬＰ
Ｃを用いて、ＬＰＣ合成フィルタ６５の出力信号に対す
る逆フィルタ処理を実行するＬＰＣ逆フィルタである。
なお、音声復号化装置の全体構成は、図１２と同じもの
でも、他の構成でもかまわない。Embodiment 13 FIG. FIG. 13 is a block diagram showing a post filter in an audio decoding apparatus according to Embodiment 13 of the present invention. In the figure, reference numerals 61 and 62 denote LSP correction apparatuses for correcting a decoded LSP, and reference numerals 63 and 64 denote LSPs.
An LPC conversion unit for converting the corrected LSP output from the correction devices 61 and 62 into an LPC area, and a synthesis filter process 65 for the synthesized sound separately generated in the speech decoding device using the LPC output from the LPC conversion unit 63 LPC to execute
The synthesis filter 66 is an LP output from the LPC conversion unit 64
This is an LPC inverse filter that performs inverse filtering on the output signal of the LPC synthesis filter 65 using C.
Note that the overall configuration of the speech decoding device may be the same as that of FIG. 12 or may be another configuration.

【０１２１】次に動作について説明する。まず、復号Ｌ
ＳＰがＬＳＰ補正装置６１，６２に入力されるが、この
復号ＬＳＰは、音声復号化装置内で生成されたものであ
り、音声復号化装置内の合成フィルタに使用されるもの
と同一か、それを実施の形態１２と同様にして補正した
ものである。Next, the operation will be described. First, decryption L
The SP is input to the LSP correction devices 61 and 62. The decoded LSP is generated in the audio decoding device and is the same as or different from the one used for the synthesis filter in the audio decoding device. Is corrected in the same manner as in the twelfth embodiment.

【０１２２】ＬＳＰ補正装置６１は、復号ＬＳＰを入力
すると、その復号ＬＳＰを補正して、その補正ＬＳＰを
ＬＰＣ変換部６３に出力する。そして、ＬＰＣ変換部６
３は、ＬＳＰ補正装置６１から補正ＬＳＰを受けると、
その補正ＬＳＰをＬＰＣ領域に変換し、その結果得られ
たＬＰＣをＬＰＣ合成フィルタ６５に出力する。When receiving the decoded LSP, the LSP correction device 61 corrects the decoded LSP and outputs the corrected LSP to the LPC conversion unit 63. Then, the LPC conversion unit 6
3 receives the correction LSP from the LSP correction device 61,
The corrected LSP is converted into an LPC area, and the resulting LPC is output to the LPC synthesis filter 65.

【０１２３】一方、ＬＳＰ補正装置６２は、復号ＬＳＰ
を入力すると、その復号ＬＳＰを補正して、その補正Ｌ
ＳＰをＬＰＣ変換部６４に出力する。そして、ＬＰＣ変
換部６４は、ＬＳＰ補正装置６２から補正ＬＳＰを受け
ると、その補正ＬＳＰをＬＰＣ領域に変換し、その結果
得られたＬＰＣをＬＰＣ逆フィルタ６６に出力する。On the other hand, the LSP correction device 62 outputs the decrypted LSP
Is input, the decoded LSP is corrected, and the corrected LSP is corrected.
The SP is output to the LPC conversion unit 64. Then, when receiving the correction LSP from the LSP correction device 62, the LPC conversion unit 64 converts the correction LSP into an LPC area, and outputs the LPC obtained as a result to the LPC inverse filter 66.

【０１２４】そして、ＬＰＣ合成フィルタ６５は、ＬＰ
Ｃ変換部６３がＬＰＣを出力すると、そのＬＰＣを用い
て、音声復号化装置内で別途生成された合成音に対する
合成フィルタ処理を実行し、その結果得られた信号をＬ
ＰＣ逆フィルタ６６に出力する。Then, the LPC synthesis filter 65
When the C conversion unit 63 outputs the LPC, the LPC is used to perform a synthesis filter process on a synthesized sound separately generated in the speech decoding apparatus, and the resultant signal is converted to an L signal.
Output to the PC inverse filter 66.

【０１２５】そして、ＬＰＣ逆フィルタ６６は、ＬＰＣ
変換部６４が出力するＬＰＣを用いて、ＬＰＣ合成フィ
ルタ６５の出力信号に対する逆フィルタ処理を実行し、
その結果得られた信号を加工合成音として出力する。こ
の加工合成音が音声復号化装置全体の最終的な出力音声
となる場合と、更に後段で加工処理を加えられて最終的
な出力音声とする場合がある。The LPC inverse filter 66 is an LPC inverse filter.
Using the LPC output from the conversion unit 64, an inverse filter process is performed on the output signal of the LPC synthesis filter 65,
The signal obtained as a result is output as processed synthetic sound. In some cases, the processed synthesized sound is the final output sound of the entire speech decoding device, and in another case, the processed sound is further processed in a later stage to be the final output sound.

【０１２６】なお、この実施の形態１３では、上記実施
の形態１のＬＳＰ補正装置を適用するものについて示し
たが、上記実施の形態２〜８のＬＳＰ補正装置を適用す
るようにしてもよい。In the thirteenth embodiment, the LSP correction apparatus of the first embodiment is applied. However, the LSP correction apparatuses of the second to eighth embodiments may be applied.

【０１２７】以上で明らかなように、この実施の形態１
３によれば、ＬＳＰ補正装置６１，６２が復号ＬＳＰを
補正し、その補正ＬＳＰを使ってポストフィルタ処理を
行うようにしたので、復号ＬＳＰの各次元の周波数に最
適な閾値が算出することができるようになり、その結
果、良好な音声強調が得られる効果を奏する。また、聴
覚的特性に対応が良い周波数領域に変換してからＬＳＰ
の補正処理を行うようにした場合には、全周波数帯域に
渡って聴覚的に均等な効果をもたらすＬＳＰ補正が実現
されるため、良好な音声強調特性が得られる効果を奏す
る。さらに、この音声復号化装置に対応する音声符号化
装置と、この音声復号化装置が多段階に接続されたり、
音声符号化装置に入力される背景雑音が多い場合でも、
低域の劣化を引き起こすことなく、高域に生じる急峻な
極を良好に補正して、良好な音声強調特性が得られる効
果を奏する。As is apparent from the above, the first embodiment
According to No. 3, since the LSP correction devices 61 and 62 correct the decoded LSP and perform the post-filter processing using the corrected LSP, it is possible to calculate the optimum threshold value for the frequency of each dimension of the decoded LSP. As a result, it is possible to obtain an effect that good voice enhancement can be obtained. In addition, after converting to a frequency domain that is compatible with auditory characteristics, LSP
When the correction processing is performed, the LSP correction that provides an equal effect perceptually over the entire frequency band is realized, so that an effect that a good voice emphasis characteristic is obtained can be obtained. Furthermore, a speech encoding device corresponding to the speech decoding device and the speech decoding device are connected in multiple stages,
Even if there is much background noise input to the speech coding device,
Without causing deterioration in the low frequency range, steep poles generated in the high frequency range are satisfactorily corrected, and an effect of obtaining good voice emphasis characteristics can be obtained.

【０１２８】[0128]

【発明の効果】以上のように、この発明によれば、聴覚
的特性に対応する周波数領域に変換されたＬＳＰを補正
し、そのＬＳＰの周波数領域を線形周波数領域に戻すよ
うに構成したので、全周波数帯域に渡って聴覚的に均等
な効果をもたらすことができる効果がある。As described above, according to the present invention, the LSP converted into the frequency domain corresponding to the auditory characteristic is corrected, and the frequency domain of the LSP is returned to the linear frequency domain. There is an effect that an auditory equal effect can be provided over the entire frequency band.

【０１２９】この発明によれば、ＬＳＰの周波数領域を
線形周波数領域からバーク周波数領域に変換して補正す
るように構成したので、全周波数帯域に渡って聴覚的に
均等な効果をもたらすことができる効果がある。According to the present invention, since the LSP frequency domain is converted from the linear frequency domain to the Bark frequency domain for correction, it is possible to provide an auditory equal effect over the entire frequency band. effective.

【０１３０】この発明によれば、ＬＳＰの周波数領域を
線形周波数領域からメル周波数領域に変換して補正する
ように構成したので、全周波数帯域に渡って聴覚的に均
等な効果をもたらすことができる効果がある。According to the present invention, since the LSP frequency domain is converted from the linear frequency domain to the mel frequency domain for correction, it is possible to provide an acoustically uniform effect over the entire frequency band. effective.

【０１３１】この発明によれば、ＬＳＰの周波数領域を
線形周波数領域から対数周波数領域に変換して補正する
ように構成したので、全周波数帯域に渡って聴覚的に均
等な効果をもたらすことができる効果がある。According to the present invention, since the LSP frequency domain is converted from the linear frequency domain to the logarithmic frequency domain and corrected, it is possible to provide an acoustically uniform effect over the entire frequency band. effective.

【０１３２】この発明によれば、ＬＳＰの各次元値を基
準にして、隣接次元間距離に関する閾値を各次元毎に算
出し、各次元毎の閾値に基づいてＬＳＰを補正するよう
に構成したので、補正による影響が周波数毎に差異が出
て、適用した音声符号化装置及び音声復号化装置の符号
化復号化品質の劣化を解消できなかったり、逆に劣化を
もたらしてしまう課題を解消することができる効果があ
る。According to the present invention, the threshold for the distance between adjacent dimensions is calculated for each dimension based on each dimension value of the LSP, and the LSP is corrected based on the threshold for each dimension. To solve the problem that the influence of the correction is different for each frequency, and the deterioration of the coding / decoding quality of the applied speech coding apparatus and the speech decoding apparatus cannot be eliminated or, conversely, causes the degradation. There is an effect that can be.

【０１３３】この発明によれば、ＬＳＰの周波数領域を
線形周波数領域から聴覚的特性に対応する周波数領域に
変換し、その聴覚的特性に対応する周波数領域で閾値を
算出するように構成したので、全周波数帯域に渡って聴
覚的に均等な効果をもたらすことができる効果がある。According to the present invention, the LSP frequency domain is converted from the linear frequency domain to the frequency domain corresponding to the auditory characteristics, and the threshold value is calculated in the frequency domain corresponding to the auditory characteristics. There is an effect that an auditory equal effect can be provided over the entire frequency band.

【０１３４】この発明によれば、聴覚的特性に対応する
周波数領域で算出した閾値と、線形周波数領域で定義さ
れた閾値を比較し、大きい方の閾値を補正手段に出力す
るように構成したので、全周波数帯域に渡って聴覚的に
均等な効果をもたらし、かつ、不安定なＬＳＰ(低域で
次元間距離が近づき過ぎてＬＳＰのフィルタゲインが大
きくなり過ぎる状態)を出力することがないＬＳＰ補正
装置が提供できる効果がある。According to the present invention, the threshold calculated in the frequency domain corresponding to the auditory characteristic is compared with the threshold defined in the linear frequency domain, and the larger threshold is output to the correction means. An LSP that produces an auditory equal effect over the entire frequency band and does not output an unstable LSP (a state in which the filter gain of the LSP becomes too large due to the inter-dimensional distance being too close in the low frequency range). There is an effect that the correction device can provide.

【０１３５】この発明によれば、ＬＳＰの周波数領域を
線形周波数領域からバーク周波数領域に変換して閾値を
算出するように構成したので、全周波数帯域に渡って聴
覚的に均等な効果をもたらすことができる効果がある。According to the present invention, since the threshold is calculated by converting the LSP frequency domain from the linear frequency domain to the Bark frequency domain, it is possible to provide an auditory equal effect over the entire frequency band. There is an effect that can be.

【０１３６】この発明によれば、ＬＳＰの周波数領域を
線形周波数領域からメル周波数領域に変換して閾値を算
出するように構成したので、全周波数帯域に渡って聴覚
的に均等な効果をもたらすことができる効果がある。According to the present invention, since the LSP frequency domain is converted from the linear frequency domain to the mel frequency domain to calculate the threshold value, an auditory equal effect can be obtained over the entire frequency band. There is an effect that can be.

【０１３７】この発明によれば、ＬＳＰの周波数領域を
線形周波数領域から対数周波数領域に変換して閾値を算
出するように構成したので、全周波数帯域に渡って聴覚
的に均等な効果をもたらすことができる効果がある。According to the present invention, the threshold is calculated by converting the frequency domain of the LSP from the linear frequency domain to the logarithmic frequency domain, so that an auditory equal effect is provided over the entire frequency band. There is an effect that can be.

【０１３８】この発明によれば、聴覚特性に対応する周
波数領域で補正されたＬＳＰを符号化してＬＳＰ符号と
量子化ＬＳＰを出力するように構成したので、入力音声
の歪や分析誤差に伴う品質劣化を良好に抑制することが
できる効果がある。According to the present invention, since the LSP corrected in the frequency domain corresponding to the auditory characteristics is encoded and the LSP code and the quantized LSP are output, the quality of the input speech due to distortion and analysis error is increased. There is an effect that deterioration can be favorably suppressed.

【０１３９】この発明によれば、ＬＳＰの各次元値を基
準にして算出した各次元毎の閾値を用いて補正されたＬ
ＳＰを符号化してＬＳＰ符号と量子化ＬＳＰを出力する
ように構成したので、入力音声の歪や分析誤差に伴う品
質劣化を良好に抑制することができる効果がある。According to the present invention, the LSP corrected using the threshold value for each dimension calculated based on each dimension value of the LSP.
Since the SP is encoded and the LSP code and the quantized LSP are output, there is an effect that the quality deterioration due to the distortion and the analysis error of the input voice can be suppressed well.

【０１４０】この発明によれば、聴覚特性に対応する周
波数領域で補正された復号ＬＳＰと入力音声から符号化
音源を算出するように構成したので、ＬＳＰの符号化歪
(量子化歪)による復号音の不安定化を良好に抑制するこ
とができる効果がある。According to the present invention, since the coded excitation is calculated from the decoded LSP corrected in the frequency domain corresponding to the auditory characteristics and the input voice, the coding distortion of the LSP is reduced.
There is an effect that the instability of the decoded sound due to (quantization distortion) can be favorably suppressed.

【０１４１】この発明によれば、ＬＳＰの各次元値を基
準にして算出した各次元毎の閾値を用いて補正された復
号ＬＳＰと入力音声から符号化音源を算出するように構
成したので、ＬＳＰの符号化歪(量子化歪)による復号音
の不安定化を良好に抑制することができる効果がある。According to the present invention, the coded excitation is calculated from the decoded LSP and the input speech which are corrected using the threshold value for each dimension calculated based on each dimension value of the LSP. Thus, there is an effect that the instability of the decoded sound due to the encoding distortion (quantization distortion) can be favorably suppressed.

【０１４２】この発明によれば、聴覚特性に対応する周
波数領域で補正された復号ＬＳＰと音源復号化手段によ
り生成された音源信号から合成音を生成するように構成
したので、ＬＳＰ符号の伝送誤りによる復号音の不安定
化を良好に抑制することができる効果がある。According to the present invention, since the synthesized sound is generated from the decoded LSP corrected in the frequency domain corresponding to the auditory characteristics and the sound source signal generated by the sound source decoding means, the transmission error of the LSP code can be improved. Thus, there is an effect that the instability of the decoded sound due to the above can be suppressed well.

【０１４３】この発明によれば、ＬＳＰの各次元値を基
準にして算出した各次元毎の閾値を用いて補正された復
号ＬＳＰと音源復号化手段により生成された音源信号か
ら合成音を生成するように構成したので、ＬＳＰ符号の
伝送誤りによる復号音の不安定化を良好に抑制すること
ができる効果がある。According to the present invention, a synthesized sound is generated from the decoded LSP corrected by using the threshold value for each dimension calculated based on each dimension value of the LSP and the sound source signal generated by the sound source decoding means. With such a configuration, there is an effect that the instability of the decoded sound due to the transmission error of the LSP code can be favorably suppressed.

【０１４４】この発明によれば、聴覚特性に対応する周
波数領域で補正された復号ＬＳＰを用いて合成音に対す
るスペクトル強調処理を実行するように構成したので、
ポストフィルタの極が急峻過ぎて過強調を引き起こすこ
とを抑制しつつ良好な音声強調が得られる効果がある。According to the present invention, the spectrum emphasis processing is performed on the synthesized sound using the decoded LSP corrected in the frequency domain corresponding to the auditory characteristics.
There is an effect that good voice enhancement can be obtained while suppressing that the pole of the post-filter is too steep to cause over-emphasis.

【０１４５】この発明によれば、ＬＳＰの各次元値を基
準にして算出した各次元毎の閾値を用いて補正された復
号ＬＳＰを用いて合成音に対するスペクトル強調処理を
実行するように構成したので、ポストフィルタの極が急
峻過ぎて過強調を引き起こすことを抑制しつつ良好な音
声強調が得られる効果がある。According to the present invention, the spectrum emphasis processing for the synthesized sound is performed using the decoded LSP corrected using the threshold value for each dimension calculated based on each dimension value of the LSP. In addition, there is an effect that good voice enhancement can be obtained while suppressing that the pole of the post filter is too steep to cause over-emphasis.

[Brief description of the drawings]

【図１】この発明の実施の形態１によるＬＳＰ補正装
置を示す構成図である。FIG. 1 is a configuration diagram showing an LSP correction device according to a first embodiment of the present invention.

【図２】この実施の形態１によるＬＳＰ補正装置の補
正結果を説明する説明図である。FIG. 2 is an explanatory diagram illustrating a correction result of the LSP correction device according to the first embodiment.

【図３】この発明の実施の形態２によるＬＳＰ補正装
置を示す構成図である。FIG. 3 is a configuration diagram showing an LSP correction device according to a second embodiment of the present invention.

【図４】この発明の実施の形態４によるＬＳＰ補正装
置を示す構成図である。FIG. 4 is a configuration diagram showing an LSP correction device according to a fourth embodiment of the present invention.

【図５】この実施の形態４によるＬＳＰ補正装置の補
正結果を説明する説明図である。FIG. 5 is an explanatory diagram illustrating a correction result of the LSP correction device according to the fourth embodiment.

【図６】この実施の形態５によるＬＳＰ補正装置の補
正結果を説明する説明図である。FIG. 6 is an explanatory diagram illustrating a correction result of the LSP correction device according to the fifth embodiment.

【図７】この実施の形態７によるＬＳＰ補正装置の補
正結果を説明する説明図である。FIG. 7 is an explanatory diagram illustrating a correction result of the LSP correction device according to the seventh embodiment.

【図８】この実施の形態８によるＬＳＰ補正装置の補
正結果を説明する説明図である。FIG. 8 is an explanatory diagram illustrating a correction result of the LSP correction device according to the eighth embodiment.

【図９】この実施の形態９による音声符号化装置を示
す構成図である。FIG. 9 is a configuration diagram illustrating a speech encoding device according to a ninth embodiment.

【図１０】この実施の形態１０による音声符号化装置
を示す構成図である。FIG. 10 is a configuration diagram illustrating a speech encoding device according to a tenth embodiment.

【図１１】この実施の形態１１による音声復号化装置
を示す構成図である。FIG. 11 is a configuration diagram illustrating a speech decoding apparatus according to an eleventh embodiment.

【図１２】この実施の形態１２による音声復号化装置
を示す構成図である。FIG. 12 is a configuration diagram illustrating a speech decoding apparatus according to a twelfth embodiment.

【図１３】この実施の形態１３による音声復号化装置
におけるポストフィルタを示す構成図である。FIG. 13 is a configuration diagram showing a post filter in a speech decoding apparatus according to Embodiment 13;

[Explanation of symbols]

１ｂａｒｋ変換部（変換手段）、２，７，１６ＬＳ
Ｐ変形部（補正手段）、５ｂａｒｋ逆変換部（逆変換
手段）、６メル変換部（変換手段）、１０メル逆変換
部（逆変換手段）、１１，２１，３２閾値算出部（閾
値算出手段）、４１ＬＳＰ分析部（ＬＳＰ分析手
段）、４３，４５ＬＳＰ符号化部（ＬＳＰ符号化手
段）、４４，４８音源符号化部（音源符号化手段）、
４６，５１ＬＳＰ復号化部（ＬＳＰ復号化手段）、５３
音源復号化部（音源復号化手段）、５４，５５合成
フィルタ（合成手段）、５６ポストフィルタ（ポスト
フィルタ手段）。1 bark conversion unit (conversion means), 2, 7, 16 LS
P transformation unit (correction unit), 5 bark inverse conversion unit (inversion unit), 6 mel conversion unit (conversion unit), 10 mel inverse conversion unit (inverse conversion unit), 11, 21, 32 Threshold calculation unit (threshold calculation) Means), 41 LSP analysis section (LSP analysis means), 43, 45 LSP encoding section (LSP encoding means), 44, 48 excitation encoding section (excitation encoding means),
46, 51 LSP decoding section (LSP decoding means), 53
Sound source decoding unit (sound source decoding means), 54, 55 synthesis filter (synthesis means), 56 post filter (post filter means).

Claims

[Claims]

When an LSP which is a spectrum parameter is input, a conversion means for converting a frequency domain of the LSP from a linear frequency domain to a frequency domain corresponding to an auditory characteristic, and an LSP whose frequency domain is converted by the conversion means
An LSP correction apparatus comprising: a correction unit that corrects the LSP;

2. The LSP correction device according to claim 1, wherein the conversion means converts the LSP frequency domain from a linear frequency domain to a Bark frequency domain.

3. The LSP correction apparatus according to claim 1, wherein the conversion means converts the frequency domain of the LSP from a linear frequency domain to a mel frequency domain.

4. The LSP correction device according to claim 1, wherein the conversion means converts the frequency domain of the LSP from a linear frequency domain to a logarithmic frequency domain.

5. When a LSP which is a spectrum parameter is input, a threshold value calculating means for calculating a threshold value for a distance between adjacent dimensions for each dimension based on each dimension value of the LSP, and a threshold value calculating means for calculating a threshold value. Correction means for correcting the LSP based on a threshold value for each dimension.

6. The threshold value calculating means converts a frequency domain of an LSP from a linear frequency domain to a frequency domain corresponding to an auditory characteristic when calculating a threshold value regarding a distance between adjacent dimensions, and calculates a frequency corresponding to the auditory characteristic. The LSP correction device according to claim 5, wherein a threshold value is calculated in the area.

7. The threshold calculating means compares a threshold calculated in a frequency domain corresponding to an auditory characteristic with a threshold defined in a linear frequency domain, and outputs a larger threshold to the correcting means. The LSP correction device according to claim 6, wherein

8. The LSP correction device according to claim 6, wherein the threshold value calculating means calculates the threshold value by converting the frequency domain of the LSP from the linear frequency domain to the Bark frequency domain.

9. The LSP correction device according to claim 6, wherein the threshold value calculating means calculates the threshold value by converting a frequency domain of the LSP from a linear frequency domain to a mel frequency domain.

10. The LSP correction device according to claim 6, wherein the threshold value calculating means calculates the threshold value by converting a frequency domain of the LSP from a linear frequency domain to a logarithmic frequency domain.

11. An LSP analyzer for analyzing an input voice to calculate an LSP, and a converter for converting a frequency domain of the LSP calculated by the LSP analyzer from a linear frequency domain to a frequency domain corresponding to auditory characteristics. Correction means for correcting the LSP whose frequency domain has been converted by the conversion means, inverse conversion means for returning the frequency domain of the LSP corrected by the correction means to the linear frequency domain, and output from the inverse conversion means. LSP encoding means for encoding the LSP and outputting an LSP code and a quantized LSP;
An audio encoding device comprising: a quantized LSP output from a P encoding unit; and an excitation encoding unit that calculates an encoded excitation from the input audio.

12. An LSP analyzing means for analyzing an input voice to calculate an LSP, and a threshold value for a distance between adjacent dimensions is calculated for each dimension based on each dimension value of the LSP calculated by the LSP analyzing means. Threshold calculation means, correction means for correcting the LSP based on the threshold value for each dimension calculated by the threshold calculation means, LSP code corrected by the correction means, and an LSP code and quantization LS
An audio encoding apparatus comprising: an LSP encoding unit that outputs P; and an excitation encoding unit that calculates an encoded excitation from the quantized LSP output from the LSP encoding unit and the input audio.

13. An LSP analyzing unit for analyzing an input voice to calculate an LSP, an LSP encoding unit for encoding the LSP calculated by the LSP analyzing unit and outputting an LSP code, and an LSP encoding unit. Output LSP
LSP decoding means for decoding a code and outputting a decoded LSP, and a decoded LSP output from the LSP decoding means
Transforming the frequency domain from the linear frequency domain to the frequency domain corresponding to the auditory characteristic, correcting means for correcting the decoded LSP whose frequency domain has been converted by the converting means, and decoding corrected by the correcting means. A speech encoding apparatus comprising: an inverse transform unit for returning a frequency domain of an LSP to a linear frequency domain; and a sound source encoding unit for calculating an encoded excitation from the decoded LSP output from the inverse transform unit and the input speech.

14. An LSP analyzing means for analyzing an input voice to calculate an LSP, an LSP encoding means for encoding the LSP calculated by the LSP analyzing means and outputting an LSP code, and Output LSP
LSP decoding means for decoding a code and outputting a decoded LSP, and a decoded LSP output from the LSP decoding means
A threshold value calculating means for calculating a threshold value for the distance between adjacent dimensions for each dimension with reference to each dimension value of, and an LS value based on the threshold value for each dimension calculated by the threshold value calculating means.
A speech coding apparatus comprising: a correction unit for correcting P; a decoding LSP corrected by the correction unit; and a sound source coding unit for calculating a coding sound source from the input voice.

15. An LSP decoding means for decoding an LSP code and outputting a decoded LSP, and converting a frequency domain of the decoded LSP output from the LSP decoding means from a linear frequency domain to a frequency domain corresponding to auditory characteristics. Conversion means for converting; a correction means for correcting the decoded LSP whose frequency domain has been converted by the conversion means; an inverse conversion means for returning the frequency domain of the decoded LSP corrected by the correction means to the linear frequency domain; Comprising a sound source decoding means for decoding a sound signal to generate a sound source signal, and a synthesis means for generating a synthesized sound from the decoded LSP output from the inverse conversion means and the sound source signal generated by the sound source decoding means. Decryption device.

16. An LSP decoding unit for decoding an LSP code and outputting a decoded LSP, and a threshold value for a distance between adjacent dimensions is set based on each dimension value of the decoded LSP output from the LSP decoding unit. Threshold calculating means for calculating each dimension, correcting means for correcting the LSP based on the threshold for each dimension calculated by the threshold calculating means, and excitation decoding means for decoding an excitation code to generate an excitation signal A speech decoding device comprising: a decoding LSP corrected by the correction means; and a synthesis means for generating a synthesized sound from the sound source signal generated by the sound source decoding means.

17. An LSP decoding means for decoding an LSP code and outputting a decoded LSP, and converting a frequency domain of the decoded LSP output from the LSP decoding means from a linear frequency domain to a frequency domain corresponding to auditory characteristics. Conversion means for converting; a correction means for correcting the decoded LSP whose frequency domain has been converted by the conversion means; an inverse conversion means for returning the frequency domain of the decoded LSP corrected by the correction means to the linear frequency domain; Sound source decoding means for decoding the sound signal to generate a sound source signal; synthesizing means for generating a synthesized sound from the decoded LSP output from the LSP decoding means and the sound source signal generated by the sound source decoding means; Post decoding means for performing spectrum enhancement processing on the synthesized sound using the decoding LSP output from the conversion means. Device.

18. An LSP decoding unit for decoding an LSP code and outputting a decoded LSP, and a threshold value for a distance between adjacent dimensions is set based on each dimension value of the decoded LSP output from the LSP decoding unit. Threshold calculating means for calculating each dimension, correcting means for correcting the LSP based on the threshold for each dimension calculated by the threshold calculating means, and excitation decoding means for decoding an excitation code to generate an excitation signal Synthesizing means for generating a synthesized sound from the decoded LSP output from the LSP decoding means and the sound source signal generated by the sound source decoding means, and the synthesized sound using the decoded LSP corrected by the correcting means. And a post-filter means for performing a spectrum emphasis process on the audio signal.