JP3557413B2

JP3557413B2 - LSP parameter decoding apparatus and decoding method

Info

Publication number: JP3557413B2
Application number: JP2002110037A
Authority: JP
Inventors: 直也田中
Original assignee: Panasonic Corp; Matsushita Electric Industrial Co Ltd
Current assignee: Panasonic Corp; Panasonic Holdings Corp
Priority date: 2002-04-12
Filing date: 2002-04-12
Publication date: 2004-08-25
Anticipated expiration: 2019-08-25
Also published as: JP2002372997A

Description

【０００１】
【発明の属する技術分野】
本発明は、音声信号のスペクトル情報の特徴パラメータであるＬＳＰパラメータの符号化復号化装置に関するものである。
【０００２】
【従来の技術】
従来、４〜８ｋｂｑｓ程度のビットレートの音声符号化装置では、音声信号を分析することによってスペクトル情報と音源情報とに分離して符号化する方法が主流である。ＬＳＰパラメータは、スペクトル情報を表す特徴パラメータであり、通常、フレームあたり１０次程度必要である。ＬＳＰパラメータを符号化する最も基本的な方法としては、個々の値をスカラーとして量子化する方法があるが、量子化効果が低いため、複数のＬＳＰパラメータをまとめて量子化するベクトル量子化が良く用いられる。また、ＬＳＰパラメータは、隣接するフレーム間に大きな相関があるため、フレーム間の相関を利用することによって、量子化効率を上げることができる。
【０００３】
図６は従来のフレーム間の相関を利用するＬＳＰパラメータ量子化装置の構成を示すブロック図であり、６００はＬＳＰパラメータ算出手段、６０１は過去の量子化値を蓄えておくバッファ、６０２は過去の量子化値から現フレームの値を線形に予測する予測手段、６０３は予測値と入力値との誤差を最小にする符号を符号帳から選択する誤差最小化手段、６０４は符号帳、６０５は出力符号から量子化値を復号する復号化手段である。また、６０６は入力音声信号、６０７は現フレームのＬＳＰパラメータ、６０８は出力符号、６０９は現フレームの量子化値、６１０は過去の量子化値、６１１は予測された現フレームのＬＳＰパラメータである。
【０００４】
以上のように、構成された従来のＬＳＰパラメータ量子化装置における処理について説明する。ＬＳＰパラメータ算出手段６００は、入力音声信号６０６から現フレームのＬＳＰパラメータ６０７を算出する。予測手段６０２はバッファ６０１に蓄えられた過去の量子化値６１０から現フレームのＬＳＰパラメータを線形に予測する。誤差最小化手段６０３は、入力音声信号から算出されたＬＳＰパラメータ６０７と、過去の量子化値から予測されたＬＳＰパラメータ６１１の誤差を算出し、誤差を最小にする符号を符号帳６０４から選択し、その符号を出力する。復号手段６０５は、出力符号６０８から量子化値を復号し、復号された量子化値６０９は、バッファ６０１に格納される。
【０００５】
【発明が解決しようとする課題】
しかしながら、上記従来の装置では、入力音声信号が定常に近い状態では、高い予測ゲインが得られ、精度の高い量子化が行なえるものの、入力音声信号が過渡的な状態では、予測ゲインが低下し、量子化の精度も低下する。フレーム長が長くなると、隣接フレーム間で過渡的要素が大きくなり、フレーム間相関が小さくなるため、同様に予測ゲインが低下する。したがって、隣接フレーム間相関を利用して予測を行なう量子化方法は、入力音声信号が隣接フレーム間で定常とみなされやすく、フレーム長の短い音声符号化方法には適するが、フレーム長が長い音声符号化方法に適用するのは難しかった。
【０００６】
また、過去の量子化値から現在の値を予測するため、伝送路で生じる符号誤りの影響が、誤りフレームだけではなく以降のフレームに伝搬するため、誤りに弱いという問題があった。
【０００７】
本発明は、上記従来の問題を解決するものであり、入力音声信号が過渡的な状態でも、高い量子化精度を確保するとともに、誤りに対する耐性を高めることのできるＬＳＰパラメータ復号化装置及び復号化方法を提供することを目的とする。
【０００８】
【課題を解決するための手段】
本発明は、上記目的を達成するために、音声信号のスペクトル情報の特徴パラメータであるＬＳＰパラメータを復号化する復号化装置であって、フレーム単位で独立にベクトル量子化されたＬＳＰパラメータを復号化して量子化値を得る第１の復号化手段と、前記第１の復号化手段で得られた量子化値と参照フレームの量子化値から求めた予測値を用いてフレーム間の相関を利用してベクトル量子化されたＬＳＰパラメータを復号化する第２の復号化手段とを備えたものである。
【０００９】
また本発明は、音声信号のスペクトル情報の特徴パラメータであるＬＳＰパラメータを復号化する復号化方法であって、フレーム単位で独立にベクトル量子化されたＬＳＰパラメータを復号化する第１の復号化ステップと、前記第１の復号化ステップで得られた量子化値と参照フレームの量子化値から予測値を求め、この予測値を用いてフレーム間の相関を利用してベクトル量子化されたＬＳＰパラメータを復号化する第２の復号化ステップの各処理動作によりＬＳＰパラメータを復号化するようにしたものである。
【００１０】
【作用】
したがって、本発明によれば、隣接フレーム間の相関が小さい部分では、フレーム単位で独立に復号化する第１の復号化手段を用い、隣接フレーム間の相関が大きい部分では、前記第１の復号化手段で得られた量子化値と参照フレームの量子化値から求めた予測値を用いてフレーム間の相関を利用してベクトル量子化されたＬＳＰパラメータを復号化することにより、入力音声信号の状態に関わらず、安定した高い復号化精度が得ることができる。
【００１１】
また本発明は、隣接フレーム間の相関を利用する復号化動作では、第１の復号化手段で得られた量子化値と参照フレームの量子化値から予測値を求め、この予測値を用いてフレーム間の相関を利用してベクトル量子化されたＬＳＰパラメータを復号化することにより、伝送誤りに対する耐性を高めることができる。
【００１２】
【発明の実施の形態】
（実施の形態１）
以下、本発明の第１の実施の形態を図を用いて説明する。図１は本発明の第１の実施の形態におけるＬＳＰパラメータ符号化装置の構成を示すブロック図であり、１００はＬＳＰパラメータ算出手段、１０１はフレーム単位で独立に量子化を行なう第１の量子化手段、１０２は隣接フレーム間の相関を利用して量子化を行なう第２の量子化手段、１０３、１０４は復号化手段、１０５は誤差比較手段、１０６は量子化手段を切り換えるスイッチである。また、１０７は入力音声信号、１０８は算出したＬＳＰパラメータ、１０９は第１の量子化手段１０１の出力符号、１１０は第２の量子化手段１０２の出力符号、１１１は第１の量子化手段１０１による量子化値、１１２は第２の量子化手段１０２による量子化値、１１３はスイッチ１０６の切り換えを制御する信号、１１４は出力符号である。
【００１３】
次に、上記実施の形態の動作について説明する。ＬＳＰパラメータ算出手段１００によって算出したＬＳＰパラメータ１０８は、それぞれ第１の量子化手段１０１と、第２の量子化手段１０２に入力される。第１の量子化手段１０１は、フレーム単位で独立に量子化を行ない、符号１０９を出力する。同様に、第２の量子化手段１０２は、隣接フレーム間の相関を利用して量子化を行ない、符号１１０を出力する。復号化手段１０３は、符号１０９から第１の量子化手段１０１による量子化値１１１を復号し、復号化手段１０４は、符号１１０から第２の量子化手段１０２による量子化値１１２を復号する。誤差比較手段１０５は、量子化値１１１および１１２とＬＳＰパラメータ１０８との誤差をそれぞれ算出、比較し、スイッチ１０６を切り換えることによって、誤差の小さい方の量子化手段を選択し、選択した量子化手段の出力符号をこの符号化装置の出力符号１１４として出力する。
【００１４】
このように、本実施の形態によれば、入力音声信号の状態に関わらず安定した量子化精度が期待できる第１の量子化手段１０１と、入力音声信号が定常に近い状態で高い量子化精度が期待できる第２の量子化手段１０２の、２つの異なる量子化方法の量子化手段を切り換えて使用することにより、入力音声信号の状態に関わらず、高い安定した量子化精度を得ることができる。
【００１５】
また、第２の量子化手段１０２は、隣接フレーム間の相関を利用して量子化を行なうため、伝送誤りにより影響が次フレーム以降に伝搬するが、第１の量子化手段１０１は、フレーム単位で独立に量子化を行なうため、誤りによる影響は伝搬しない。したがって、誤りによる影響の伝搬は、第２の量子化手段が連続して選択されている区間に限られ、第１の量子化手段が選択されたフレーム以降には伝搬しない。第１の量子化手段と第２の量子化手段とがそれぞれ選択される確率は、入力音声信号の性質によって大きく変化するが、通常の会話では１対１から１対２程度であり、どちらかの量子化手段が長い区間にわたって連続して選択されることは少ない。したがって、誤りによる影響の伝搬は短い区間に限定され、誤りによる影響が伝搬し続ける従来例に対して、誤りに対する耐性が高い。
【００１６】
（実施の形態２）
図２は本発明の第２の実施の形態の構成を示すブロック図であり、図１の第２の量子化手段１０２の詳細を示すものである。２００はＬＳＰパラメータ算出手段であり、図１のＬＳＰパラメータ算出手段１００と同じものである。２０１は第１段目の誤差最小化手段、２０２は第１の符号帳、２０３、２０７は復号化手段、２０４は過去の量子化値から現フレームの値を線形に予測する予測手段、２０５は第２段目の誤差最小化手段、２０６は第２の符号帳、２０８は過去の量子化値を蓄えておくバッファである。また、２１０は入力音声信号、２１１は算出した現フレームのＬＳＰパラメータ、２１２は第１段階の出力符号、２１３は第１段階の量子化値、２１４は第２段階の出力符号、２１５は現フレームの量子化値、２１６は過去の量子化値、２１７は予測された現フレームのＬＳＰパラメータである。
【００１７】
次に上記実施の形態の動作について説明する。ＬＳＰパラメータ算出手段２００は、入力音声信号２１０から現フレームのＬＳＰパラメータ２１１を算出する。第１段階として、第１段目の誤差最小化手段２０１は、第１の符号帳２０２からＬＳＰパラメータ２１１との誤差が最小となる符号を選択し、出力符号２１２として出力する。第２段階として、予測手段２０４は、復号化手段２０３によって復号された第１段階の量子化値２１３と、バッファ２０８に蓄えられた過去の量子化値２１６とから現フレームのＬＳＰパラメータ２１７を線形に予測する。第２段目の誤差最小化手段２０５は、予測されたＬＳＰパラメータ２１７と入力音声信号２１０とから算出された現フレームのＬＳＰパラメータ２１１との誤差が最小となる符号を、第２の符号帳２０６から選択し、出力符号２１４として出力する。復号化手段２０７は、出力符号２１４とから、現フレームの量子化値２１５を復号し、バッファ２０８に格納する。
【００１８】
ここで、第２段階の処理を図３を用いて説明する。図３において、３００は前フレームのＬＳＰパラメータの量子化前の値、３０１は現フレームのＬＳＰパラメータの量子化前の値、３０２は前フレームの量子化値、３０３は現フレームの第１段階の量子化値、３０４は現フレームの予測値、３０５は予測値と量子化前の値との誤差、３０６は現フレームの量子化値である。
【００１９】
現フレームの予測値３０４は、前フレームの量子化値３０２と現フレームの第１段階の量子化値３０３を用いて、
ｐ_ｎ＝αｑ_ｎ−１＋（１−α）υ_ｎ
よって、誤差３０５は、

また、現フレームの量子化値３０６は、

と表される。ここで、αは予測係数、ｄ＾_ｎは誤差３０５を近似する符号ベクトルである。第２段目の誤差最小化手段２０５は、現フレームのＬＳＰパラメータ３０１と現フレームの量子化値３０６の誤差を最小にする予測係数αと符号ベクトルｄ＾_ｎの組を第２の符号帳２０６から選択し、符号を出力する。
【００２０】
なお、予測係数αを固定とすることにより、第２段目の誤差最小化の処理は、誤差３０５に対して誤差が最小となる符号ベクトルを選択するのみとなり、演算量が削減される。
【００２１】
このように、本実施の形態によれば、現フレームの予測値を、過去のフレームの情報と現フレームの情報とから予測するため、復号化する際に、過去のフレームの情報に伝送誤りによる影響があっても、現フレームの予測値に値する影響を低減することができ、伝送誤りに値する耐性を高めることができる。
【００２２】
（実施の形態３）
図４は本発明の第３の実施の形態の構成を示すブロック図であり、上記第１および第２の実施の形態の符号化装置に対応する復号化装置の構成を示すものである。図４において、４００は伝送誤り検出手段、４０１はスイッチ制御手段、４０２は第１の量子化手段による符号ベクトルを格納する符号帳、４０３は第２の量子化手段の第１段階による符号ベクトルを格納する符号帳、４０４は第２の量子化手段の第２段階による符号ベクトルを格納する符号帳、４０５は予測手段、４０６は復号化手段、４０７、４０８は復号化手段を切り換えるスイッチ、４０９は出力する復号値を切り換えるスイッチ、４１０は前フレームの量子化値を蓄えるバッファである。また、４１１は伝送符号、４１２は第１の量子化手段による量子化値、４１３は第２の量子化手段の第１段階での量子化値、４１４は現フレームの予測値、４１５は第２の量子化手段の第２段階での量子化値、４１６は復号化装置の出力量子化値である。
【００２３】
次に上記実施例の動作について説明する。伝送符号が前記符号化装置における第１の量子化手段による符号であれば、スイッチ４０７、４０８を連動してａ側に、第２の量子化手段による符号であれば、スイッチ４０７、４０８をｂ側に切り換えることによって、第１、第２のそれぞれの量子化手段に対応する復号手段で量子化値を復号することができる。第２の量子化手段による伝送を復号化する場合において、伝送符号に誤りがないフレームでは、スイッチ制御手段４０１は、スイッチ４０９のＡ、Ｂ、Ｃ、Ｄ、Ｅ、Ｆの６つの端子のうちＡ−Ｂ間と、Ｃ−Ｄ間を接続する。この状態では、各復号手段からの復号値は正しく復号されて出力される。伝送誤り検出手段４００が伝送誤りを検出したフレームでは、スイッチ制御手段４０１は、スイッチ４０９の端子のうち、Ｄ−Ｅ間を接続する。この状態では、伝送符号４１１は無視され、バッファ４１０に蓄えられた前フレームの量子化値が出力される。伝送誤り検出手段４００が誤りを検出したフレームの次フレーム以降、第２の量子化手段による符号が連続する限り、スイッチ制御手段４０１は、スイッチ４０９の端子のうちＡＦ間を接続する。この状態では、第２の量子化手段による符号のうち、第１段階の符号のみによって復号された量子化値４１３が出力され、第２段階は無視される。伝送誤り検出手段４００が誤りを検出したフレームの次フレーム以降、最初に第１の量子化手段による符号が伝送されたフレームで、スイッチ制御手段４０１は、スイッチ４０９の端子のうちＡ−Ｂ、Ｃ−Ｄ間を接続し、誤りを検出する前の状態に戻る。
【００２４】
このように、本実施の形態によれば、誤りが生じたフレームの次フレーム以降で、過去の誤りの影響を伝搬する第２の量子化手段の第２段階をパスすることにより、誤りによる影響が次フレーム以降に伝搬することを防ぎ、誤りによる影響を最小限に抑えることができる。
【００２５】
（参考例）
次に、上記各実施例１乃至３を適用した符号化復号化装置を参考例として示す。図５は本発明の参考例の構成を示すブロック図であり、上記第１および第２の実施例の符号化装置と第３の実施例の復号化装置とを組み合わせたものである。図５の符号化側において、５００は第１の量子化手段、５０１は第２の量子化手段、５０２は量子化手段５００、５０１を切り換えるスイッチ、５０８は出力符号であり、これら以外の詳細な構成は上記第１および第２の実施例と同じである。復号化側において、５０３は伝送誤り検出手段、５０４は誤り頻度判定手段、５０５は第１の復号化手段、５０６は第２の復号化手段、５０７は復号化手段５０５、５０６を切り換えるスイッチ、５０９は復号化側の入力符号であり、これら以外の詳細な構成は上記第３の実施例と同じである。
【００２６】
次に、上記参考例の動作について説明する。復号化側の誤り検出手段５０３は、伝送されてきた入力符号５０９の伝送誤りを検出する。誤り頻度判定手段５０４は、検出された伝送誤りの頻度を定められたしきい値と比較し、誤り頻度がしきい値未満であれば、第１の量子化手段５００と第２の量子化手段５０１のうち、量子化誤差が小さい方の量子化手段をスイッチ５０２により選択し、誤り頻度がしきい値以上であれば、スイッチ５０２を第１の量子化手段５００側に固定する。復号化側の動作は、上記第３の実施例と同じである。
【００２７】
伝送誤りの頻度が高くなると、復号化において第２の量子化手段５０１の第２段階がパスされる割合が増加し、復号した量子化値の精度が低下する。したがって、本参考例のように、誤りの頻度を監視し、頻度が高い場合には、相手の符号化側のスイッチを第１の量子化手段５００に固定することにより、復号化側で復号した量子化値の精度の低下を少なくすることができる。また、双方向の伝送路では、復号化側が受信した入力符号５０９の誤り頻度から、符号化側が送信する出力符号５０８の相手側受信時の誤り頻度が推定できるので、本参考例のように、復号化側での誤り頻度による自分の符号化側の量子化手段を切り換えスイッチ５０２の制御を双方で行なえば、付加情報を付け加えることなく、伝送誤りに対する耐性を高めることができる。
【００２８】
【発明の効果】
以上のように、本発明は、ＬＳＰパラメータを復号化する復号化装置に、フレーム単位で独立にベクトル量子化されたＬＳＰパラメータを復号化して量子化値を得る第１の復号化手段と、前記第１の復号化手段で得られた量子化値と参照フレームの量子化値から求めた予測値を用いてフレーム間の相関を利用してベクトル量子化されたＬＳＰパラメータを復号化する第２の復号化手段とを備えたことにより、入力音声信号の状態に関わらず、安定した高い復号化精度が得られるという効果がある。
【００２９】
また、本発明は、隣接フレーム間の相関を利用する復号化動作では、第１の復号化手段で得られた量子化値と参照フレームの量子化値から予測値を求め、この予測値を用いてフレーム間の相関を利用してベクトル量子化されたＬＳＰパラメータを復号化することにより、伝送誤りに対する耐性を高めるという効果がある。
【図面の簡単な説明】
【図１】本発明の第１の実施の形態におけるＬＳＰパラメータ符号化装置の構成を示すブロック図
【図２】本発明の第２の実施の形態の構成として、図１中の第２の量子化手段の詳細を示すブロック図
【図３】本発明の第２の実施の形態の第２の量子化手段における第２段階の処理を示す模式図
【図４】本発明の第３の実施の形態の構成として、第１および第２の実施の形態の符号化装置に対応する復号化装置の構成を示すブロック図
【図５】本発明の参考例として、第１および第２の実施の形態の符号化装置と第３の実施の形態の復号化装置とを組み合わせた符号復号化装置の構成を示すブロック図
【図６】従来例のフレーム間の相関を利用するＬＳＰパラメータ量子化装置の構成を示すブロック図
【符号の説明】
１００ＬＳＰパラメータ
１０１フレーム単位で独立に量子化を行なう第１の量子化手段
１０２隣接フレーム間の相関を利用して量子化を行なう第２の量子化手段
１０３、１０４復号化手段
１０５誤差比較手段
１０６量子化手段を切り換えるスイッチ
１０７入力音声信号
１０８算出したＬＳＰパラメータ
１０９第１の量子化手段１０１の出力符号
１１０第２の量子化手段１０２の出力符号
１１１第１の量子化手段１０１による量子化値
１１２第２の量子化手段１０２による量子化値
１１３スイッチ１０６の切り換えを制御する信号
１１４出力符号
２００ＬＳＰパラメータ算出手段
２０１第１段目の誤差最小化手段
２０２第１の符号帳
２０３、２０７復号化手段
２０４過去の量子化値から現フレームの値を線形に予測する予測手段
２０５第２段目の誤差最小化手段
２０６第２の符号帳
２０８過去の量子化値を蓄えておくバッファ
２１０入力音声信号
２１１現フレームのＬＳＰパラメータ
２１２第１段階の出力符号
２１３第１段階の量子化値
２１４第２段階の出力符号
２１５現フレームの量子化値
２１６過去の量子化値
２１７予測された現フレームのＬＳＰパラメータ
３００前フレームのＬＳＰパラメータの量子化前の値
３０１現フレームのＬＳＰパラメータの量子化前の値
３０２前フレームの量子化値
３０３現フレームの第１段階の量子化値
３０４現フレームの予測値
３０５予測値と量子化前の値との誤差
３０６現フレームの量子化値
４００伝送誤り検出手段
４０１スイッチ制御手段
４０２第１の量子化手段による符号ベクトルを格納する符号帳
４０３第２の量子化手段の第１段階による符号ベクトルを格納する符号帳
４０４第２の量子化手段の第２段階による符号ベクトルを格納する符号帳
４０５予測手段
４０６復号化手段
４０７、４０８復号化手段を切り換えるスイッチ
４０９出力する復号値を切り換えるスイッチ
４１０前フレームの量子化値を蓄えるバッファ
４１２第１の量子化手段による量子化値
４１３第２の量子化手段の第１段階による量子化値
４１４現フレームの予測値
４１５第２の量子化手段の第２段階による量子化値
４１６復号化手段の出力量子化値
５００第１の量子化手段
５０１第２の量子化手段
５０２量子化手段を切り換えるスイッチ
５０３誤り検出手段
５０４誤り頻度判定手段
５０５第１の復号化手段
５０６第２の復号化手段
５０７スイッチ
５０８符号化側の出力符号
５０９復号化側の入力符号
６００ＬＳＰパラメータ
６０１過去の量子化値を蓄えておくバッファ
６０２過去の量子化値から現フレームの値を線形に予測する予測手段
６０３予測値と入力値との誤差を最小にする符号を符号帳から選択する誤差最小化手段
６０４符号帳
６０５出力符号から量子化値を復号する復号化手段
６０６入力音声信号
６０７現フレームのＬＳＰパラメータ
６０８出力符号
６０９現フレームの量子化値
６１０過去の量子化値
６１１予測された現フレームのＬＳＰパラメータ[0001]
TECHNICAL FIELD OF THE INVENTION
The present invention relates to an encoding / decoding device for an LSP parameter, which is a characteristic parameter of spectrum information of an audio signal.
[0002]
[Prior art]
2. Description of the Related Art Conventionally, in an audio encoding device having a bit rate of about 4 to 8 kbqs, a method of analyzing an audio signal to separate and encode spectral information and sound source information is mainly used. The LSP parameter is a characteristic parameter representing spectrum information, and usually requires about ten orders per frame. The most basic method of encoding LSP parameters is a method of quantizing individual values as a scalar. However, since the quantization effect is low, vector quantization for collectively quantizing a plurality of LSP parameters is often used. Used. In addition, since the LSP parameter has a large correlation between adjacent frames, quantization efficiency can be improved by using the correlation between frames.
[0003]
FIG. 6 is a block diagram showing a configuration of a conventional LSP parameter quantization device using correlation between frames, 600 is an LSP parameter calculation means, 601 is a buffer for storing past quantization values, and 602 is a past buffer. Prediction means for linearly predicting the value of the current frame from the quantized value; 603, an error minimizing means for selecting a code for minimizing an error between the predicted value and the input value from a codebook; 604, a codebook; It is a decoding means for decoding a quantized value from a code. Reference numeral 606 denotes an input audio signal, 607 denotes an LSP parameter of the current frame, 608 denotes an output code, 609 denotes a quantization value of the current frame, 610 denotes a past quantization value, and 611 denotes a predicted LSP parameter of the current frame. .
[0004]
The processing in the conventional LSP parameter quantization device configured as described above will be described. The LSP parameter calculation means 600 calculates an LSP parameter 607 of the current frame from the input audio signal 606. The prediction unit 602 linearly predicts the LSP parameters of the current frame from the past quantization values 610 stored in the buffer 601. The error minimizing means 603 calculates an error between the LSP parameter 607 calculated from the input audio signal and the LSP parameter 611 predicted from the past quantization value, and selects a code that minimizes the error from the codebook 604. , Its sign is output. The decoding unit 605 decodes the quantized value from the output code 608, and the decoded quantized value 609 is stored in the buffer 601.
[0005]
[Problems to be solved by the invention]
However, in the above-described conventional apparatus, a high prediction gain is obtained in a state where the input audio signal is almost steady, and high-precision quantization can be performed. Also, the accuracy of quantization is reduced. When the frame length increases, the transient factor between adjacent frames increases, and the inter-frame correlation decreases, so that the prediction gain similarly decreases. Therefore, the quantization method of performing prediction using the correlation between adjacent frames is suitable for a speech coding method with a short frame length while the input speech signal is easily regarded as stationary between adjacent frames, but is suitable for a speech coding method with a short frame length. It was difficult to apply to the encoding method.
[0006]
In addition, since the present value is predicted from the past quantization value, the effect of a code error occurring on the transmission path propagates not only to the error frame but also to the subsequent frames, and thus is susceptible to errors.
[0007]
An object of the present invention is to solve the above-mentioned conventional problem, and to provide an LSP parameter decoding apparatus and a decoding method capable of securing high quantization accuracy and improving error resistance even when an input audio signal is in a transient state. The aim is to provide a method.
[0008]
[Means for Solving the Problems]
In order to achieve the above object, the present invention provides a decoding device for decoding an LSP parameter which is a characteristic parameter of spectrum information of an audio signal, and decodes an LSP parameter which is independently vector-quantized for each frame. A first decoding means for obtaining a quantization value by using a correlation between frames using a quantization value obtained by the first decoding means and a prediction value obtained from a quantization value of a reference frame. And a second decoding means for decoding the LSP parameter which has been vector-quantized.
[0009]
The present invention also relates to a decoding method for decoding an LSP parameter which is a characteristic parameter of spectrum information of an audio signal, wherein a first decoding step for decoding an LSP parameter which is vector-quantized independently in a frame unit. An LSP parameter that is vector-quantized using a correlation between frames using the prediction value and a prediction value from the quantization value obtained in the first decoding step and the quantization value of the reference frame. The LSP parameter is decoded by each processing operation of the second decoding step of decoding
[0010]
[Action]
Therefore, according to the present invention, the first decoding unit that performs decoding independently on a frame basis is used in a portion where the correlation between adjacent frames is small, and the first decoding unit is used in a portion where the correlation between adjacent frames is large. By decoding the vector-quantized LSP parameter using the correlation between frames using the quantization value obtained by the quantization means and the prediction value obtained from the quantization value of the reference frame, the input speech signal is decoded. Regardless of the state, stable high decoding accuracy can be obtained.
[0011]
Further, in the present invention, in a decoding operation using correlation between adjacent frames, a prediction value is obtained from a quantization value obtained by the first decoding unit and a quantization value of a reference frame, and the prediction value is calculated using the prediction value. By decoding the vector-quantized LSP parameters using the correlation between frames, it is possible to increase the resistance to transmission errors.
[0012]
BEST MODE FOR CARRYING OUT THE INVENTION
(Embodiment 1)
Hereinafter, a first embodiment of the present invention will be described with reference to the drawings. FIG. 1 is a block diagram showing a configuration of an LSP parameter encoding apparatus according to a first embodiment of the present invention, wherein 100 is an LSP parameter calculating means, and 101 is a first quantization which performs quantization independently on a frame basis. Means 102, second quantizing means for performing quantization using the correlation between adjacent frames, 103 and 104 decoding means, 105 error comparing means, and 106 a switch for switching the quantizing means. Further, 107 is an input audio signal, 108 is a calculated LSP parameter, 109 is an output code of the first quantization means 101, 110 is an output code of the second quantization means 102, and 111 is a first quantization means 101 , 112 is a quantization value by the second quantization means 102, 113 is a signal for controlling switching of the

switch

106, and 114 is an output code.
[0013]
Next, the operation of the above embodiment will be described. The LSP parameters 108 calculated by the LSP parameter calculation means 100 are input to the first quantization means 101 and the second quantization means 102, respectively. The first quantization means 101 performs quantization independently for each frame, and outputs a code 109. Similarly, the second quantization means 102 performs quantization using the correlation between adjacent frames, and outputs a code 110. The decoding unit 103 decodes the quantization value 111 by the first quantization unit 101 from the code 109, and the decoding unit 104 decodes the quantization value 112 by the second quantization unit 102 from the code 110. The error comparing means 105 calculates and compares the errors between the

quantized values

111 and 112 and the LSP parameter 108, and switches the switch 106 to select the quantizing means with the smaller error, and selects the selected quantizing means. Is output as an output code 114 of the encoding apparatus.
[0014]
As described above, according to the present embodiment, the first quantization means 101 which can expect stable quantization accuracy irrespective of the state of the input audio signal, and the high quantization accuracy when the input audio signal is almost stationary. By switching and using the quantization means of the two different quantization methods of the second quantization means 102 which can be expected to obtain high stable quantization accuracy regardless of the state of the input audio signal. .
[0015]
In addition, since the second quantization means 102 performs quantization using the correlation between adjacent frames, the effect of the transmission error propagates to the next and subsequent frames, but the first quantization means 101 , Quantization is performed independently, so that the effects of errors do not propagate. Therefore, the propagation of the influence of the error is limited to the section in which the second quantization means is continuously selected, and does not propagate beyond the frame in which the first quantization means is selected. The probability that the first quantization means and the second quantization means are selected greatly varies depending on the properties of the input speech signal, but is about one-to-one to one-to-two in normal conversation. Is rarely selected continuously over a long interval. Therefore, the propagation of the influence of the error is limited to a short section, and the resistance to the error is higher than that of the conventional example in which the influence of the error continues to propagate.
[0016]
(Embodiment 2)
FIG. 2 is a block diagram showing the configuration of the second embodiment of the present invention, and shows the details of the second quantization means 102 in FIG. Reference numeral 200 denotes an LSP parameter calculation unit, which is the same as the LSP parameter calculation unit 100 in FIG. 201 is a first stage error minimizing means, 202 is a first codebook, 203 and 207 are decoding means, 204 is a predicting means for linearly predicting the value of the current frame from past quantized values, and 205 is a predicting means. The error minimizing means at the second stage, 206 is a second codebook, and 208 is a buffer for storing past quantized values. 210 is an input audio signal, 211 is a calculated LSP parameter of the current frame, 212 is a first-stage output code, 213 is a first-stage quantization value, 214 is a second-stage output code, and 215 is a current frame. , 216 are past quantization values, and 217 is a predicted LSP parameter of the current frame.
[0017]
Next, the operation of the above embodiment will be described. The LSP parameter calculation means 200 calculates the LSP parameter 211 of the current frame from the input audio signal 210. In the first stage, the first-stage error minimizing means 201 selects a code that minimizes an error from the LSP parameter 211 from the first codebook 202 and outputs it as an output code 212. As a second stage, the prediction unit 204 linearly converts the LSP parameter 217 of the current frame from the first stage quantization value 213 decoded by the decoding unit 203 and the past quantization value 216 stored in the buffer 208. To predict. The second-stage error minimizing means 205 assigns a code that minimizes an error between the predicted LSP parameter 217 and the LSP parameter 211 of the current frame calculated from the input audio signal 210 to the second codebook 206. And outputs it as an output code 214. The decoding means 207 decodes the quantized value 215 of the current frame from the output code 214 and stores it in the buffer 208.
[0018]
Here, the second stage processing will be described with reference to FIG. In FIG. 3, reference numeral 300 denotes a pre-quantization value of the LSP parameter of the previous frame, 301 denotes a pre-quantization value of the LSP parameter of the current frame, 302 denotes a quantization value of the previous frame, and 303 denotes a first stage of the current frame. The quantized value, 304 is the predicted value of the current frame, 305 is the error between the predicted value and the value before quantization, and 306 is the quantized value of the current frame.
[0019]
The prediction value 304 of the current frame is obtained by using the quantization value 302 of the previous frame and the quantization value 303 of the first stage of the current frame.
_{_{p n = αq n-1 +}} (1-α) υ n
Therefore, the error 305 is

Also, the quantization value 306 of the current frame is

It is expressed as Here, α is a prediction coefficient, and d ＾ _n is a code vector approximating the error 305. The second-stage error minimizing means 205 converts a set of a prediction coefficient α and a code vector d ＾ _n that minimizes an error between the LSP parameter 301 of the current frame and the quantized value 306 of the current frame into a second codebook 206. And outputs the sign.
[0020]
By fixing the prediction coefficient α, the error minimization process in the second stage only selects a code vector that minimizes the error with respect to the error 305, thereby reducing the amount of calculation.
[0021]
As described above, according to the present embodiment, the prediction value of the current frame is predicted from the information of the past frame and the information of the current frame. Even if there is an effect, it is possible to reduce the effect worth the predicted value of the current frame, and it is possible to increase the robustness worth the transmission error.
[0022]
(Embodiment 3)
FIG. 4 is a block diagram showing the configuration of the third embodiment of the present invention, and shows the configuration of a decoding device corresponding to the encoding devices of the first and second embodiments. In FIG. 4, 400 is a transmission error detecting means, 401 is a switch control means, 402 is a codebook storing a code vector by the first quantizing means, and 403 is a code vector in the first stage of the second quantizing means. A codebook to be stored, 404 is a codebook to store a code vector in the second stage of the second quantization means, 405 is prediction means, 406 is decoding means, 407 and 408 are switches for switching decoding means, and 409 is A switch 410 for switching the output decoded value is a buffer for storing the quantized value of the previous frame. 411 is a transmission code, 412 is a quantization value by the first quantization means, 413 is a quantization value in the first stage of the second quantization means, 414 is a predicted value of the current frame, and 415 is a second frame prediction value. , 416 are output quantization values of the decoding device.
[0023]
Next, the operation of the above embodiment will be described. If the transmission code is a code by the first quantization means in the encoding apparatus, the

switches

407 and 408 are linked to a side, and if the transmission code is a code by the second quantization means, the

switches

407 and 408 are set to b By switching to the side, the quantization value can be decoded by the decoding means corresponding to the first and second quantization means. In the case of decoding the transmission by the second quantizing means, in a frame having no error in the transmission code, the switch control means 401 sets the switch 409 out of the six terminals A, B, C, D, E, and F A connection is made between AB and CD. In this state, the decoded value from each decoding means is correctly decoded and output. In the frame in which the transmission error detection unit 400 has detected the transmission error, the switch control unit 401 connects the terminals D and E of the switch 409. In this state, the transmission code 411 is ignored, and the quantized value of the previous frame stored in the buffer 410 is output. The switch control unit 401 connects the AF among the terminals of the switch 409 as long as the code by the second quantization unit continues from the frame following the frame in which the transmission error detection unit 400 has detected the error. In this state, the quantized value 413 decoded by only the code of the first stage among the codes by the second quantization means is output, and the second stage is ignored. After the frame following the frame in which the transmission error detecting means 400 has detected an error, in the frame in which the code by the first quantization means is transmitted first, the switch control means 401 -Connect between D and return to the state before error detection.
[0024]
As described above, according to the present embodiment, the second stage of the second quantization unit that propagates the influence of the past error is passed after the next frame of the frame in which the error has occurred, so that the influence of the error can be reduced. Can be prevented from propagating after the next frame, and the effect of errors can be minimized.
[0025]
(Reference example)
Next, an encoding / decoding apparatus to which each of the first to third embodiments is applied will be described as a reference example. FIG. 5 is a block diagram showing the configuration of the reference example of the present invention, which is a combination of the encoding devices of the first and second embodiments and the decoding device of the third embodiment. On the encoding side in FIG. 5, reference numeral 500 denotes a first quantization unit, 501 denotes a second quantization unit, 502 denotes a switch for switching between the

quantization units

500 and 501, and 508 denotes an output code. The configuration is the same as in the first and second embodiments. On the decoding side, 503 is a transmission error detecting means, 504 is an error frequency judging means, 505 is a first decoding means, 506 is a second decoding means, 507 is a switch for switching between the decoding means 505 and 506, 509 Is an input code on the decoding side, and the other detailed configuration is the same as that of the third embodiment.
[0026]
Next, the operation of the above reference example will be described. The error detection means 503 on the decoding side detects a transmission error of the transmitted input code 509. The error frequency determination means 504 compares the frequency of the detected transmission error with a predetermined threshold, and if the error frequency is less than the threshold, the first quantization means 500 and the second quantization means The switch 502 selects the quantization unit having the smaller quantization error from among the 501, and fixes the switch 502 to the first quantization unit 500 if the error frequency is equal to or greater than the threshold value. The operation on the decoding side is the same as in the third embodiment.
[0027]
When the frequency of transmission errors increases, the rate at which the second stage of the second quantization means 501 is passed in decoding increases, and the accuracy of the decoded quantization value decreases. Therefore, as in the present reference example, the frequency of errors is monitored, and when the frequency is high, decoding is performed on the decoding side by fixing the switch on the encoding side of the other party to the first quantization means 500. A decrease in precision of the quantization value can be reduced. On the other hand, in the bidirectional transmission path, the error frequency of the output code 508 transmitted by the encoding side at the receiving side can be estimated from the error frequency of the input code 509 received by the decoding side. By controlling the switch 502 to switch the quantization means on the encoding side based on the error frequency on the decoding side, the resistance to transmission errors can be increased without adding additional information.
[0028]
【The invention's effect】
As described above, the present invention provides a decoding device that decodes LSP parameters, a first decoding unit that obtains a quantized value by decoding LSP parameters that are vector-quantized independently on a frame-by-frame basis, A second decoding unit that decodes the vector-quantized LSP parameter using the correlation between frames using the quantization value obtained by the first decoding unit and the prediction value obtained from the quantization value of the reference frame; By providing the decoding means, there is an effect that stable and high decoding accuracy can be obtained regardless of the state of the input audio signal.
[0029]
Further, in the present invention, in a decoding operation utilizing correlation between adjacent frames, a prediction value is obtained from a quantization value obtained by the first decoding means and a quantization value of a reference frame, and the prediction value is used. By decoding the vector-quantized LSP parameter using the correlation between the frames, there is an effect of increasing the resistance to transmission errors.
[Brief description of the drawings]
FIG. 1 is a block diagram showing a configuration of an LSP parameter encoding device according to a first embodiment of the present invention; FIG. 2 is a block diagram showing a configuration of an LSP parameter encoding device according to a second embodiment of the present invention; FIG. 3 is a block diagram showing details of the quantization means; FIG. 3 is a schematic diagram showing a second-stage process in the second quantization means according to the second embodiment of the present invention; FIG. 5 is a block diagram showing a configuration of a decoding device corresponding to the encoding devices of the first and second embodiments as a configuration of the embodiment. FIG. 5 is a block diagram showing the first and second embodiments as a reference example of the present invention. FIG. 6 is a block diagram showing a configuration of a code decoding apparatus in which the encoding apparatus of FIG. 1 and the decoding apparatus of the third embodiment are combined. FIG. 6 is a configuration of a conventional LSP parameter quantization apparatus using correlation between frames. [Description of reference numerals]
100 LSP parameter 101 First quantization means 102 for performing quantization independently in frame units Second quantization means 103 and 104 for performing quantization using the correlation between adjacent frames Decoding means 105 Error comparison means 106 Switch 107 for switching quantization means Input audio signal 108 Calculated LSP parameter 109 Output code 110 of first quantization means 101 Output code 111 of second quantization means 102 Quantized value 112 of first quantization means 101 Quantized value 113 by second quantizing means 102 Signal 114 for controlling switching of switch 106 Output code 200 LSP parameter calculating means 201 First-stage error minimizing means 202 First codebook 203, 207 Decoding means 204 Prediction means 2 for linearly predicting the value of the current frame from the past quantized value 05 Second-stage error minimizing means 206 Second codebook 208 Buffer 210 for storing past quantization values Input audio signal 211 LSP parameter 212 of current frame First-stage output code 213 First-stage quantum Quantized value 214 Output code of second stage 215 Quantized value of current frame 216 Past quantized value 217 LSP parameter of predicted current frame 300 Value of LSP parameter of previous frame before quantization 301 LSP parameter of current frame Value before quantization 302 Quantized value of previous frame 303 Quantized value of first stage of current frame 304 Predicted value of current frame 305 Error between predicted value and value before quantization 306 Quantized value of current frame 400 Transmission Error detection means 401 Switch control means 402 Codebook 403 storing code vectors by first quantization means Second Codebook 404 storing the code vector according to the first stage of the quantization means of the above (2) Codebook 405 storing the code vector according to the second step of the second quantization means Prediction means 406 Decoding means 407, 408 Switching between the decoding means Switch 409 Switch for switching the decoded value to be output 410 Buffer 412 for storing the quantization value of the previous frame Quantization value 413 by the first quantization means Quantization value 414 by the first stage of the second quantization means 414 Prediction of the current frame Value 415 Quantized value in second stage of second quantizing means 416 Output quantized value of decoding means 500 First quantizing means 501 Second quantizing means 502 Switch for switching quantization means 503 Error detecting means 504 Error frequency determination means 505 First decoding means 506 Second decoding means 507 Switch 508 Encoding side Output code 509 Decoding-side input code 600 LSP parameter 601 Buffer 602 for storing past quantized values Predicting means 603 for linearly predicting the value of the current frame from past quantized values Error between predicted value and input value Error minimizing means 604 for selecting a code that minimizes from the codebook 604 codebook 605 decoding means 606 for decoding the quantized value from the output code input audio signal 607 LSP parameter 608 of current frame output code 609 quantization of current frame Value 610 Past quantization value 611 Predicted LSP parameter of current frame

Claims

What is claimed is: 1. A decoding device for decoding an LSP parameter, which is a characteristic parameter of spectrum information of an audio signal, comprising: And decoding a vector-quantized LSP parameter using correlation between frames using a quantization value obtained by the first decoding unit and a prediction value obtained from a quantization value of a reference frame. An LSP parameter decoding device comprising: a second decoding unit.

A decoding method for decoding an LSP parameter, which is a feature parameter of spectrum information of an audio signal, comprising: a first decoding step of decoding a vector-quantized LSP parameter independently for each frame; A predicted value is obtained from the quantized value obtained in the decoding step and the quantized value of the reference frame, and the vector-quantized LSP parameter is decoded using the correlation between frames using the predicted value. 2. An LSP parameter decoding method, comprising: