JP4583093B2

JP4583093B2 - Bit rate extended speech encoding and decoding apparatus and method

Info

Publication number: JP4583093B2
Application number: JP2004203105A
Authority: JP
Inventors: 昌用孫; 康殷李; 尚遠姜; 相鉉池
Original assignee: Samsung Electronics Co Ltd
Current assignee: Samsung Electronics Co Ltd
Priority date: 2003-07-09
Filing date: 2004-07-09
Publication date: 2010-11-17
Anticipated expiration: 2024-07-09
Also published as: DE602004004950T2; EP1496500A1; DE602004004950D1; US7702504B2; US20050010404A1; EP1496500B1; JP2005031683A

Abstract

Provided are a signal-to-noise ratio (SNR) bitrate scalable speech coding and decoding apparatus and a method of using the same. The coding apparatus includes a base layer (100), a speech quality enhancement layer (130), and a multiplexer (140). The base layer filters an input speech signal using linear prediction coding (104) and generates an excitation signal corresponding to the filtered speech signal through fixed codebook search (117) and adaptive codebook search (115). The speech quality enhancement layer searches a fixed codebook (131) using parameters obtained through the fixed codebook search in the base layer, or searches the fixed codebook using a target signal, which is obtained by removing a contribution of a fixed codebook of the base layer and a signal which is obtained by synthesizing and filtering a previous fixed codebook of the speech quality enhancement layer from a target signal for the fixed codebook search of the base layer. The multiplexer (140) multiplexes signals generated by the base layer and the at least one speech quality enhancement layer. <IMAGE>

Description

本発明はＣＥＬＰ（ＣｏｄｅＥｘｃｉｔｅｄＬｉｎｅａｒＰｒｅｄｉｃｔｉｏｎ）アルゴリズムを使用する音声コーデックに係り、特に、音質を向上させるために信号対雑音比（ＳＮＲ：ＳｉｇｎａｌｔｏＮｏｉｓｅＲａｔｉｏ）ビット率を拡張する音声符号化及び復号化装置とその方法に関する。 The present invention relates to a speech codec that uses a Code Excited Linear Prediction (CELP) algorithm, and more particularly, speech coding and decoding that extends a signal-to-noise ratio (SNR) bit rate to improve sound quality. The present invention relates to an apparatus and a method thereof.

ＣＥＬＰ構造を有する音声コーデックは現在、移動通信システムで最も広く使われるものであって、線形予測符号化（ＬｉｎｅａｒＰｒｅｄｉｃｔｉｏｎＣｏｄｉｎｇ：ＬＰＣ）を基本とする。このようなＣＥＬＰ構造を有する音声コーデックはサービスの種類によって要求される伝送率及び帯域幅が異なる。 A speech codec having a CELP structure is currently most widely used in mobile communication systems, and is based on linear prediction coding (LPC). A voice codec having such a CELP structure differs in required transmission rate and bandwidth depending on the type of service.

しかし、一般的な音声コーデックは伝送率及び帯域幅が符号化装置で設定されるので、復号化装置で伝送率及び帯域幅が選択できない。また、ネットワーク上で１つの送信端から幾つかの受信端にパケット情報を伝送するマルチキャスティングが行われる時、送信端の音声コーデックが固定されたビット率を有せば、相異なるビット率を要求する受信端に伝送されるパケット情報の質が低下されうる。 However, since the transmission rate and bandwidth are set by the encoding device in a general audio codec, the transmission rate and bandwidth cannot be selected by the decoding device. Also, when performing multicasting to transmit packet information from one transmitting end to several receiving ends on the network, if the audio codec at the transmitting end has a fixed bit rate, a different bit rate is required. The quality of packet information transmitted to the receiving end can be reduced.

これを改善するために、ビット率拡張音声符号化方式を採択した音声コーデックが提案された。このような音声コーデックは基本コーデックの情報だけでなく復元する信号をさらに正確にする情報が追加されるようにビットストリームを構成する。 In order to improve this, a speech codec adopting the bit rate extended speech coding scheme has been proposed. Such an audio codec configures a bitstream so that not only information on the basic codec but also information that makes the signal to be restored more accurate is added.

既存のビット率拡張音声符号化方式はＳＮＲビット率拡張方法と帯域幅拡張方法とに大別できる。 Existing bit rate extended speech coding schemes can be broadly divided into SNR bit rate extension methods and bandwidth extension methods.

ＳＮＲビット率拡張方法による音声符号化は階層的コーディング方式で音声信号を符号化してから復号化する。すなわち、音声信号を基本階層と音質向上階層とに分けて音声信号を符号化する。基本階層は最小限の音質が復元できる情報のみを伝送する。音質向上階層では音質を向上させうる追加情報を伝送する。 In speech coding by the SNR bit rate extension method, a speech signal is encoded by a hierarchical coding method and then decoded. That is, the audio signal is encoded by dividing the audio signal into a basic layer and a sound quality improvement layer. The basic layer transmits only information that can restore the minimum sound quality. In the sound quality improvement layer, additional information that can improve sound quality is transmitted.

しかし、既存に提案されたＳＮＲビット率拡張音声符号化装置は基本階層と音質向上階層とを独立的に符号化するように構成されている。したがって、固定コードブックを探索する時に要求される対象信号（または、ターゲットベクトル）と、インパルス応答との相関度と、エネルギーとを検出するための演算が基本階層及び音質向上階層でそれぞれ行われるので、固定コードブック探索のための媒介変数を求めるために多くの演算量が要求される。 However, the existing proposed SNR bit rate extended speech encoding apparatus is configured to encode the base layer and the sound quality improvement layer independently. Therefore, since the calculation for detecting the correlation between the target signal (or target vector) required when searching the fixed codebook, the impulse response, and the energy is performed in the basic layer and the sound quality improvement layer, respectively. Therefore, a large amount of computation is required to obtain a parametric variable for fixed codebook search.

そして、既存に提案されたＳＮＲビット率拡張音声符号化装置は、前記音質向上階層を追加で運営するために、既存の標準化されたＣＥＬＰ音声符号化器の構造を変更しており、既存の標準化されたＣＥＬＰ音声符号化器と互換されない短所を有している。 In addition, the proposed SNR bit rate extended speech coder has changed the structure of an existing standardized CELP speech coder in order to additionally operate the sound quality enhancement layer. It has the disadvantage that it is not compatible with the CELP speech encoder.

本発明が解決しようとする技術的課題は、既存の標準化された音声コーデックの固定コードブックと多層構造をなす固定コードブックとを含み、既存の標準化された音声コーデックと互換性を有するＳＮＲビット率を拡張する音声符号化及び復号化装置とその方法を提供するところにある。 The technical problem to be solved by the present invention includes a fixed codebook of an existing standardized audio codec and a fixed codebook having a multilayer structure, and an SNR bit rate compatible with an existing standardized audio codec. The present invention provides a speech encoding and decoding apparatus and method for extending the above.

本発明が解決しようとする他の技術的課題は、固定コードブック探索のための媒介変数を求める演算量が減少したＳＮＲビット率拡張音声符号化及び復号化装置とその方法を提供するところにある。 Another technical problem to be solved by the present invention is to provide an SNR bit rate enhanced speech coding and decoding apparatus and method with a reduced amount of computation for determining a parameter for searching a fixed codebook. .

本発明が解決しようとするさらに他の技術的課題は、基本階層で探索された固定コードブックの寄与度と、音質向上階層の合成された励起信号が除去された対象信号とを利用して、音質向上階層の固定コードブックを探索するＳＮＲビット率拡張音声符号化及び復号化装置とその方法を提供するところにある。 Still another technical problem to be solved by the present invention is to use the contribution of the fixed codebook searched in the basic layer and the target signal from which the synthesized excitation signal of the sound quality improvement layer is removed, It is an object of the present invention to provide an SNR bit rate extended speech encoding and decoding apparatus and method for searching a fixed codebook in a sound quality enhancement layer.

本発明が解決しようとするさらに他の技術的課題は、基本階層で探索されたパルスの位置と音質向上階層で探索されたパルスの位置とが重複されることを許容することによって、代数コードブックの限界が克服できるＳＮＲビット率拡張音声符号化及び復号化装置とその方法を提供するところにある。 Yet another technical problem to be solved by the present invention is that an algebraic codebook allows a position of a pulse searched in a basic layer and a position of a pulse searched in a sound quality improvement layer to be overlapped. The present invention provides an SNR bit rate extended speech encoding and decoding apparatus and method capable of overcoming these limitations.

本発明が解決しようとするさらに他の技術的課題は、音質向上階層での固定コードブックの利得値に対する量子化ビットを減らせるＳＮＲビット率拡張音声符号化及び復号化装置とその方法を提供するところにある。 Still another technical problem to be solved by the present invention is to provide an SNR bit rate enhanced speech coding and decoding apparatus and method capable of reducing quantization bits for a gain value of a fixed codebook in a sound quality enhancement layer. By the way.

音声信号の符号化装置において、線形予測符号化を使用して入力音声信号をフィルタリングし、固定コードブック探索及び適応コードブック探索により前記フィルタリングされた音声信号の励起信号を生成する基本階層と、前記基本階層で検出された前記固定コードブック探索のために必要な対象信号及びインパルス応答信号間の相関度ｄ（ｎ）と、パルスの符号及び前記相関度ｄ（ｎ）により定義される、前記相関度ｄ（ｎ）のサイズ情報に該当する相関度Ｃと、インパルス応答信号のエネルギーＥと、を含み、前記基本階層での固定コードブック探索により得られる媒介変数に基づいて、各パルス、各パルスの符号及び各パルスの位置を対応付けた代数コードブックを探索することで固定コードブックを探索する音質向上階層と、を少なくとも１つ含み、前記基本階層で生成される信号と前記音質向上階層で生成される信号とを多重化し、前記多重化された信号を出力する多重化器を含む音声信号の符号化装置を提供する。 In speech signal coding apparatus, and a base layer to produce an excitation signal of the linear using predictive coding filter the input audio signal, the filtered audio signal by a fixed codebook search and adaptive codebook search, the The correlation defined by the correlation d (n) between the target signal and the impulse response signal required for the fixed codebook search detected in the basic layer, and the code of the pulse and the correlation d (n) Each pulse, each pulse based on a parametric variable obtained by a fixed codebook search in the basic layer, including a degree of correlation C corresponding to size information of degree d (n) and an energy E of an impulse response signal small code and the speech quality enhancement layer searching fixed codebook by searching the algebraic codebook which associates the position of each pulse, the And a speech signal encoding device including a multiplexer that multiplexes the signal generated in the base layer and the signal generated in the sound quality enhancement layer and outputs the multiplexed signal. To do.

ここで、前記基本階層での固定コードブック探索及び音質向上階層での固定コードブック探索は代数コードブックを使用して行われる。 Here, the fixed codebook search in the basic layer and the fixed codebook search in the sound quality improvement layer are performed using an algebraic codebook.

ここで、前記音質向上階層は前記基本階層により行われた固定コードブック探索により得られた第１利得値と前記音質向上階層での固定コードブック探索により得られた第２利得値間の差を量子化する機能をさらに含む。 Here, the sound quality enhancement layer is a difference between a first gain value obtained by a fixed codebook search performed by the basic layer and a second gain value obtained by a fixed codebook search by the sound quality enhancement layer. It further includes a function to quantize.

ここで、前記多重化器は前記基本階層で生成される線形予測符号化係数の量子化情報、前記基本階層での固定コードブックインデックス、前記基本階層での適応コードブックインデックス、前記基本階層での固定コードブック利得値の量子化情報と前記基本階層での適応コードブック利得値の量子化情報、前記音質向上階層で生成される固定コードブックインデックス、及び前記基本階層の固定コードブック利得値と前記音質向上階層の固定コードブック利得値間の差値に対する量子化した情報を多重化することを特徴とする。 Here, the multiplexer includes quantization information of linear predictive coding coefficients generated in the base layer, a fixed codebook index in the base layer, an adaptive codebook index in the base layer, Quantization information of a fixed codebook gain value and quantization information of an adaptive codebook gain value in the base layer, a fixed codebook index generated in the sound quality enhancement layer, and a fixed codebook gain value of the base layer and the It is characterized in that the quantized information for the difference value between the fixed codebook gain values of the sound quality enhancement layer is multiplexed.

ここで、前記音質向上階層が複数であれば、前記多重化器は複数の音質向上階層で出力される固定コードブックインデックスと前記固定コードブック利得値間の差値に関する量子化した情報を多重化することを特徴とする。 Here, if there are a plurality of sound quality enhancement layers, the multiplexer multiplexes quantized information related to a difference value between the fixed codebook index output from the plurality of sound quality enhancement layers and the fixed codebook gain value. It is characterized by doing.

また、音声信号の符号化装置において、入力される音声信号を線形予測符号化フィルタリングし、固定コードブック探索及び適応コードブック探索により前記フィルタリングされた音声信号に対応する励起信号を生成する基本階層と、前記基本階層で検出された前記固定コードブック探索のために必要な対象信号及びインパルス応答信号間の相関度ｄ（ｎ）と、パルスの符号及び前記相関度ｄ（ｎ）により定義される、前記相関度ｄ（ｎ）のサイズ情報に該当する相関度Ｃと、インパルス応答信号のエネルギーＥと、を含み、前記基本階層での固定コードブック探索によって生成される媒介変数に基づいて、各パルス、各パルスの符号及び各パルスの位置を対応付けた代数コードブックを探索することで固定コードブックを探索する固定コードブック探索部、前記基本階層の前記固定コードブック探索により生成された第１固定コードブック利得値と前記固定コードブック探索部から出力される第２固定コードブック利得値間の差を検出し、検出された差を量子化する利得値差の量子化器を含む音質向上階層と、を複数具備し、前記基本階層で生成される信号と前記音質向上階層で生成される信号とを多重化する多重化器を含む音声信号の符号化装置を提供する。 Further, in the speech signal encoding apparatus, a base layer that performs linear predictive encoding filtering on an input speech signal and generates an excitation signal corresponding to the filtered speech signal by fixed codebook search and adaptive codebook search; , Defined by the correlation d (n) between the target signal and the impulse response signal necessary for the fixed codebook search detected in the basic layer, the code of the pulse and the correlation d (n), Each pulse includes a correlation degree C corresponding to the size information of the correlation degree d (n) and an energy E of an impulse response signal , based on a parameter generated by a fixed codebook search in the basic layer. , fixed code searching fixed codebook by searching the algebraic codebook which associates code and the position of each pulse of each pulse A book search unit for detecting a difference between a first fixed codebook gain value generated by the fixed codebook search of the base layer and a second fixed codebook gain value output from the fixed codebook search unit; A plurality of sound quality enhancement layers including a gain value difference quantizer for quantizing the difference, and multiplexing the signals generated in the basic layer and the signals generated in the sound quality enhancement layer An audio signal encoding device including an encoder is provided.

また、前記いずれかに記載の符号化装置により、基本階層と少なくとも１つの音質向上階層とに分けられて符号化された音声信号をデコードするための音声信号復号化装置において、前記符号化された音声信号のうち基本階層での符号化情報をデコードするための第１復号化ユニットと、前記音声信号復号化装置の動作環境によって前記符号化された音声信号のうち音質向上階層での符号化情報を復元する第２復号化ユニットと、前記音声信号復号化装置の動作環境によって前記第１復号化ユニットで復元された信号と前記第２復号化ユニットで復元された信号とを演算する演算ユニットと、前記第１復号化ユニットから出力される線形予測符号化係数を利用して前記演算ユニットから出力される信号を合成して音声信号を復元する音声信号復元ユニットと、を含む音声信号復号化装置を提供する。 Further, in the audio signal decoding apparatus for decoding an audio signal encoded by the encoding device according to any one of the above, the basic layer and at least one sound quality improvement layer are encoded. A first decoding unit for decoding encoded information in the basic layer of the audio signal, and encoded information in the sound quality improving layer of the encoded audio signal according to an operating environment of the audio signal decoding device A second decoding unit that restores the signal, an arithmetic unit that computes the signal restored by the first decoding unit and the signal restored by the second decoding unit according to the operating environment of the audio signal decoding device A speech signal for recovering a speech signal by synthesizing a signal output from the arithmetic unit using a linear predictive coding coefficient output from the first decoding unit; Providing an audio signal decoding apparatus including a source unit, a.

ここで、前記第１復号化ユニットは、前記基本階層での符号化情報に含まれている線形予測符号化係数の量子化情報をデコードする線形予測符号化係数の複合化部と、前記基本階層での符号化情報に含まれている固定コードブックインデックスをデコードする第１固定コードブック復号化部と、前記基本階層での符号化情報に含まれている適応コードブックインデックスをデコードする適応コードブック復号化部と、前記基本階層での符号化情報に含まれている固定コードブック利得値と適応コードブック利得値とをそれぞれデコードする利得値復号化部と、を含む。 Here, the first decoding unit includes a linear prediction coding coefficient decoding unit that decodes quantization information of linear prediction coding coefficients included in the coding information in the base layer, and the base layer. A first fixed codebook decoding unit that decodes a fixed codebook index included in the encoded information in the code, and an adaptive codebook that decodes the adaptive codebook index included in the encoded information in the base layer A decoding unit; and a gain value decoding unit that respectively decodes a fixed codebook gain value and an adaptive codebook gain value included in the encoding information in the base layer.

ここで、前記第２復号化ユニットは、前記音声向上階層での符号化情報に含まれている固定コードブック利得値間の差の量子化情報をデコードする利得値差の復号化部と、前記音質向上階層での符号化情報に含まれている固定コードブックインデックスをデコードする第２固定コードブック復号化部と、を含む。 Here, the second decoding unit includes a gain value difference decoding unit for decoding quantization information of a difference between fixed codebook gain values included in encoding information in the speech enhancement layer; A second fixed codebook decoding unit that decodes a fixed codebook index included in the encoded information in the sound quality enhancement layer.

ここで、前記演算ユニットは、前記利得値復号化部から出力される前記デコードされた固定コードブック利得値と前記利得値差の復号化部から出力される前記デコードされた利得値との差を加算する第１加算器と、前記音声信号復号化装置の動作条件によって前記利得値復号化部から出力される前記デコードされた固定コードブック利得値または前記第１加算器から出力される利得値を伝送する第１選択スイッチと、前記第２固定コードブック復号化部から出力されるデコードされた音質向上階層の固定コードブックと前記第１固定コードブック復号化部から出力されるデコードされた基本階層の固定コードブックとを加算する第２加算器と、前記音声信号復号化装置の動作条件によって前記第２加算器から出力される信号または前記第１固定コードブック復号化部から出力される前記デコードされた固定コードブックを伝送する第２選択スイッチと、前記適応コードブック復号化部から出力されるデコードされた適応コードブックと前記利得値復号化部から出力されるデコードされた適応コードブックとの利得値を乗算する第１乗算器と、前記第１選択スイッチから出力される信号と前記第２選択スイッチから出力される信号とを乗算する第２乗算器と、前記第１乗算岐路から出力される信号と前記第２乗算岐路から出力される信号とを加算する第３加算器と、を含む。 Here, the arithmetic unit calculates a difference between the decoded fixed codebook gain value output from the gain value decoding unit and the decoded gain value output from the gain value difference decoding unit. A first adder to be added, and the decoded fixed codebook gain value output from the gain value decoding unit or the gain value output from the first adder according to an operating condition of the audio signal decoding device. A first selection switch to be transmitted, a fixed codebook of a decoded sound quality enhancement layer output from the second fixed codebook decoding unit, and a decoded base layer output from the first fixed codebook decoding unit And a signal output from the second adder or the first fixed codebook depending on operating conditions of the audio signal decoding apparatus. A second selection switch for transmitting the decoded fixed codebook output from a codebook decoding unit; a decoded adaptive codebook output from the adaptive codebook decoding unit; and a gain value decoding unit A first multiplier that multiplies a gain value of the decoded adaptive codebook that is output; a second multiplication that multiplies the signal output from the first selection switch and the signal output from the second selection switch; And a third adder for adding the signal output from the first multiplication branch and the signal output from the second multiplication branch.

ここで、前記音声信号復元ユニットは、前記線形予測符号化係数を利用して前記第３加算器から出力される信号を合成する合成フィルタと、前記線形予測符号化係数と前記合成フィルタから出力される信号とを利用して前記復元された音声信号を得るための後処理ユニットと、を含む。 Here, the speech signal restoration unit outputs a synthesis filter that synthesizes a signal output from the third adder using the linear prediction coding coefficient, and is output from the linear prediction coding coefficient and the synthesis filter. And a post-processing unit for obtaining the reconstructed audio signal using a signal.

ここで、前記第２復号化ユニットは、前記音声向上階層での符号化情報に含まれている固定コードブック利得値間の差の量子化情報をデコードする利得値差の復号化部と、前記音質向上階層での符号化情報に含まれている固定コードブックインデックスをデコードする固定コードブック復号化部と、を含む。 Here, the second decoding unit includes a gain value difference decoding unit for decoding quantization information of a difference between fixed codebook gain values included in encoding information in the speech enhancement layer; A fixed codebook decoding unit that decodes a fixed codebook index included in the encoding information in the sound quality enhancement layer.

また、基本階層で入力された音声信号の線形予測符号化係数を抽出し、固定コードブック探索及び適応コードブック探索により前記入力された音声信号に対応する励起信号を生成する段階と、少なくとも１つの音質向上階層で、前記基本階層で検出された前記固定コードブック探索のために必要な対象信号及びインパルス応答信号間の相関度ｄ（ｎ）と、パルスの符号及び前記相関度ｄ（ｎ）により定義される、前記相関度ｄ（ｎ）のサイズ情報に該当する相関度Ｃと、インパルス応答信号のエネルギーＥと、を含み、前記基本階層で前記固定コードブックの探索によって生成された媒介変数に基づいて、各パルス、各パルスの符号及び各パルスの位置を対応付けた代数コードブックを探索することで固定コードブックを探索する段階と、前記基本階層と前記音質向上階層とにより生成される信号を多重化する段階と、を含む音声信号符号化方法を提供する。 Extracting a linear predictive coding coefficient of the speech signal input in the base layer and generating an excitation signal corresponding to the input speech signal by fixed codebook search and adaptive codebook search; and at least one In the sound quality enhancement layer, the correlation d (n) between the target signal and the impulse response signal required for the fixed codebook search detected in the basic layer, the code of the pulse, and the correlation d (n) A parameter C defined by the size information of the correlation d (n) and an energy E of an impulse response signal defined in the basic variable and generated by searching the fixed codebook based on each pulse, the steps of searching fixed codebook by searching the algebraic codebook which associates code and the position of each pulse of each pulse, before Providing an audio signal encoding method comprising the steps of multiplexing the signals generated by said speech quality enhancement layer and base layer, an.

ここで、前記音質向上階層処理段階は複数段階で行われることを特徴とする。 Here, the sound quality improvement hierarchy processing step is performed in a plurality of steps.

ここで、前記音質向上階層処理段階は、前記基本階層処理段階で前記固定コードブック探索により得られた固定コードブック利得値と前記音質向上階層の固定コードブック探索により得られた利得値間の差値を量子化することを特徴とする。 Here, the sound quality enhancement layer processing step includes a difference between the fixed codebook gain value obtained by the fixed codebook search in the basic layer processing step and the gain value obtained by the fixed codebook search of the sound quality improvement layer. It is characterized by quantizing the value.

ここで、前記媒介変数は前記基本階層処理段階で生成された対象信号とインパルス応答間の第１相関度、前記第１相関度の大きさに対応する第２相関度及びインパルス応答信号のエネルギーを含むことを特徴とする。 Here, the parameter includes the first correlation between the target signal generated in the base layer processing step and the impulse response, the second correlation corresponding to the magnitude of the first correlation, and the energy of the impulse response signal. It is characterized by including.

ここで、前記いずれかに記載の音声信号符号化方法により、基本階層と少なくとも１つの音質向上階層とに符号化された音声信号を復号化するための音声信号復号化方法において、前記符号化された音声信号を復号化する段階と、前記復号化段階で復号化された基本階層に対するコードブックと音質向上階層に対するコードブックとを前記音声信号符号化の動作条件によって選択的に伝送する段階と、前記選択的に伝送されるコードブックと前記復号化段階で復号化された線形予測係数とを合成して復元された音声信号を生成する段階と、を含む。
ここで、前記復号化段階は、前記符号化された音声信号を基本階層に関する符号化情報と音質向上階層に関する符号化情報とにデマルチプレクスし、デマルチプレクスされた符号化情報をデコードすることを特徴とする。
ここで、前記音声信号復号化方法は、前記復号化段階でデコードされた基本階層での固定コードブックの利得値とデコードされた音質向上階層で含まれる固定コードブックの利得値間の差を加算して前記音質向上階層での固定コードブックの利得値を復元する段階をさらに含むことを特徴とする。
Here, in the audio signal decoding method for decoding the audio signal encoded into the base layer and at least one sound quality improvement layer by any of the audio signal encoding methods described above, the encoding is performed. Decoding the voice signal, and selectively transmitting the codebook for the base layer and the codebook for the sound quality enhancement layer decoded in the decoding step according to the operation condition of the voice signal encoding, Synthesizing the selectively transmitted codebook and the linear prediction coefficient decoded in the decoding step to generate a reconstructed speech signal.
Here, in the decoding step, the encoded audio signal is demultiplexed into encoded information related to a base layer and encoded information related to a sound quality improvement layer, and the demultiplexed encoded information is decoded. It is characterized by.
Here, the audio signal decoding method adds a difference between a fixed codebook gain value in the base layer decoded in the decoding step and a fixed codebook gain value included in the decoded sound quality enhancement layer. The method further comprises the step of restoring the gain value of the fixed codebook in the sound quality enhancement layer.

本発明によれば、既存の標準化されたＣＬＥＰ音声符号化構造を変更せずにビット率が拡張できる構造を提示することによって、既存の標準化されたＣＬＥＰ音声符号化装置を具備したシステムと互換可能である。 According to the present invention, by presenting a structure in which the bit rate can be expanded without changing an existing standardized CLEP speech coding structure, it is compatible with a system equipped with an existing standardized CLEP speech coding apparatus. It is.

また、前述した本願発明の一実施例によれば、基本階層の固定コードブック探索対象信号と音質向上階層の固定コードブック探索対象信号とを同様にすることによって、音質向上階層で探索されたコードブックは次のフレームのために保存されない、そのため、基本階層の動作に影響を与えない。 In addition, according to the above-described embodiment of the present invention, the code searched for in the sound quality improvement layer is obtained by making the fixed codebook search target signal in the base layer similar to the fixed codebook search target signal in the sound quality improvement layer. The book is not saved for the next frame, so it does not affect the operation of the base hierarchy.

そして、音質向上階層の固定コードブック探索時に、基本階層の固定コードブック探索時に求めた媒介変数値を使用することによって、音質向上階層の固定コードブック探索に要求される演算量を減らせる。 Then, the amount of calculation required for the fixed codebook search in the sound quality improvement layer can be reduced by using the parameter value obtained in the fixed codebook search in the basic layer during the fixed codebook search in the sound quality improvement layer.

また、本願発明の他の実施例によれば、音質向上階層の固定コードブック探索のために要求される対象信号は、基本階層の固定コードブック対象信号から基本階層の固定コードブック寄与度と、音質向上階層の合成フィルタを通じて提供される以前の音質向上階層の固定コードブックの合成信号とを除去することによって、音質向上階層専用の対象信号を利用した固定コードブック探索が行われることによってさらに正確な固定コードブック探索が期待できる。 Further, according to another embodiment of the present invention, the target signal required for the fixed codebook search of the sound quality improvement layer, the fixed codebook contribution of the basic layer from the fixed codebook target signal of the basic layer, More accurate by performing a fixed codebook search using the target signal dedicated to the sound quality improvement layer by removing the synthesized signal of the fixed codebook of the previous sound quality improvement layer provided through the synthesis filter of the sound quality improvement layer Can be expected.

さらに、音質向上階層で探索されたパルスの位置と基本階層で探索されたパルスの位置とが同じになって、代数コードブックのパルスが同じ大きさを有する限界点を克服し、最終固定コードブックのパルスが多重大きさを有するので、復元される音声信号の音質が改善できる。 Furthermore, the position of the pulse searched for in the sound quality enhancement layer and the position of the pulse searched for in the basic layer become the same, overcoming the limit point where the pulses of the algebraic codebook have the same magnitude, and the final fixed codebook Therefore, the sound quality of the restored audio signal can be improved.

そして、音質向上階層の利得値は基本階層の量子化された利得値と音質向上階層の利得値間の差を量子化して相対的に動的範囲の小さな利得値差を量子化した値を伝送することによって、音質向上階層で利得値量子化に必要なビットが節約できる。 Then, the gain value of the sound quality improvement layer is a value obtained by quantizing the difference between the quantized gain value of the base layer and the gain value of the sound quality improvement layer and quantizing the gain value difference having a relatively small dynamic range. By doing so, it is possible to save bits necessary for gain value quantization in the sound quality enhancement layer.

以下、図面を参照して本発明の望ましい実施例を説明することによって、本発明を詳細に説明する。各図面に提示された同じ参照符号は同じ部材を示す。 Hereinafter, the present invention will be described in detail by describing preferred embodiments of the present invention with reference to the drawings. The same reference numerals provided in each drawing denote the same members.

図１は、本発明の望ましい一実施例によるビット率拡張音声符号化装置の機能ブロック図である。図１を参照すれば、前記ビット率拡張音声符号化装置は、基本階層１００と音質向上階層１３０とを含む多層固定コードブック構造を有する。 FIG. 1 is a functional block diagram of a bit rate extended speech encoding apparatus according to a preferred embodiment of the present invention. Referring to FIG. 1, the bit rate extended speech encoding apparatus has a multi-layer fixed codebook structure including a basic layer 100 and a sound quality improvement layer 130.

基本階層１００では、最小限の音質が復元できる符号化情報が生成される。基本階層１００は既存の標準化されたＣＥＬＰ音声符号化器の構成と類似している。したがって、基本階層１００は、入力音声信号を線形予測符号化によりフィルタリングして入力音声信号に対応する励起信号を生成する。 In the basic layer 100, encoded information that can restore the minimum sound quality is generated. The base layer 100 is similar to the configuration of existing standardized CELP speech encoders. Therefore, the base layer 100 generates an excitation signal corresponding to the input speech signal by filtering the input speech signal by linear predictive coding.

基本階層１００は前処理ユニット１０２、ＬＰＣ係数の抽出及びベクトル量子化器１０４、合成フィルタ１０６、減算器１０８、認知加重フィルタ１１０、ピッチ分析部１１２、ピッチ寄与度の除去部１１５、固定コードブック探索部１１７、固定コードブック１１９、第１乗算器１２１、加算器１２３、適応コードブック１２４、第２乗算器１２６、利得値量子化器１２９で構成される。 The basic layer 100 includes a preprocessing unit 102, an LPC coefficient extraction and vector quantizer 104, a synthesis filter 106, a subtractor 108, a cognitive weighting filter 110, a pitch analysis unit 112, a pitch contribution removal unit 115, and a fixed codebook search. A unit 117, a fixed codebook 119, a first multiplier 121, an adder 123, an adaptive codebook 124, a second multiplier 126, and a gain value quantizer 129.

前処理ユニット１０２は、ライン１０１を通じ入力される音声信号からＤＣ成分を除去する。すなわち、前処理ユニット１０２は、ハイパスフィルタを使用して入力音声信号をフィルタリングして入力音声信号の低周波帯域のノイズ成分を除去する。使われたハイパスフィルタＨ_ｈ１（ｎ）は数式（２）のような伝達関数を有する。 The preprocessing unit 102 removes the DC component from the audio signal input through the line 101. That is, the preprocessing unit 102 filters the input audio signal using a high-pass filter and removes noise components in the low frequency band of the input audio signal. The high-pass filter H _h1 (n) used has a transfer function as shown in Equation (2).

前処理ユニット１０２から出力される信号は、ライン１０３を通じてＬＰＣ係数の抽出及びベクトル量子化器１０４に伝送される。

The signal output from the preprocessing unit 102 is transmitted to the LPC coefficient extraction and vector quantizer 104 through a line 103.

ＬＰＣ係数の抽出及びベクトル量子化器１０４は、前記前処理ユニット１０２から出力される信号のＬＰＣ係数を抽出する。抽出されたＬＰＣ係数は、ＬＰＣ係数の抽出及びベクトル量子化器１０４によりベクトル量子化される。ＬＰＣ係数のベクトル量子化情報はライン１０５を通じて合成フィルタ１０６と多重化器１４０とに伝送される。 The LPC coefficient extraction and vector quantizer 104 extracts the LPC coefficient of the signal output from the preprocessing unit 102. The extracted LPC coefficients are LPC coefficient extraction and vector quantized by the vector quantizer 104. The vector quantization information of the LPC coefficient is transmitted to the synthesis filter 106 and the multiplexer 140 through the line 105.

合成フィルタ１０６は、前記ＬＰＣ係数のベクトル量子化情報を利用して、ライン１２８を通じて入力される励起信号に対応する合成された信号を出力する。前記合成された信号はライン１０７を通じて減算器１０８に出力される。 The synthesis filter 106 outputs a synthesized signal corresponding to the excitation signal inputted through the line 128 using the vector quantization information of the LPC coefficient. The synthesized signal is output to the subtracter 108 through the line 107.

減算器１０８は、ライン１０３を通じて入力される前処理ユニット１０２から出力される信号からライン１０７を通じて入力される合成された信号を減算して差信号を生成する。前記差信号はライン１０９を通じて認知加重フィルタ１１０に伝送される。 The subtracter 108 subtracts the synthesized signal input through the line 107 from the signal output from the preprocessing unit 102 input through the line 103 to generate a difference signal. The difference signal is transmitted to the perceptual weighting filter 110 via line 109.

認知加重フィルタ１１０は、人体聴覚構造のマスキング効果を利用するために、量子化雑音をマスキング臨界値以下に保持する。したがって、認知加重フィルタ１１０は前記差信号の量子化雑音が最小化されるように加重値を含む信号をピッチ分析部１１２に出力する。 The cognitive weighting filter 110 keeps the quantization noise below the masking critical value in order to use the masking effect of the human auditory structure. Accordingly, the cognitive weighting filter 110 outputs a signal including a weight value to the pitch analysis unit 112 so that the quantization noise of the difference signal is minimized.

ピッチ分析部１１２は認知加重フィルタ１１０から出力される信号に対して開回路ピッチと閉回路ピッチとを探索する。すなわち、ピッチ分析部１１２は認知加重フィルタ１１０から出力される信号を複数のサブフレームに分け、前記各サブフレームのピッチを分析して適応コードブックのインデックスと利得値とを出力する。前記適応コードブックのインデックスはライン１１３を通じてピッチ寄与度の除去部１１５と適応コードブック１２４とに伝送されながらライン１１４を通じて多重化器１４０に伝送される。また、前記適応コードブックの利得値は利得値量子化器１２９に提供される。 The pitch analysis unit 112 searches for an open circuit pitch and a closed circuit pitch for the signal output from the cognitive weighting filter 110. That is, the pitch analysis unit 112 divides the signal output from the cognitive weighting filter 110 into a plurality of subframes, analyzes the pitch of each subframe, and outputs the index and gain value of the adaptive codebook. The index of the adaptive codebook is transmitted to the multiplexer 140 through the line 114 while being transmitted to the pitch contribution removing unit 115 and the adaptive codebook 124 through the line 113. The gain value of the adaptive codebook is provided to a gain value quantizer 129.

ピッチ寄与度の除去部１１５は、前記適応コードブック１２４のインデックスに基づいて、認知加重フィルタ１１０の出力信号から固定コードブック探索のために必要な対象信号（または、ターゲットベクトル）を検出する。そして、ピッチ寄与度の除去部１１５は、ライン１１１でピッチ寄与度ｙ_１（ｎ）を減算して固定コードブック探索対象信号を、ライン１１６を通じて基本階層１００の固定コードブック探索部１１７と音質向上階層１３０の固定コードブック探索部１３１とに出力する。ピッチ寄与度ｙ_１（ｎ）は数式（３）によって求められる。 The pitch contribution removing unit 115 detects a target signal (or target vector) necessary for fixed codebook search from the output signal of the cognitive weighting filter 110 based on the index of the adaptive codebook 124. Then, the pitch contribution removal unit 115 subtracts the pitch contribution y ₁ (n) from the line 111 to obtain the fixed codebook search target signal, and the line 116 to the fixed codebook search unit 117 of the base layer 100 to improve the sound quality. The data is output to the fixed codebook search unit 131 in the hierarchy 130. The pitch contribution y ₁ (n) is obtained by Expression (3).

数式（３）でＡＣ_Ｇ（ｎ）は適応コードブック利得値が乗算された値である。

In Equation (3), AC _G (n) is a value obtained by multiplying the adaptive codebook gain value.

固定コードブック探索部１１７は、ライン１１１を通じて入力された対象信号ｘ’（ｎ）を使用して対象信号とインパルス応答ｈ（ｎ）との相関度ｄ（ｎ）を求める。 The fixed codebook search unit 117 uses the target signal x ′ (n) input through the line 111 to determine the degree of correlation d (n) between the target signal and the impulse response h (n).

例えば、副フレームのサイズが４０サンプルであり、各階層のパルス数が４つと仮定すれば、前記相関度ｄ（ｎ）は数式（４）のように定義されうる。 For example, assuming that the size of the subframe is 40 samples and the number of pulses in each layer is four, the correlation d (n) can be defined as Equation (4).

数式（４）で、ｈ（ｉ−ｎ）はインパルス応答であり、ｘ’（ｎ）は対象信号である。

In Expression (4), h (i−n) is an impulse response, and x ′ (n) is a target signal.

前記インパルス応答ｈ（ｎ）と相関度ｄ（ｎ）とは、ライン１１８’を通じて音質向上階層１３０の固定コードブック探索部１３１に提供される。 The impulse response h (n) and the correlation d (n) are provided to the fixed codebook search unit 131 of the sound quality enhancement layer 130 through a line 118 '.

前記固定コードブック探索部１１７は、前記インパルス応答ｈ（ｎ）と前記相関度ｄ（ｎ）とに基づいて、表１に示されたように、代数コードブック構造を有する固定されたコードブックを探索する。 Based on the impulse response h (n) and the correlation d (n), the fixed codebook search unit 117 searches for a fixed codebook having an algebraic codebook structure as shown in Table 1. Explore.

表１を参考すれば、固定コードブック探索部１１７で固定コードブックベクトルは４個の位置でのみそのパルスのサイズが０でない。したがって、前記パルスの符号ｓと相関度ｄ（ｎ）とを利用して相関度Ｃは数式（５）のように定義されうる。固定コードブック探索部１１７は、数式（５）により相関度Ｃを検出する。

Referring to Table 1, the fixed codebook search unit 117 has a fixed codebook vector whose pulse size is not 0 only at four positions. Accordingly, the correlation degree C can be defined as Equation (5) using the pulse code s and the correlation degree d (n). Fixed codebook search unit 117 detects correlation degree C using equation (5).

数式（５）で、ｍ_ｉはｉ番目パルスの位置を表し、ｓ_ｉはｉ番目パルスの符号を表す。

In Equation (5), _{m i} denotes the position of i-th pulse, _{s i} represents the sign of the i-th pulse.

固定コードブック検出部１１７は、合成フィルタ１０６のインパルス応答ｈ（ｎ）のエネルギーＥを数式（６）により検出する。 The fixed codebook detection unit 117 detects the energy E of the impulse response h (n) of the synthesis filter 106 using Equation (6).

数式（６）で、Φ（ｍ_ｉ、ｍ_ｊ）はｉ番目パルスの位置及びｊ番目パルスの位置に対するインパルス応答信号ｈ（ｎ）間の相関度であり、ｓ_ｉはｉ番目パルスの符号であり、ｓ_ｊはｊ番目パルスの符号である。

In Equation (6), Φ (m _i , m _j ) is the degree of correlation between the position of the i th pulse and the impulse response signal h (n) with respect to the position of the j th pulse, and s _i is the sign of the i th pulse. Yes, s _j is the sign of the jth pulse.

前記固定コードブック探索部１１７は、前記相関度Ｃとインパルス応答ｈ（ｎ）のエネルギーＥとを保存する。相関度Ｃは、符号ｓｉｇｎ［ｄ（ｉ）］とその絶対値とに分けられて保存される。ｓｉｇｎ［ｄ（ｉ）］はｄ（ｉ）の符号である。前記エネルギーＥは数式（７）のような形態に保存される。 The fixed codebook search unit 117 stores the correlation C and the energy E of the impulse response h (n). Correlation degree C is divided and stored in code sign [d (i)] and its absolute value. sign [d (i)] is the sign of d (i). The energy E is stored in a form such as Equation (7).

エネルギーＥに対する数式（６）は、数式（８）のように再定義されうる。

Equation (6) for energy E can be redefined as equation (8).

固定コードブック探索部１１７は、前記検出された相関度ＣとエネルギーＥとをライン１１８”を通じて、音質向上階層１３０の固定コードブック探索部１３１に提供しながら、検出された相関度ＣとエネルギーＥとを利用して固定コードブックを探索する。前記固定コードブック探索により、固定コードブックインデックスと利得値とが得られれば、固定コードブック探索部１１７は、前記固定コードブックインデックスを固定コードブック１１９と多重化器１４０とに伝送し、前記利得値を利得値量子化器１２９に伝送する。

The fixed codebook search unit 117 provides the detected correlation C and energy E to the fixed codebook search unit 131 of the sound quality enhancement layer 130 through the line 118 ″ while providing the detected correlation C and energy E. If a fixed codebook index and a gain value are obtained by the fixed codebook search, the fixed codebook search unit 117 searches the fixed codebook 119 for the fixed codebook index. Are transmitted to the multiplexer 140, and the gain value is transmitted to the gain value quantizer 129.

固定コードブック１１９は、ライン１１８を通じて入力されたインデックスに基づいて、基本階層１００の固定コードブックベクトルを出力する。固定コードブック１１９から出力される固定コードブックベクトルは、ライン１２０を通じて第１乗算器１２１に提供される。 The fixed codebook 119 outputs a fixed codebook vector of the base layer 100 based on the index input through the line 118. The fixed codebook vector output from the fixed codebook 119 is provided to the first multiplier 121 through the line 120.

第１乗算器１２１は、利得値量子化器１２９で提供される前記固定コードブックの利得値に対する量子化利得値Ｇ_ｃを前記固定コードブックベクトルに乗算し、その結果をライン１２２を通じて出力する。ライン１２２を通じて出力される信号は、固定コードブックのベクトルである。前記量子化利得値Ｇ_ｃは、利得値量子化器１２９から提供される。 The first multiplier 121, a quantized gain value G _c is multiplied by the fixed codebook vector for said gain value of the fixed codebook provided in gain value quantizer 129, and outputs the result via a line 122. The signal output through line 122 is a fixed codebook vector. The quantization gain value G _c is provided from a gain value quantizer 129.

ライン１１３を通じて適応コードブックインデックスが印加されれば、適応コードブック１２４は、前記適応コードブックインデックスに対応するパルスの位置情報と符号情報とを出力する。ライン１２５を通じて出力される適応コードブックベクトルは、第２乗算器１２６に提供される。 If the adaptive codebook index is applied through the line 113, the adaptive codebook 124 outputs pulse position information and code information corresponding to the adaptive codebook index. The adaptive codebook vector output through line 125 is provided to the second multiplier 126.

第２乗算器１２６は、適応コードブックの利得値に対する量子化された利得値Ｇ_ｐを前記ライン１２５を通じて伝送される適応コードブックベクトルに乗算し、その結果をライン１２７を通じて出力する。前記ライン１２７を通じて出力される信号は、利得値Ｇ_ｐが乗算された適応コードブックのベクトルである。前記量子化された利得値Ｇ_ｐは、利得値量子化器１２９から提供される。 The second multiplier 126, the adaptive code gain value G _p quantized relative gain values of the book by multiplying the adaptive codebook vector transmitted via the line 125, and outputs the result via a line 127. Signal output through the line 127 is a vector of adaptive codebook gain value G _p is multiplied. The quantized gain value G _p is provided from a gain value quantizer 129.

加算器１２３は、ライン１２２を通じて入力される量子化された利得値Ｇ_ｃと固定コードブックベクトルとを乗算して得た信号と、ライン１２７を通じて入力される量子化された利得値Ｇ_ｐと適応コードブックベクトルとを乗算して得た信号とを加算して励起信号を得る。前記励起信号は、ライン１２８を通じて合成フィルタ１０６に出力される。 The adder 123, the adaptive signal obtained by multiplying the fixed codebook vector with quantized gain value G _c is input via a line 122, a quantized gain value G _p is input via the line 127 An excitation signal is obtained by adding the signal obtained by multiplying the codebook vector. The excitation signal is output to the synthesis filter 106 through the line 128.

利得値量子化器１２９は、固定コードブック探索部１１７から出力される固定コードブックの利得値とピッチ分析部１１２から出力される適応コードブックの利得値とをそれぞれ量子化する。前記固定コードブックの利得値を量子化した利得値Ｇ_ｃは、第１乗算器１２１に出力され、適応コードブックの利得値を量子化した利得値Ｇ_ｐは、第２乗算器１２６に出力される。前記量子化した利得値Ｇ_ｃは、音質向上階層１３０に含まれている利得値差の量子化器１３４にも提供される。 The gain value quantizer 129 quantizes the gain value of the fixed codebook output from the fixed codebook search unit 117 and the gain value of the adaptive codebook output from the pitch analysis unit 112. Gain value G _c of the gain value obtained by quantizing the fixed codebook is output to the first multiplier 121, a gain value G _p the gain value obtained by quantizing the adaptive codebook is output to the second multiplier 126 The The quantized gain value G _c is also provided to a gain value difference quantizer 134 included in the sound quality enhancement layer 130.

音質向上階層１３０は、復元される音質を向上させるために基本階層１００で提供されるビット以外に追加的なビットをさらに提供するためのものである。例えば、基本階層１００が、８ｋｂｐｓのビット率を提供する時に、音質向上階層１３０が４ｋｂｐｓの追加ビット率が提供できる。図１は説明の便宜上、１つの音声向上階層１３０が基本階層１００に連結された構成を示したが、複数の音声向上階層が基本階層１００に連結されうる。 The sound quality enhancement layer 130 is for providing additional bits in addition to the bits provided in the basic layer 100 in order to improve the restored sound quality. For example, when the base layer 100 provides a bit rate of 8 kbps, the sound quality enhancement layer 130 can provide an additional bit rate of 4 kbps. Although FIG. 1 illustrates a configuration in which one voice enhancement layer 130 is connected to the basic layer 100 for convenience of explanation, a plurality of voice enhancement layers may be connected to the basic layer 100.

音質向上階層１３０は、固定コードブック探索部１３１と利得値差の量子化器１３４とで構成される。固定コードブック探索部１３１は、ライン１１８’を通じて提供されるインパルス応答信号ｈ（ｎ）、対象信号とインパルス応答信号ｈ（ｎ）との相関度であるｄ（ｎ）、パルスの符号と前記相関度であるｄ（ｎ）とを利用して検出されたｄ（ｎ）のサイズ情報に該当される相関度Ｃ及びインパルス応答信号ｈ（ｎ）のエネルギーＥを利用して固定コードブックを探索する。 The sound quality enhancement layer 130 includes a fixed codebook search unit 131 and a gain value difference quantizer 134. The fixed codebook search unit 131 includes an impulse response signal h (n) provided through the line 118 ′, a correlation degree d (n) between the target signal and the impulse response signal h (n), a pulse code, and the correlation. The fixed codebook is searched using the degree of correlation C and the energy E of the impulse response signal h (n) corresponding to the size information of d (n) detected using the degree d (n). .

このように固定コードブック探索部１３１は、固定コードブック探索部１１７で探索された対象信号と同じ対象信号とに対する固定コードブック探索を行う。固定コードブック探索部１３１は、代数コードブックを使用する。固定コードブック探索部１３１は、対象信号（ターゲットベクトル）のＭＳＥ（ＭｅａｎＳｑｕａｒｅＥｒｒｏｒ）を最小化し、数式（９）を最大化するベクトルｃ_ｋを見つける。見つけられたベクトルｃ_ｋが固定コードブックベクトルとなる。 In this way, the fixed codebook search unit 131 performs a fixed codebook search for the same target signal as the target signal searched by the fixed codebook search unit 117. The fixed codebook search unit 131 uses an algebraic codebook. The fixed codebook search unit 131 finds a vector _ck that minimizes the MSE (Mean Square Error) of the target signal (target vector) and maximizes Equation (9). The found vector _ck becomes a fixed codebook vector.

数式（９）でΦはインパルス応答ｈ（ｎ）間の相関度を表す。前記ｄ（ｎ）とΦとは基本階層１００で提供する値を利用する。前記Φは、固定コードブック探索部１１７から提供される。したがって、固定コードブック探索部１３１は固定コードブック探索時に必要な演算量を減らせる。

In Equation (9), Φ represents the degree of correlation between impulse responses h (n). As the d (n) and Φ, values provided in the basic layer 100 are used. The Φ is provided from the fixed codebook search unit 117. Therefore, the fixed codebook search unit 131 can reduce the amount of calculation required for the fixed codebook search.

基本階層１００の固定コードブックベクトルの次数が４０であり、基本階層１００及び音質向上階層１３０で大きさが０でないパルスをそれぞれ４つ探すと仮定すれば、基本階層１００の固定コードブック１１７でまず４個のパルスを探し、音質向上階層１３０の固定コードブック探索部１３１で４個のパルスを探す。したがって、固定コードブック探索部１３１は基本階層１００で探した４個のパルスの影響も考慮する。したがって、固定コードブック探索部１３１で得られる相関度Ｃ’は数式（１０）のように定義でき、エネルギーＥ’は数式（１１）のように定義されうる。 Assuming that the order of the fixed codebook vector of the basic layer 100 is 40, and that four non-zero magnitude pulses are to be searched for in the basic layer 100 and the sound quality enhancement layer 130, respectively, the fixed codebook 117 of the basic layer 100 starts with Four pulses are searched, and the fixed codebook search unit 131 of the sound quality improvement layer 130 searches for four pulses. Therefore, the fixed codebook search unit 131 also considers the influence of the four pulses searched for in the basic hierarchy 100. Therefore, the correlation degree C ′ obtained by the fixed codebook search unit 131 can be defined as Equation (10), and the energy E ′ can be defined as Equation (11).

数式（５）に定義された相関度Ｃ値を利用して、前記数式（１０）は、数式（１２）のように再定義されうる。

Using the correlation degree C value defined in Equation (5), Equation (10) can be redefined as Equation (12).

固定コードブック探索部１３１は、探索過程の複雑度を減らすためにエネルギーＥ’を数式（１３）のように再定義された演算により検出できる。

The fixed codebook search unit 131 can detect the energy E ′ by a redefined calculation as in Equation (13) in order to reduce the complexity of the search process.

数式（１３）は、数式（８）に定義されているエネルギーＥを利用すれば、数式（１４）のように再定義されうる。

Equation (13) can be redefined as Equation (14) by using the energy E defined in Equation (8).

相関度Ｃ’とエネルギーＥ’とは、音質向上階層１３０での固定コードブック探索以前に保存されて、固定コードブック探索過程を簡素化できる。

The correlation degree C ′ and the energy E ′ are stored before the fixed codebook search in the sound quality enhancement layer 130, and the fixed codebook search process can be simplified.

前述した相関度Ｃ’、エネルギーＥ’を利用して、音質向上階層１３０のパルスの符号情報と位置情報とを得るための固定コードブック探索部１３１の過程は、基本階層１００の固定コードブック探索部１１７で行われる方式と同一に行われる。この時、基本階層１００で探索されたパルスの位置情報と音質向上階層で探索されたパルスの位置情報とは同一でありうる。 The process of the fixed codebook search unit 131 for obtaining the code information and the position information of the pulse of the sound quality enhancement layer 130 using the correlation degree C ′ and the energy E ′ described above is the fixed codebook search of the basic layer 100. This is performed in the same manner as that performed by the unit 117. At this time, the pulse position information searched in the basic layer 100 and the pulse position information searched in the sound quality improvement layer may be the same.

図２は、図１のビット率拡張音声符号化装置において、固定コードブック探索部１１７により探索されたパルスの位置と固定コードブック探索部１３１により探索されたパルスの位置とを説明するための図面である。 FIG. 2 is a diagram for explaining the pulse positions searched by the fixed codebook search unit 117 and the pulse positions searched by the fixed codebook search unit 131 in the bit rate extended speech encoding apparatus of FIG. It is.

図２を参照すれば、固定コードブック探索２０１で探索されたパルスの位置は、音質向上階層固定コードブック探索２０２で探索されたパルスの位置と同じでありうる。したがって、最終固定コードブックのパルスの大きさは基本階層１００及び音質向上階層１３０の固定コードブックパルスの大きさを含む多重された大きさ（以下、多重大きさという）を有する。したがって、代数コードブックのパルスの大きさは＋１または−１のみの大きさではない。 Referring to FIG. 2, the position of the pulse searched in the fixed codebook search 201 may be the same as the position of the pulse searched in the sound quality improvement hierarchical fixed codebook search 202. Therefore, the pulse size of the final fixed codebook has a multiplexed size (hereinafter referred to as a multiplexed size) including the sizes of the fixed codebook pulses of the basic layer 100 and the sound quality enhancement layer 130. Therefore, the pulse size of the algebraic codebook is not only +1 or -1.

固定コードブック探索部１３１は、探索結果によって得られた固定コードブックベクトルを多重化器１４０に提供し、固定コードブックの利得値を利得値差の量子化器１３４に提供する。前記音質向上階層１３０での前記固定コードブックインデックスは、パルス符号情報とパルスの位置情報とで構成されうる。 The fixed codebook search unit 131 provides the fixed codebook vector obtained from the search result to the multiplexer 140, and provides the gain value of the fixed codebook to the gain value difference quantizer 134. The fixed codebook index in the sound quality enhancement layer 130 may be composed of pulse code information and pulse position information.

このように音質向上階層１３０で探索された固定コードブックインデックスは、次のフレームのために保存されていないので、基本階層１００の動作に影響を与えない。 Since the fixed codebook index searched in the sound quality enhancement layer 130 is not stored for the next frame, the operation of the basic layer 100 is not affected.

利得値差の量子化器１３４は、固定コードブック探索部１３１で求めた固定コードブックの利得値１３２と基本階層１００で量子化された固定コードブックの利得値Ｇ_ｃ間の差を求め、前記差を量子化する。これによって、利得値差の量子化情報Ｇ_ｄｉｆｆが利得値差の量子化器１３４からライン１３５を通じて多重化器１４０に伝送されるので、音質向上階層１３０は、固定コードブックの利得値に対する量子化ビットを減らせる。 Quantizer 134 of the gain value difference, determining a difference between the gain value G _c of the quantized fixed codebook gain value 132 and base layer 100 of the fixed codebook obtained by the fixed codebook searching unit 131, the Quantize the difference. As a result, the gain value difference quantization information G _diff is transmitted from the gain value difference quantizer 134 to the multiplexer 140 through the line 135, so that the sound quality enhancement layer 130 quantizes the gain value of the fixed codebook. Bits can be reduced.

多重化器１４０は、基本階層１００から提供されるＬＰＣ係数量子化情報、固定コードブックインデックス、適応コードブックインデックス、利得値量子化情報と音質向上階層１３０とから提供される音質向上階層の固定コードブックインデックス、利得値差の量子化情報をビットストリームに出力する。 The multiplexer 140 is a fixed code of the sound quality improvement layer provided from the LPC coefficient quantization information provided from the base layer 100, a fixed codebook index, an adaptive codebook index, gain value quantization information, and the sound quality improvement layer 130. The quantization information of the book index and gain value difference is output to the bit stream.

基本階層１００と音質向上階層１３０とのビットストリームは区分して伝送する。すなわち、図１に示されたように、音質向上階層１３０のビットストリームは、基本階層１００のビットストリーム後に伝送される。これによって前記ビットストリームは、ネットワークトラフィック状態によって、復号化装置に必要なビット率で容易に分離されうる。例えば、復号化装置側のチャンネル特性が劣悪で基本階層のビットストリームのみが受信できる場合に、前記復号化装置は図１のビット率拡張音声符号化装置が送出するビットストリームのうち基本階層のビットストリームのみが受信ができる。 The bit streams of the basic layer 100 and the sound quality improvement layer 130 are transmitted separately. That is, as shown in FIG. 1, the bit stream of the sound quality enhancement layer 130 is transmitted after the bit stream of the base layer 100. Accordingly, the bit stream can be easily separated at a bit rate required for the decoding apparatus according to network traffic conditions. For example, when the channel characteristics on the decoding device side are poor and only the base layer bit stream can be received, the decoding device uses the bits of the base layer among the bit streams transmitted by the bit rate extended speech encoding device of FIG. Only the stream can be received.

図３は、本発明の望ましい一実施例によるビット率拡張音声復号化装置のブロック図である。 FIG. 3 is a block diagram of a bit rate extended speech decoding apparatus according to an embodiment of the present invention.

図３を参照すれば、前記ビット率拡張音声復号化装置は、逆多重化器３０１、ＬＰＣ係数復号化部３０２、利得値復号化部３０３、第１固定コードブック復号化部３０４、適応コードブック復号化部３０５、利得値差の復号化部３０６、第２固定コードブック復号化部３０７、第１加算器３０８、第２加算器３０９、第１選択スイッチ３１０、第２選択スイッチ３１１、第１乗算器３１２、第２乗算器３１３、第３加算器３１４、合成フィルタ３１５、及び後処理部３１６で構成される。 Referring to FIG. 3, the bit rate extended speech decoding apparatus includes a demultiplexer 301, an LPC coefficient decoding unit 302, a gain value decoding unit 303, a first fixed codebook decoding unit 304, an adaptive codebook. Decoding unit 305, gain value difference decoding unit 306, second fixed codebook decoding unit 307, first adder 308, second adder 309, first selection switch 310, second selection switch 311, first It includes a multiplier 312, a second multiplier 313, a third adder 314, a synthesis filter 315, and a post-processing unit 316.

前記ビット率拡張音声復号化装置は、ビット率拡張音声符号化装置から伝送されるビットストリームを選択的に受信できる。すなわち、ビットストリームから基本階層に対するビットストリームのみ受信すれば、基本階層の音質が復元でき、基本階層及び音質向上階層に対するビットストリームを何れも受信すれば、さらに向上した音質が提供できる。 The bit rate extended speech decoding apparatus can selectively receive a bit stream transmitted from the bit rate extended speech encoding apparatus. That is, if only the bitstream for the base layer is received from the bitstream, the sound quality of the base layer can be restored, and if both the bitstreams for the base layer and the sound quality improvement layer are received, further improved sound quality can be provided.

逆多重化器３０１は、受信されるビットストリームを各モジュールの情報に逆多重化して出力する。すなわち、逆多重化器３０１は、ＬＰＣ係数量子化情報をＬＰＣ係数復号化部３０２に、利得値量子化情報は利得値復号化部３０３に、利得値差の量子化情報は利得値差の復号化部３０６に、音質向上階層の固定コードブックインデックスは第２固定コードブック復号化部３０７に、固定コードブックインデックスは第１固定コードブック復号化部３０４に、適応コードブックインデックスは適応コードブック復号化部３０５にそれぞれ提供する。 The demultiplexer 301 demultiplexes the received bit stream into the information of each module and outputs it. That is, the demultiplexer 301 decodes LPC coefficient quantization information to the LPC coefficient decoding unit 302, gain value quantization information to the gain value decoding unit 303, and gain value difference quantization information to decode the gain value difference. In the second codec decoding unit 307, the fixed codebook index in the first fixed codebook decoding unit 304, and the adaptive codebook index in the adaptive codebook decoding. Provided to the conversion unit 305.

ＬＰＣ係数復号化部３０２の構造は、符号化装置側のＬＰＣ係数の抽出及びベクトル量子化器１０４により決定され、入力されるＬＰＣ係数量子化情報からＬＰＣ係数を復元する。復元されたＬＰＣ係数は合成フィルタ３１５と後処理部３１６とに提供される。 The structure of the LPC coefficient decoding unit 302 is determined by the LPC coefficient extraction and vector quantizer 104 on the encoding device side, and restores the LPC coefficient from the input LPC coefficient quantization information. The restored LPC coefficient is provided to the synthesis filter 315 and the post-processing unit 316.

利得値復号化部３０３の構造は、符号化装置側の利得値量子化器１２９により決定される。利得値復号化部３０３、は入力される利得値量子化情報をデコードする。前記利得値量子化情報は、適応コードブック利得値と固定コードブック利得値とを含む。したがって、利得値復号化部３０３から基本階層１００での適応コードブック利得値ｇ_ｐと固定コードブック利得値ｇ_ｃとがそれぞれ出力される。 The structure of gain value decoding section 303 is determined by gain value quantizer 129 on the encoding device side. The gain value decoding unit 303 decodes input gain value quantization information. The gain value quantization information includes an adaptive codebook gain value and a fixed codebook gain value. Thus, the adaptive codebook gain value g _p at the base layer 100 and the fixed codebook gain value g _c from the gain value decoding unit 303 are output.

第１固定コードブック復号化部３０４は、入力される基本階層１００の固定コードブックインデックスをデコードして基本階層１００の固定コードブックを出力する。固定コードブック復号方式は、符号化装置の固定コードブック探索部１１７での探索方式により決定される。適応コードブック復号化部３０５は、入力される適応コードブックインデックスをデコードして基本階層１００の適応コードブックを出力する。 The first fixed codebook decoding unit 304 decodes the input fixed codebook index of the basic layer 100 and outputs the fixed codebook of the basic layer 100. The fixed codebook decoding method is determined by the search method in fixed codebook search unit 117 of the encoding device. The adaptive codebook decoding unit 305 decodes the input adaptive codebook index and outputs the adaptive codebook of the base layer 100.

前述ＰＣ係数復号化部３０２、利得値復号化部３０３、第１固定コードブック復号化部３０４、及び適応コードブック復号化部３０５は、逆多重化器３０１から伝送される基本階層１００での符号化情報をデコードする第１復号化ユニットと定義されうる。 The PC coefficient decoding unit 302, the gain value decoding unit 303, the first fixed codebook decoding unit 304, and the adaptive codebook decoding unit 305 are the codes in the base layer 100 transmitted from the demultiplexer 301. It may be defined as a first decoding unit that decodes the encoded information.

利得値差の復号化部３０６と第２固定コードブック復号化部３０７との動作はネットワークトラフィック状態や受信端末の処理容量に依存する。 The operations of the gain value difference decoding unit 306 and the second fixed codebook decoding unit 307 depend on the network traffic state and the processing capacity of the receiving terminal.

もし、利得値差の復号化部３０６と第２固定コードブック復号化部３０７とが動作されと決定されれば、利得値差の復号化部３０６は、入力される利得値差の量子化情報をデコードする。第２固定コードブック復号化部３０７は、入力される音質向上階層の固定コードブックインデックスをデコードする。利得値差復号化方式は、符号化装置側の利得値差の量子化器１３４により決定される。第２固定コードブック復号化部３０７でのデコード方式は、符号化装置側の第２固定コードブック探索部１３１により決定される。 If it is determined that the gain value difference decoding unit 306 and the second fixed codebook decoding unit 307 are operated, the gain value difference decoding unit 306 may input the gain value difference quantization information. Decode. The second fixed codebook decoding unit 307 decodes the input fixed codebook index of the sound quality improvement layer. The gain value difference decoding method is determined by the gain value difference quantizer 134 on the encoding device side. The decoding method in the second fixed codebook decoding unit 307 is determined by the second fixed codebook search unit 131 on the encoding device side.

利得値差の復号化部３０６と第２固定コードブック復号化部３０７とは、逆多重化器３０１から伝送される音質向上階層１３０での符号化情報をデコードする第２復号化ユニットと見なされうる。 The gain value difference decoding unit 306 and the second fixed codebook decoding unit 307 are regarded as a second decoding unit for decoding the encoded information in the sound quality enhancement layer 130 transmitted from the demultiplexer 301. sell.

第１加算器３０８は、利得値復号化部３０３から出力されるデコードされた固定コードブックの利得値ｇ_ｃと利得値差の復号化部３０６から出力されるデコードされた利得値差ｇ_ｄｉｆｆとを加算する。第１加算器３０８の出力は、復号化時に音質向上階層の利得値である。 The first adder 308 includes a decoded fixed codebook gain value g _c output from the gain value decoding unit 303 and a decoded gain value difference g _diff output from the gain value difference decoding unit 306. Is added. The output of the first adder 308 is the gain value of the sound quality enhancement layer at the time of decoding.

第２加算器３０９は、第２固定コードブック復号化部３０７でデコードされた音質向上階層１３０の固定コードブックと、第１固定コードブック復号化部３０４でデコードされた基本階層１００の固定コードブックとを加算する。したがって、第２加算器３０９から出力される信号は数式（１５）のように定義できる。 The second adder 309 includes a fixed codebook of the sound quality enhancement layer 130 decoded by the second fixed codebook decoding unit 307 and a fixed codebook of the basic layer 100 decoded by the first fixed codebook decoding unit 304. And add. Therefore, the signal output from the second adder 309 can be defined as Equation (15).

数式（１５）で、ｃ（ｎ）は基本階層での固定コードブックであり、ｃ'（ｎ）は音質向上階層での固定コードブックである。

In Equation (15), c (n) is a fixed codebook in the basic layer, and c ′ (n) is a fixed codebook in the sound quality improvement layer.

これによって、復号化装置での固定コードブックパルスは、基本階層と音質向上階層との代数コードブックを累積させて、多重大きさを有する代数コードブックパルス構造を有する。前記代数コードブックを累積させることは、あらゆるパルスの大きさが同じ大きさを有する既存の固定コードブック構造で発生する短所を補完するためのものである。したがって、累積させた代数コードブックのパルスは対象信号に適した符号を有する。 Accordingly, the fixed codebook pulse in the decoding apparatus has an algebraic codebook pulse structure having a multiple size by accumulating the algebraic codebooks of the basic layer and the sound quality enhancement layer. Accumulating the algebraic codebook is intended to compensate for the disadvantages that occur in existing fixed codebook structures where every pulse has the same magnitude. Therefore, the accumulated pulse of the algebraic codebook has a code suitable for the target signal.

第１選択スイッチ３１０は、利得値復号化部３０３でデコードされた固定コードブック利得値ｇ_ｃと第１加算器３０８から出力される信号とを選択的に伝送する。すなわち、復号化装置が基本階層に動作すれば、第１選択スイッチ３１０は利得値復号化部３０３から出力される固定コードブック利得値ｇ_ｃを伝送し、該当される復号化装置が音質向上階層に動作すれば、第１選択スイッチ３１０は加算器３０８から出力される利得値を伝送する。 The first selection switch 310 selectively transmits the fixed codebook gain value g _c decoded by the gain value decoding unit 303 and the signal output from the first adder 308. That is, if the decoding apparatus operates in the base layer, the first selection switch 310 transmits the fixed codebook gain value g _c output from the gain value decoding unit 303, and the corresponding decoding apparatus performs the sound quality improvement layer. The first selection switch 310 transmits the gain value output from the adder 308.

第２選択スイッチ３１１は、第２加算器３０９から出力される信号と第１固定コードブック復号化部３０４から出力される基本階層１００での固定コードブックとを選択的に伝送する。すなわち、前記復号化装置が音質向上階層で動作しない場合に、第２選択スイッチ３１１は、第１固定コードブック復号化部３０４から出力される信号を伝送し、前記復号化装置が音質向上階層で動作する場合に、第２選択スイッチ３１１は第２加算器３０９から出力される信号を伝送する。 The second selection switch 311 selectively transmits the signal output from the second adder 309 and the fixed codebook in the base layer 100 output from the first fixed codebook decoding unit 304. That is, when the decoding device does not operate in the sound quality enhancement layer, the second selection switch 311 transmits the signal output from the first fixed codebook decoding unit 304, and the decoding device is in the sound quality enhancement layer. In operation, the second selection switch 311 transmits a signal output from the second adder 309.

第１乗算器３１２は、第２選択スイッチ３１１から出力される固定コードブックに第１選択スイッチ３１０から出力される利得値を乗算して出力する。 The first multiplier 312 multiplies the fixed codebook output from the second selection switch 311 by the gain value output from the first selection switch 310 and outputs the result.

第２乗算器３１３は、適応コードブック復号化部３０５から出力されるデコードされた適応コードブックに利得値復号化部３０３から出力される適応コードブックの利得値ｇ_ｐを乗算して出力する。 The second multiplier 313, and outputs the multiplied gain value g _p adaptive codebook output from the gain value decoding unit 303 to the decoded adaptive codebook output from the adaptive codebook decoding unit 305.

第３加算器３１４は、第１乗算器３１２から出力される固定コードブックに関する情報と第２乗算器３１３から出力される適応コードブックに関する情報とを加算して復元された励起信号を発生する。 The third adder 314 adds the information related to the fixed codebook output from the first multiplier 312 and the information related to the adaptive codebook output from the second multiplier 313 to generate a restored excitation signal.

前述した第１加算器３０８、第２加算器３０９、第３加算器３１４、第１乗算器３１２、第２乗算器３１３、第１選択スイッチ３１０及び第２選択スイッチ３１１は、前述した第１復号化ユニットと第２復号化ユニットとでそれぞれデコードされた信号を前記復号化装置の動作環境によって演算する演算ユニットと定義されうる。 The first adder 308, the second adder 309, the third adder 314, the first multiplier 312, the second multiplier 313, the first selection switch 310, and the second selection switch 311 described above are the first decoding described above. The decoding unit and the second decoding unit may be defined as a calculation unit that calculates the signals decoded by the decoding device according to the operating environment of the decoding device.

合成フィルタ３１５は、ＬＰＣ係数復号化部３０２から提供される復元されたＬＰＣ係数を利用して、加算器３１４から提供される励起信号を合成して音声信号を復元する。 The synthesis filter 315 uses the restored LPC coefficient provided from the LPC coefficient decoding unit 302 to synthesize the excitation signal provided from the adder 314 and restore the speech signal.

後処理部３１６は、合成フィルタ３１５から伝送される音声信号の音質を向上させる。すなわち、音声信号の音質を向上させるために、後処理部３１６は、ＬＰＣ係数復号化部３０２から提供されるＬＰＣ係数を利用して、合成フィルタ３１５から出力される信号をフィルタリングするためのハイパスフィルタを使用する。 The post-processing unit 316 improves the sound quality of the audio signal transmitted from the synthesis filter 315. That is, in order to improve the sound quality of the audio signal, the post-processing unit 316 uses the LPC coefficient provided from the LPC coefficient decoding unit 302 to filter the signal output from the synthesis filter 315. Is used.

前述した合成フィルタ３１５及び後処理部３１６は、前記演算ユニットから出力される信号を、ＬＰＣ係数復号化部３０２から出力されるＬＰＣ係数と合成して音声信号を復元する復元ユニットと定義されうる。 The synthesis filter 315 and the post-processing unit 316 described above may be defined as a restoration unit that synthesizes the signal output from the arithmetic unit with the LPC coefficient output from the LPC coefficient decoding unit 302 to restore the speech signal.

図４は、本発明の一実施例によるビット率拡張音声符号化方法のフローチャートである。 FIG. 4 is a flowchart of a bit rate extended speech encoding method according to an embodiment of the present invention.

第４０１段階で、音声信号の符号化装置は、図１の前処理ユニット１０２のように入力された音声信号を前処理する。第４０２段階で、音声信号の符号化装置は、前処理された音声信号からＬＰＣ係数を抽出し、抽出されたＬＰＣ係数の量子化情報を生成する。 In operation 401, the audio signal encoding apparatus preprocesses an input audio signal as in the preprocessing unit 102 of FIG. In operation 402, the speech signal encoding apparatus extracts LPC coefficients from the preprocessed speech signal, and generates quantization information of the extracted LPC coefficients.

第４０３段階で、音声信号の符号化装置は、生成されたＬＰＣ係数の量子化情報を利用して励起信号を図１の合成フィルタ１０６でのように合成する。第４０４段階で、音声信号の符号化装置は、前記前処理された信号から前記合成された信号を減算してＬＰＣ残差信号を検出する。第４０５段階で、音声信号の符号化装置は、検出されたＬＰＣ残差信号を図１の認知加重フィルタ１１０でのようにフィルタリングして認知加重された信号を出力する。 In operation 403, the speech signal encoding apparatus uses the generated LPC coefficient quantization information to synthesize the excitation signal as in the synthesis filter 106 of FIG. In operation 404, the speech signal encoding apparatus detects an LPC residual signal by subtracting the synthesized signal from the preprocessed signal. In operation 405, the speech signal encoding apparatus filters the detected LPC residual signal as in the perceptual weighting filter 110 of FIG. 1 and outputs a perceptual weighted signal.

第４０６段階で、音声信号の符号化装置は認知加重された信号のピッチを図１のピッチ分析部１１２のように分析して、適応コードブックのインデックスと利得値とを得る。そして、音声信号の符号化装置は、図１のピッチ寄与度の除去部１１５のように適応コードブックのインデックスに基づいて認知加重された信号からピッチ寄与度を除去して、固定コードブック探索のために必要な対象信号を検出する。 In operation 406, the speech signal encoding apparatus analyzes the perceptually weighted signal pitch as in the pitch analysis unit 112 of FIG. 1 to obtain an index and gain value of the adaptive codebook. Then, the speech signal encoding apparatus removes the pitch contribution from the signal weighted based on the index of the adaptive codebook as in the pitch contribution removal unit 115 in FIG. The target signal necessary for this is detected.

第４０７段階で、音声信号の符号化装置は、図１の第１固定コードブック探索部１１７でのように基本階層固定コードブックを探索して、固定コードブック利得値と固定コードブックインデックスとを生成する。第４０８段階で、音声信号の符号化装置は、図１の利得値量子化器１２９でのように前記検出された固定コードブック利得値と前記検出された適応コードブック利得値とを量子化する。 In operation 407, the speech signal encoding apparatus searches the base layer fixed codebook as in the first fixed codebook search unit 117 of FIG. 1 to obtain a fixed codebook gain value and a fixed codebook index. Generate. In operation 408, the speech signal encoding apparatus quantizes the detected fixed codebook gain value and the detected adaptive codebook gain value as in the gain value quantizer 129 of FIG. .

第４０９段階で、音声信号の符号化装置は、基本階層での相関度Ｃ及びｄ（ｎ）、エネルギーＥのような媒介変数を利用して、音質向上階層固定コードブックを探索する。音質向上階層固定コードブック探索により、音質向上階層の固定コードブックの利得値とインデックスとがそれぞれ生成される。 In operation 409, the speech signal encoding apparatus searches for a sound quality enhancement layer fixed codebook using parameters such as the correlation degree C and d (n) and energy E in the base layer. Through the sound quality improvement layer fixed codebook search, the gain value and the index of the fixed codebook of the sound quality improvement layer are respectively generated.

第４１０段階で、音声信号の符号化装置は、基本階層固定コードブックの利得値と音質向上階層の固定コードブックの利得値間の差を量子化する。前述した音質向上階層での固定コードブック探索及び利得値量子化過程は、図１で説明したように複数階層に分けて行われうる。音質向上階層の処理が複数階層に分けて行われれば、それほど復元される音声信号の質が向上されうる。 In operation 410, the speech signal encoding apparatus quantizes the difference between the gain value of the base layer fixed codebook and the gain value of the fixed codebook of the sound quality enhancement layer. The fixed codebook search and gain value quantization process in the sound quality enhancement layer may be performed in a plurality of layers as described with reference to FIG. If the processing of the sound quality improvement layer is divided into a plurality of layers, the quality of the restored audio signal can be improved.

第４１１段階で、音声信号の符号化装置は前述した段階を通じて得たＬＰＣ係数量子化情報、基本階層の固定コードブックインデックス、基本階層の適応コードブックインデックス、基本階層の固定コードブックの利得値、基本階層の適応コードブックの利得値、音質向上階層の固定コードブックインデックス及び前記利得値差の量子化情報をビットストリーム形態に多重化して音声信号復号化装置側に送出する。 In operation 411, the speech signal encoding apparatus obtains the LPC coefficient quantization information obtained through the above-described steps, the base layer fixed codebook index, the base layer adaptive codebook index, the base layer fixed codebook gain value, The base layer adaptive codebook gain value, the sound quality enhancement layer fixed codebook index, and the gain value difference quantization information are multiplexed into a bitstream format and transmitted to the audio signal decoding apparatus side.

図５は、本発明の望ましい一実施例によるビット率拡張音声復号化方法のフローチャートである。 FIG. 5 is a flowchart of a bit rate enhanced speech decoding method according to an embodiment of the present invention.

第５０１段階で、音声信号復号化装置は、図３の逆多重化器３０１のように受信されるビットストリームを各構成の情報に逆多重化する。 In operation 501, the audio signal decoding apparatus demultiplexes the received bitstream into information of each component like the demultiplexer 301 in FIG. 3.

第５０２段階で、音声信号復号化装置は、前記逆多重化された信号をデコードする。すなわち、図３のＬＰＣ係数復号化部３０２、利得値復号化部３０３、第１固定コードブック復号化部３０４、適応コードブック復号化部３０５、利得値差の復号化部３０６、第２固定コードブック復号化部３０７のように前記逆多重化された信号をデコードする。 In operation 502, the audio signal decoding apparatus decodes the demultiplexed signal. That is, LPC coefficient decoding section 302, gain value decoding section 303, first fixed codebook decoding section 304, adaptive codebook decoding section 305, gain value difference decoding section 306, second fixed code in FIG. The demultiplexed signal is decoded as in the book decoding unit 307.

第５０３段階で、音声信号復号化装置は、音質向上階層固定コードブック利得値を所定演算処理により復元する。前記音声信号復号化装置は、復号化された固定コードブック利得値と音質向上階層の固定コードブック利得値の量子化情報として受信された利得値差とを加算して、音質向上階層の固定コードブック利得値を復元する。 In operation 503, the speech signal decoding apparatus restores the sound quality enhancement layer fixed codebook gain value by a predetermined calculation process. The speech signal decoding apparatus adds the decoded fixed codebook gain value and the gain value difference received as quantization information of the fixed codebook gain value of the sound quality enhancement layer, and adds the fixed code of the sound quality enhancement layer. Restore the book gain value.

第５０４段階で、音声信号復号化装置は、音声信号復号化装置の動作条件によって音質向上階層の固定コードブックと基本階層の固定コードブックとを選択的に伝送し、利得値も選択的に伝送される。すなわち、音声信号復号化装置が音質向上階層で動作すれば、復元された音質向上階層の固定コードブックの利得値が乗算された音質向上階層の固定コードブックを伝送させる。一方、音声信号の符号化装置が音質向上階層で動作しなければ、復号化された基本階層の固定コードブックに基本階層の固定コードブックの利得値を乗算した固定コードブックを伝送させる。 In operation 504, the audio signal decoding apparatus selectively transmits the fixed codebook of the sound quality enhancement layer and the fixed codebook of the basic layer according to the operating conditions of the audio signal decoding apparatus, and also selectively transmits the gain value. Is done. That is, if the speech signal decoding apparatus operates in the sound quality enhancement layer, the fixed codebook in the sound quality enhancement layer multiplied by the gain value of the restored fixed codebook in the sound quality enhancement layer is transmitted. On the other hand, if the audio signal encoding apparatus does not operate in the sound quality enhancement layer, the fixed codebook obtained by multiplying the decoded code for the basic layer by the gain value of the fixed codebook for the basic layer is transmitted.

第５０５段階で、音声信号復号化装置は、第５０２段階で復号化されたＬＰＣ係数を利用して、第５０４段階で選択的に伝送された固定コードブックを合成する。 In operation 505, the speech signal decoding apparatus synthesizes the fixed codebook selectively transmitted in operation 504 using the LPC coefficients decoded in operation 502.

第５０６段階で、音声信号復号化装置は、後処理部３１６のように後処理して復元された音声信号を生成する。 In operation 506, the speech signal decoding apparatus generates a speech signal restored by post-processing as in the post-processing unit 316.

図６は、本発明の望ましい他の実施例によるビット率拡張音声符号化装置の機能ブロック図である。図６を参照すれば、前記ビット率拡張音声符号化装置は、基本階層６００と音質向上階層６３０とを含む多層固定コードブック構造を有する。 FIG. 6 is a functional block diagram of a bit rate extended speech encoding apparatus according to another embodiment of the present invention. Referring to FIG. 6, the bit rate extended speech encoding apparatus has a multi-layer fixed codebook structure including a basic layer 600 and a sound quality improvement layer 630.

基本階層６００では、最小限の音質が復元できる符号化情報が生成される。基本階層６００は、既存の標準化されたＣＥＬＰ音声符号化器の構成と類似している。したがって、基本階層６００は、入力音声信号を線形予測符号化によりフィルタリングし、前記フィルタリングされた音声信号に対応する励起信号を生成する。励起信号は、固定コードブック探索と適応コードブック探索とにより生成される。 In the basic layer 600, encoded information that can restore the minimum sound quality is generated. Base layer 600 is similar to the configuration of existing standardized CELP speech encoders. Accordingly, the base layer 600 filters the input speech signal by linear predictive coding, and generates an excitation signal corresponding to the filtered speech signal. The excitation signal is generated by a fixed codebook search and an adaptive codebook search.

基本階層６００は、前処理ユニット６０２、ＬＰＣ係数の抽出及びベクトル量子化器６０４、合成フィルタ６０６、減算器６０８、認知加重フィルタ６１０、ピッチ分析部６１２、ピッチ寄与度の除去部６１５、固定コードブック探索部６１７、固定コードブック６１９、第１乗算器６２１、加算器６２３、適応コードブック６２４、第２乗算器６２６、利得値量子化器６２９で構成される。 The basic layer 600 includes a preprocessing unit 602, LPC coefficient extraction and vector quantizer 604, synthesis filter 606, subtractor 608, cognitive weighting filter 610, pitch analysis unit 612, pitch contribution removal unit 615, fixed codebook. A search unit 617, a fixed codebook 619, a first multiplier 621, an adder 623, an adaptive codebook 624, a second multiplier 626, and a gain value quantizer 629 are configured.

前処理ユニット６０２は、ライン６０１を通じて入力される音声信号からＤＣ成分を除去する。すなわち、前処理ユニット６０２は、ハイパスフィルタを使用して入力音声信号をフィルタリングして入力音声信号の低周波帯域のノイズ成分を除去する。使われたハイパスフィルタは、本発明の一実施例で基本階層１００の前処理ユニット１０２のハイパスフィルタと同一である。前処理ユニット６０２から出力される信号は、ライン６０３を通じてＬＰＣ係数の抽出及びベクトル量子化器６０４に伝送される。 The preprocessing unit 602 removes the DC component from the audio signal input through the line 601. That is, the preprocessing unit 602 filters the input audio signal using a high-pass filter to remove a low frequency band noise component of the input audio signal. The high-pass filter used is the same as the high-pass filter of the preprocessing unit 102 of the basic hierarchy 100 in one embodiment of the present invention. The signal output from the preprocessing unit 602 is transmitted to the LPC coefficient extraction and vector quantizer 604 through a line 603.

ＬＰＣ係数の抽出及びベクトル量子化器６０４は、前記前処理ユニット６０２から出力される信号のＬＰＣ係数を抽出する。抽出されたＬＰＣ係数は、ＬＰＣ係数の抽出及びベクトル量子化器６０４によりベクトル量子化される。ＬＰＣ係数のベクトル量子化情報は、ライン６０５を通じて合成フィルタ６０６と多重化器６５０とに伝送される。 The LPC coefficient extraction and vector quantizer 604 extracts the LPC coefficient of the signal output from the preprocessing unit 602. The extracted LPC coefficient is vector quantized by the LPC coefficient extraction and vector quantizer 604. The vector quantization information of the LPC coefficient is transmitted to the synthesis filter 606 and the multiplexer 650 through the line 605.

合成フィルタ６０６は、前記ＬＰＣ係数のベクトル量子化情報を利用してライン６２８を通じて入力される励起信号に対応する合成された信号を出力する。前記合成された信号は、ライン６０７を通じて減算器６０８に出力される。 The synthesis filter 606 outputs a synthesized signal corresponding to the excitation signal inputted through the line 628 using the vector quantization information of the LPC coefficient. The synthesized signal is output to a subtracter 608 through a line 607.

減算器６０８は、ライン６０３を通じて入力される前処理ユニット６０２から出力される信号から、ライン６０７を通じて入力される合成された信号を減算してＬＰＣ残差信号を生成する。前記ＬＰＣ残差信号は、ライン６０９を通じて認知加重フィルタ６１０に伝送される。 The subtracter 608 generates an LPC residual signal by subtracting the synthesized signal input through the line 607 from the signal output from the preprocessing unit 602 input through the line 603. The LPC residual signal is transmitted to the cognitive weighting filter 610 through line 609.

認知加重フィルタ６１０は、人体聴覚構造のマスキング効果を利用するために量子化雑音をマスキング臨界値以下に保持する。したがって、認知加重フィルタ６１０は、前記ＬＰＣ残差信号の量子化雑音が最小化されるように加重値を含む信号をピッチ分析部６１２に出力する。 The cognitive weighting filter 610 keeps the quantization noise below the masking critical value in order to use the masking effect of the human auditory structure. Accordingly, the cognitive weighting filter 610 outputs a signal including a weight value to the pitch analysis unit 612 so that the quantization noise of the LPC residual signal is minimized.

ピッチ分析部６１２は、認知加重フィルタ６１０から出力される信号に対して開回路ピッチと閉回路ピッチとを探索する。すなわち、ピッチ分析部６１２は、認知加重フィルタ６１０から出力される信号を複数のサブフレームに分け、前記標準化されたＣＬＥＰ音声符号化装置でのように各サブフレームのピッチを分析して適応コードブックのインデックスと利得値とを出力する。 The pitch analysis unit 612 searches for an open circuit pitch and a closed circuit pitch for the signal output from the cognitive weighting filter 610. That is, the pitch analysis unit 612 divides the signal output from the cognitive weighting filter 610 into a plurality of subframes, analyzes the pitch of each subframe as in the standardized CLEP speech coding apparatus, and performs an adaptive codebook. Output the index and gain value.

前記適応コードブックのインデックスは、ライン６１３を通じてピッチ寄与度の除去部６１５と適応コードブック６２４とに伝送されながらライン６１４を通じて多重化器６５０に伝送される。また、前記適応コードブックの利得値は、利得値量子化器６２９に提供される。 The index of the adaptive codebook is transmitted to the multiplexer 650 through the line 614 while being transmitted to the pitch contribution removing unit 615 and the adaptive codebook 624 through the line 613. Also, the gain value of the adaptive codebook is provided to a gain value quantizer 629.

ピッチ寄与度の除去部６１５は、前記適応コードブックのインデックスに基づいて、認知加重フィルタ６１０の出力信号から固定コードブック探索のために必要な対象信号を出力する。そして、ピッチ寄与度の除去部６１５は、認知加重フィルタ６１０の出力信号からピッチ寄与度ｙ_１（ｎ）を減算して、固定コードブック探索のために必要な対象信号を、ライン６１６を通じて基本階層６００の固定コードブック探索部６１７に出力する。ピッチ寄与度ｙ_１（ｎ）は、数式（３）によって求められる。 The pitch contribution removal unit 615 outputs a target signal necessary for a fixed codebook search from the output signal of the cognitive weighting filter 610 based on the index of the adaptive codebook. Then, the pitch contribution removal unit 615 subtracts the pitch contribution y ₁ (n) from the output signal of the cognitive weighting filter 610 to obtain the target signal necessary for the fixed codebook search through the line 616. The result is output to a fixed codebook search unit 617 of 600. The pitch contribution y ₁ (n) is obtained by Expression (3).

固定コードブック探索部６１７は、ライン６１１を通じて入力された対象信号ｘ'（ｎ）を使用して対象信号とインパルス応答ｈ（ｎ）との相関度ｄ（ｎ）とを求める。 The fixed codebook search unit 617 obtains the degree of correlation d (n) between the target signal and the impulse response h (n) using the target signal x ′ (n) input through the line 611.

例えば、副フレームの大きさが４０サンプルであり、各階層のパルス数が４つと仮定すれば、前記相関度ｄ（ｎ）は数式（２）のように定義されうる。 For example, assuming that the size of the subframe is 40 samples and the number of pulses in each layer is four, the correlation d (n) can be defined as Equation (2).

前記固定コードブック探索部６１７は、前記インパルス応答ｈ（ｎ）と前記相関度ｄ（ｎ）とに基づいて、前記表１の例のように構成された代数コードブック形態の固定コードブックを探索する。表１を参考すれば、固定コードブック探索部６１７で固定コードブックベクトルのパルスの大きさは、４個の位置でのみ０でない。したがって、前記パルスの符号ｓと相関度ｄ（ｎ）とを利用した相関度ｄ（ｎ）の大きさに対応する相関度Ｃは数式（３）のように定義されうる。固定コードブック探索部６１７は、数式（３）により相関度Ｃを検出する。固定コードブック検出部６１７は、インパルス応答エネルギーＥを数式（４）により検出する。 The fixed codebook search unit 617 searches for a fixed codebook in the form of an algebraic codebook configured as in the example of Table 1 based on the impulse response h (n) and the correlation degree d (n). To do. Referring to Table 1, the pulse size of the fixed codebook vector in the fixed codebook search unit 617 is not 0 only at the four positions. Therefore, the correlation degree C corresponding to the magnitude of the correlation degree d (n) using the pulse code s and the correlation degree d (n) can be defined as Equation (3). Fixed codebook search unit 617 detects correlation degree C using equation (3). The fixed code book detection unit 617 detects the impulse response energy E by the mathematical formula (4).

前記固定コードブック探索部６１７は、前記相関度ＣとエネルギーＥとを保存する。固定コードブック探索部６１７は、相関度Ｃを符号ｓｉｇｎ［ｄ（ｉ）］とその絶対値とに分けられて保存する。ｓｉｇｎ［ｄ（ｉ）］はｄ（ｉ）の符号である。前記エネルギーＥは数式（５）のような形態に保存される。エネルギーＥに対する数式（４）は数式（６）のように再定義されうる。 The fixed codebook search unit 617 stores the correlation C and energy E. Fixed codebook search unit 617 stores correlation degree C by dividing it into code sign [d (i)] and its absolute value. sign [d (i)] is the sign of d (i). The energy E is stored in a form such as Equation (5). Equation (4) for energy E can be redefined as equation (6).

前記探索により、固定コードブックインデックスと利得値とが得られれば、固定コードブック探索部６１７は、前記固定コードブックインデックスを固定コードブック６１９と多重化器６５０とに伝送し、前記利得値を利得値量子化器６２９に伝送する。 If a fixed codebook index and a gain value are obtained by the search, the fixed codebook search unit 617 transmits the fixed codebook index to the fixed codebook 619 and the multiplexer 650, and the gain value is gained. Transmit to the value quantizer 629.

固定コードブック６１９は、ライン６１８を通じて入力されたインデックスに基づいて、基本階層６００の固定コードブックベクトルを出力する。固定コードブックベクトルは、パルス位置情報ｍと符号情報ｓとに基づいて構成される。固定コードブック６１９から出力される固定コードブックベクトルは、ライン６２０を通じて第１乗算器６２１に提供される。 Fixed codebook 619 outputs a fixed codebook vector of base layer 600 based on the index input through line 618. The fixed codebook vector is configured based on the pulse position information m and the code information s. The fixed codebook vector output from the fixed codebook 619 is provided to the first multiplier 621 through a line 620.

第１乗算器６２１は、利得値量子化器６２９から提供される前記固定コードブックの利得値に対する量子化利得値Ｇ_ｃを、前記固定コードブックベクトルに乗算し、その結果をライン６２２を通じて出力する。ライン６２２を通じて出力される信号は、基本階層６００の固定コードブックベクトルに量子化利得値Ｇ_ｃを乗算した固定コードブックｃ_Ｇ（ｎ）と定義できる。前記量子化利得値Ｇ_ｃは、利得値量子化器６２９から提供される。 The first multiplier 621 multiplies the fixed codebook vector by the quantized gain value G _c for the gain value of the fixed codebook provided from the gain value quantizer 629, and outputs the result through a line 622. . The signal output through the line 622 can be defined as a fixed codebook c _G (n) obtained by multiplying the fixed codebook vector of the base layer 600 by the quantization gain value G _c . The quantization gain value G _c is provided from a gain value quantizer 629.

ライン６１３を通じて適応コードブックインデックスが印加されれば、適応コードブック６２４は、前記適応コードブックインデックスに対応する適応コードブックベクトルを出力する。ライン６２５を通じて、前記適応コードブックベクトルは、第２乗算器６２６に提供される。 If an adaptive codebook index is applied through line 613, adaptive codebook 624 outputs an adaptive codebook vector corresponding to the adaptive codebook index. Through line 625, the adaptive codebook vector is provided to a second multiplier 626.

第２乗算器６２６は、適応コードブックの利得値に対する量子化された利得値Ｇ_ｐを、前記ライン６２５を通じて伝送される適応コードブックベクトルに乗算し、その結果をライン６２７を通じて出力する。前記量子化された利得値Ｇ_ｐは、利得値量子化器６２９から提供される。 The second multiplier 626, a quantized gain value G _p for gain value of the adaptive codebook, multiplies the adaptive codebook vector transmitted via the line 625, and outputs the result via a line 627. The quantized gain value G _p is provided from a gain value quantizer 629.

加算器６２３は、ライン６２２を通じて入力される固定コードブックベクトルとライン６２７を通じて入力される適応コードブックベクトルとを加算して励起信号を得る。前記励起信号は、ライン６２８を通じて合成フィルタ６０６に出力される。 The adder 623 adds the fixed codebook vector input through the line 622 and the adaptive codebook vector input through the line 627 to obtain an excitation signal. The excitation signal is output to the synthesis filter 606 through line 628.

利得値量子化器６２９は、固定コードブック探索部６１７から出力される固定コードブックの利得値とピッチ分析部６１２から出力される適応コードブックの利得値とをそれぞれ量子化する。前記固定コードブックの利得値に対応する量子化した利得値Ｇ_ｃは、第１乗算器６２１に出力され、適応コードブックの利得値に対応する量子化した利得値Ｇ_ｐは、第２乗算器６２６に出力される。前記量子化した利得値Ｇ_ｃは音質向上階層６３０に含まれている利得値差の量子化器６４３にも提供される。 The gain value quantizer 629 quantizes the gain value of the fixed codebook output from the fixed codebook search unit 617 and the gain value of the adaptive codebook output from the pitch analysis unit 612, respectively. Gain value G _c quantized corresponding to the gain value of the fixed codebook is output to the first multiplier 621, a gain value G _p quantized corresponding to the gain value of the adaptive codebook, second multiplier 626 is output. The quantized gain value G _c is also provided to a gain value difference quantizer 643 included in the sound quality enhancement layer 630.

音質向上階層６３０は、図１の音質向上階層１３０のように復元される音質を向上させるために、基本階層６００で提供されるビット以外に追加的なビットをさらに提供するためのものである。図６は、説明の便宜上、１つの音声向上階層６３０が基本階層６００に連結された構成を示したが、複数の音声向上階層が基本階層６００に連結されうる。 The sound quality improvement layer 630 is for providing additional bits in addition to the bits provided in the basic layer 600 in order to improve the sound quality restored as in the sound quality improvement layer 130 of FIG. Although FIG. 6 shows a configuration in which one voice enhancement layer 630 is connected to the basic layer 600 for convenience of explanation, a plurality of voice enhancement layers can be connected to the basic layer 600.

音質向上階層６３０は、固定コードブック寄与度計算部６３１、第３加算器６３３、合成フィルタ６３４、認知加重フィルタ６３７、固定コードブック探索部６３９、固定コードブック６４１、利得値差の量子化器６４３、及び第３乗算器６４４で構成される。 The sound quality improvement layer 630 includes a fixed codebook contribution calculation unit 631, a third adder 633, a synthesis filter 634, a cognitive weighting filter 637, a fixed codebook search unit 639, a fixed codebook 641, and a gain value difference quantizer 643. , And a third multiplier 644.

基本階層６００の第１乗算器６２１から、固定コードブックのベクトルに量子化利得値Ｇ_ｃが乗算された固定コードブックｃ_Ｇ（ｎ）が受信されれば、固定コードブック寄与度計算部６３１は、数式（１６）により固定コードブック寄与度ｙ_２（ｎ）を計算する。 If the fixed codebook c _G (n) obtained by multiplying the fixed codebook vector by the quantization gain value G _c is received from the first multiplier 621 of the base layer 600, the fixed codebook contribution calculating unit 631 , The fixed codebook contribution y ₂ (n) is calculated by Equation (16).

数式（１６）で、Ｎは副フレームを構成するサンプル数によって決定される。したがって、ピッチ寄与度の除去部６１５で説明したように、副フレームの大きさが４０サンプルである場合にＮは４０である。数式（１６）で、ｈ（ｎ）は合成フィルタのインパルス応答である。固定コードブック寄与度計算部６３１で計算された固定コードブック寄与度は、ライン６３２を通じて第３加算器６３３に提供される。

In Equation (16), N is determined by the number of samples constituting the subframe. Therefore, as described in the pitch contribution removal unit 615, N is 40 when the size of the subframe is 40 samples. In Expression (16), h (n) is an impulse response of the synthesis filter. The fixed codebook contribution calculated by the fixed codebook contribution calculator 631 is provided to the third adder 633 through a line 632.

第３加算器６３３は、ライン６１６を通じて提供される基本階層６００の固定コードブック探索のために要求される対象信号から、ライン６３２を通じて提供される固定コードブック寄与度とライン６３５を通じて合成フィルタ６３４から提供される合成信号とを除去した信号を出力する。 The third adder 633 generates the fixed codebook contribution provided through the line 632 and the synthesis filter 634 through the line 635 from the target signal required for the fixed codebook search of the base layer 600 provided through the line 616. A signal obtained by removing the provided composite signal is output.

合成フィルタ６３４は、音質向上階層６３０の量子化された利得値Ａによって、固定コードブックのベクトルを乗算させることにより得られた固定コードブックを、ライン６４７を通じて受信すれば、ＬＰＣ係数の抽出及びベクトル量子化器６０４で抽出され量子化されたＬＰＣ係数を使用して、前記入力される固定コードブック信号を合成することにより得られる信号を出力する。 When the synthesis filter 634 receives the fixed codebook obtained by multiplying the quantized gain value A of the sound quality enhancement layer 630 by the vector of the fixed codebook through the line 647, the synthesis filter 634 extracts the LPC coefficient and calculates the vector. Using the LPC coefficients extracted and quantized by the quantizer 604, a signal obtained by synthesizing the input fixed codebook signal is output.

ここで、利得値Ａは、次のように表される。 Here, the gain value A is expressed as follows.

認知加重フィルタ６３７は、ライン６３６を通じて入力される信号を、認知加重フィルタ６０１のようにフィルタリングして、音質向上階層６３０で固定コードブック探索のために要求される対象信号を出力する。対象信号は、ライン６３８を通じて固定コードブック探索部６３９に伝送される。

The cognitive weighting filter 637 filters the signal input through the line 636 as the cognitive weighting filter 601 and outputs the target signal required for the fixed codebook search in the sound quality enhancement layer 630. The target signal is transmitted to the fixed codebook search unit 639 through the line 638.

固定コードブック探索部６３９は、基本階層６００の固定コードブック探索部６１７のように入力される対象信号に基づいて固定コードブックを探索して固定コードブックのインデックスと利得値とを得る。得られた固定コードブックのインデックスは、ライン６４０を通じて多重化器６５０に伝送されながら固定コードブック６４１に伝送される。前記固定コードブックの利得値Ｇ_ＣＥは、ライン６４２を通じて利得値差の量子化器６４３に伝送される。 The fixed codebook search unit 639 searches the fixed codebook based on the input target signal like the fixed codebook search unit 617 of the base layer 600 to obtain the fixed codebook index and gain value. The obtained fixed codebook index is transmitted to the fixed codebook 641 while being transmitted to the multiplexer 650 through the line 640. The gain value G _CE of the fixed codebook is transmitted to a gain value difference quantizer 643 through a line 642.

固定コードブック６４１は、入力された固定コードブックインデックスに基づいて音質向上階層６３０の固定コードブックベクトルを出力する。固定コードブックベクトルはパルスの位置情報ｍと符号情報ｓとを含みうる。固定コードブック６４１から出力される固定コードブックベクトルは、第３乗算器６４４に提供される。基本階層６００の固定コードブック６１９から出力される固定コードブックベクトルのパルスの位置と、音質向上階層６３０の固定コードブック６４１から出力される固定コードブックベクトルのパルスの位置とは同一でありうる。 Fixed codebook 641 outputs a fixed codebook vector of sound quality enhancement layer 630 based on the input fixed codebook index. The fixed codebook vector may include pulse position information m and code information s. The fixed codebook vector output from the fixed codebook 641 is provided to the third multiplier 644. The position of the pulse of the fixed codebook vector output from the fixed codebook 619 of the base layer 600 and the position of the pulse of the fixed codebook vector output from the fixed codebook 641 of the sound quality enhancement layer 630 may be the same.

利得値差の量子化器６４３は、基本階層６００の利得値量子化器６２９から出力される固定コードブックの利得値を量子化した利得値Ｇ_Ｃと、音質向上階層６３０の固定コードブック探索部６３９から出力される固定コードブックの量子化されていない利得値Ｇ_ＣＥとの間のログスケール差値を利用して、音質向上階層６３０の量子化された利得値Ａを得、量子化された利得値Ａを出力する。 Quantizer gain difference 643, a gain value G _C a gain value of the fixed codebook output from the gain value quantizer 629 of the base layer 600 by quantizing the fixed codebook searching unit of the speech quality enhancement layer 630 The quantized gain value A of the sound quality enhancement layer 630 is obtained by using the log scale difference value between the non-quantized gain value G _CE of the fixed codebook output from 639 and quantized. A gain value A is output.

図７は、利得値差の量子化器６４３の望ましい実施例を表したブロック図である。利得値量子化器６４３は、第１ログスケール変換部７０２、第２ログスケール変換部７０６、第４及び第５乗算器７０８、７１１及び第４加算器７０４を含む。 FIG. 7 is a block diagram illustrating a preferred embodiment of the quantizer 643 for gain value difference. The gain value quantizer 643 includes a first log scale converter 702, a second log scale converter 706, fourth and fifth multipliers 708 and 711, and a fourth adder 704.

基本階層６００の利得値量子化器６２９により提供される量子化された固定コードブック利得値Ｇ_Ｃが、ライン７０１を通じて入力されれば、第１ログスケール変換部７０２は、固定コードブック利得値Ｇ_ｃに対応するログスケール変換された固定コードブック利得値をライン７０３を通じて出力する。 Fixed codebook gain value G _C quantized provided by the gain value quantizer 629 of the base layer 600, when receiving via the line 701, the first log scale converter 702, a fixed codebook gain value G _The log scale converted fixed codebook gain value corresponding to _c is output through line 703.

音質向上階層６３０の固定コードブック探索部６３９から出力される量子化されていない利得値Ｇ_ＣＥが、ライン７０５を通じて入力されれば、第２ロードスケール変換部７０６によってログスケール変換された固定コードブック利得値を、ライン７０７を通じて出力する。 If the unquantized gain value G _CE output from the fixed codebook search unit 639 of the sound quality enhancement layer 630 is input through the line 705, the fixed codebook log-scale converted by the second load scale conversion unit 706 The gain value is output through line 707.

第４乗算器７０８は、ライン７０７を通じて入力されるログスケール変換された固定コードブック利得値に利得値差調整値Ｂを乗算し、乗算された結果をライン７０８を通じて出力する。 The fourth multiplier 708 multiplies the log scale converted fixed codebook gain value input through the line 707 by the gain value difference adjustment value B and outputs the multiplied result through the line 708.

ここで、利得値差調整値Ｂは、次のように表される。 Here, the gain value difference adjustment value B is expressed as follows.

第４加算器７０４は、ライン７０３を通じて入力される固定コードブック利得値とライン７０８を通じて入力される固定コードブック利得値間の差値を、ライン７１０を通じて出力する。

The fourth adder 704 outputs a difference value between the fixed codebook gain value input through the line 703 and the fixed codebook gain value input through the line 708 through the line 710.

第５乗算器７１１は、入力される利得値差にスケール拡張要素１０を乗算して、ログスケール利得値差Ｇ_ＤＩＦＦ７１２を生成する。 The fifth multiplier 711 generates the log scale gain value difference G _DIFF 712 by multiplying the input gain value difference by the scale expansion element 10.

前述した利得値差量子化６４３の動作過程は、数式（１９）のように定義できる。 The operation process of the gain value difference quantization 643 described above can be defined as Equation (19).

数式（１９）でＧ_ｃは、利得値量子化器６２９によって量子化された固定コードブックの利得値であり、Ｇ_ＣＥは固定コードブック探索部６３９から出力される量子化されていない利得値である。また、利得値差調整値Ｂは、ログスケール利得値間の差値の動的範囲を最小にする調整値である。利得値差調整値Ｂは、音声符号化器の種類によっていかなる値にもなることができ、実際の例として０.９８７が使われる。

In Equation (19), G _c is the gain value of the fixed codebook quantized by the gain value quantizer 629, and G _CE is the unquantized gain value output from the fixed codebook search unit 639. is there. The gain value difference adjustment value B is an adjustment value that minimizes the dynamic range of the difference value between the log scale gain values. The gain value difference adjustment value B can be any value depending on the type of speech encoder, and 0.987 is used as an actual example.

数式（１９）での計算を通じて生成されたログスケール利得値差７１２は、アナログ信号であるので、３ビットスカラー量子化器によって量子化される。３ビットスカラー量子化器により量子化された結果を利用して音質向上階層６３０の量子化された固定コードブック利得値Ａを出力する。前記量子化された利得値Ａは、ライン６４５を通じて第３乗算器６４４に出力されながらライン６４６を通じて多重化器６５０に出力される。 Since the log scale gain value difference 712 generated through the calculation of Equation (19) is an analog signal, it is quantized by a 3-bit scalar quantizer. Using the result quantized by the 3-bit scalar quantizer, the quantized fixed codebook gain value A of the sound quality enhancement layer 630 is output. The quantized gain value A is output to the multiplexer 650 through the line 646 while being output to the third multiplier 644 through the line 645.

第３乗算器６４４は、固定コードブック６４１から提供される固定コードブックベクトルに、利得値差の量子化器６４３から提供される音質向上階層６３０の量子化された固定コードブック利得値Ａを乗算し、乗算結果をライン６４７を合成フィルタ６３４に提供する。 The third multiplier 644 multiplies the fixed codebook vector provided from the fixed codebook 641 by the quantized fixed codebook gain value A of the sound quality enhancement layer 630 provided from the gain value difference quantizer 643. Then, the multiplication result is provided to the synthesis filter 634 as a line 647.

多重化器６５０は、基本階層６００から提供されるＬＰＣ係数量子化情報、固定コードブックインデックス、適応コードブックインデックス、利得値量子化情報と音質向上階層６３０から提供される音質向上階層の固定コードブックインデックス、利得値差の量子化情報を多重化し、その結果をビットストリームに出力する。 The multiplexer 650 is a fixed codebook of the sound quality improvement layer provided from the LPC coefficient quantization information, the fixed codebook index, the adaptive codebook index, the gain value quantization information and the sound quality improvement layer 630 provided from the base layer 600. The quantization information of the index and gain value difference is multiplexed, and the result is output to the bit stream.

基本階層６００と音質向上階層６３０とのビットストリームは、区分して伝送される。すなわち、図６に示されたように、音質向上階層６３０のビットストリームは、基本階層６００のビットストリーム後に伝送される。これによって前記ビットストリームは、ネットワークトラフィック状態によって復号化装置に必要なビット率で容易に分離されうる。例えば、復号化装置側のチャンネル特性が劣悪で、基本階層のビットストリームのみ受信できる場合に、前記復号化装置は、図６のビット率拡張音声符号化装置が送出するビットストリームのうち基本階層のビットストリームのみ受信できる。 The bit streams of the basic layer 600 and the sound quality improvement layer 630 are transmitted separately. That is, as shown in FIG. 6, the bit stream of the sound quality enhancement layer 630 is transmitted after the bit stream of the base layer 600. Accordingly, the bit stream can be easily separated at a bit rate required for the decoding apparatus according to network traffic conditions. For example, when the channel characteristics on the decoding device side are poor and only the base layer bit stream can be received, the decoding device transmits the base layer of the bit stream transmitted by the bit rate extended speech encoding device of FIG. Only bitstreams can be received.

図８は、本発明の望ましい他の実施例によるビット率拡張音声復号化装置のブロック図である。図８を参照すれば、前記ビット率拡張音声復号化装置は逆多重化器８０２、ＬＰＣ係数復号化部８０３、利得値復号化部８０４、第１固定コードブック復号化部８０５、適応コードブック復号化部８０６、利得値差の復号化部８０７、第２固定コードブック復号化部８０８、乗算器８０９、８１０、８１３、加算器８１１、８１４、選択スイッチ８１２、合成フィルタ８１５、及び後処理部８１６を含む。 FIG. 8 is a block diagram of a bit rate extended speech decoding apparatus according to another embodiment of the present invention. Referring to FIG. 8, the bit rate extended speech decoding apparatus includes a demultiplexer 802, an LPC coefficient decoding unit 803, a gain value decoding unit 804, a first fixed codebook decoding unit 805, an adaptive codebook decoding. 806, gain value difference decoding unit 807, second fixed codebook decoding unit 808, multipliers 809, 810, 813, adders 811, 814, selection switch 812, synthesis filter 815, and post-processing unit 816 including.

前記ビット率拡張音声復号化装置はビット率拡張音声符号化装置から伝送されるビットストリームを選択的に受信できる。すなわち、ビットストリームのうち基本階層に対するビットストリームのみ受信すれば、基本階層の音質が復元でき、基本階層及び音質向上階層に対するビットストリームを共に受信すれば、さらに向上した音質が提供できる。 The bit rate extended speech decoding apparatus can selectively receive a bit stream transmitted from the bit rate extended speech encoding apparatus. That is, if only the bitstream for the base layer is received, the sound quality of the base layer can be restored, and if both the bitstreams for the base layer and the sound quality improvement layer are received, further improved sound quality can be provided.

逆多重化器８０２は、受信されるビットストリーム８０１を各構成要素の情報に逆多重化して出力する。すなわち、逆多重化器８０２は、ＬＰＣ係数量子化情報をＬＰＣ係数復号化部８０３に、利得値量子化情報は利得値復号化部８０４に、利得値差の量子化情報は利得値差の復号化部８０７に、音質向上階層６３０の固定コードブックインデックスは第２固定コードブック復号化部８０８に、基本階層６００の固定コードブックインデックスは第１固定コードブック復号化部８０５に、適応コードブックインデックスは適応コードブック復号化部８０６にそれぞれ提供する。 The demultiplexer 802 demultiplexes the received bit stream 801 into information of each component and outputs the information. That is, the demultiplexer 802 decodes LPC coefficient quantization information to the LPC coefficient decoding unit 803, gain value quantization information to the gain value decoding unit 804, and gain value difference quantization information to decode the gain value difference. In the second fixed codebook decoding unit 808 for the fixed codebook index of the sound quality enhancement layer 630 and the adaptive codebook index in the first fixed codebook decoding unit 805 for the fixed codebook index of the basic layer 600. Are provided to the adaptive codebook decoding unit 806, respectively.

ＬＰＣ係数復号化部８０３の構造は、符号化装置側のＬＰＣ係数の抽出及びベクトル量子化器６０４により決定され、入力されるＬＰＣ係数量子化情報からＬＰＣ係数を復元する。復元されたＬＰＣ係数は、合成フィルタ８１５と後処理部８１６とに提供される。 The structure of the LPC coefficient decoding unit 803 is determined by the LPC coefficient extraction and vector quantizer 604 on the encoder side, and restores the LPC coefficient from the input LPC coefficient quantization information. The restored LPC coefficient is provided to the synthesis filter 815 and the post-processing unit 816.

利得値復号化部８０４の構造は、符号化装置側の利得値量子化器６２９により決定される。利得値復号化部８０４は、入力される利得値量子化情報をデコードする。前記利得値量子化情報は、適応コードブック利得値と固定コードブック利得値とを含む。したがって、利得値復号化部８０４から基本階層６００での適応コードブック利得値Ｇ_Ｐと固定コードブック利得値Ｇ_Ｃとがそれぞれ出力される。 The structure of gain value decoding section 804 is determined by gain value quantizer 629 on the encoding device side. The gain value decoding unit 804 decodes the input gain value quantization information. The gain value quantization information includes an adaptive codebook gain value and a fixed codebook gain value. Thus, the adaptive codebook gain value G _P of the basic hierarchical 600 and fixed codebook gain value G _C from the gain value decoding unit 804 are output.

第１固定コードブック復号化部８０５は、入力される第１固定コードブックインデックスをデコードして第１固定コードブックを出力する。固定コードブック復号方式は、符号化装置の固定コードブック探索部６１７での探索方式により決定される。 The first fixed codebook decoding unit 805 decodes the input first fixed codebook index and outputs the first fixed codebook. The fixed codebook decoding method is determined by the search method in fixed codebook search unit 617 of the encoding device.

適応コードブック復号化部８０６は、入力される適応コードブックインデックスをデコードして適応コードブックを出力する。 The adaptive codebook decoding unit 806 decodes the input adaptive codebook index and outputs an adaptive codebook.

前述したＬＰＣ係数復号化部８０３、利得値復号化部８０４、固定コードブック復号化部８０５、及び適応コードブック復号化部８０６は、逆多重化器８０２から伝送される基本階層６００での符号化情報をデコードする復号化ユニットと定義されうる。 The LPC coefficient decoding unit 803, gain value decoding unit 804, fixed codebook decoding unit 805, and adaptive codebook decoding unit 806 described above are encoded in the base layer 600 transmitted from the demultiplexer 802. It may be defined as a decoding unit that decodes information.

利得値差の復号化部８０７と第２固定コードブック復号化部８０８との動作は、ネットワークトラフィック状態や受信端末の処理容量に依存する。 The operations of the gain value difference decoding unit 807 and the second fixed codebook decoding unit 808 depend on the network traffic state and the processing capacity of the receiving terminal.

もし、利得値差の復号化部８０７と第２固定コードブック復号化部８０８とが動作すると決定されれば、利得値差の復号化部８０７は、入力される利得値差の量子化情報をデコードする。第２固定コードブック復号化部８０８は、入力される第２固定コードブックインデックスをデコードする。利得値差復号化方式は、符号化装置側の利得値差の量子化器６４３により決定される。 If it is determined that the gain value difference decoding unit 807 and the second fixed codebook decoding unit 808 operate, the gain value difference decoding unit 807 determines the gain value difference quantization information to be input. Decode. The second fixed codebook decoding unit 808 decodes the input second fixed codebook index. The gain value difference decoding method is determined by a gain value difference quantizer 643 on the encoding device side.

第２固定コードブック復号化部８０８で行われるデコード方式は、符号化装置側の第２固定コードブック探索部６３１により決定される。利得値差の復号化部８０７と第２固定コードブック復号化部８０８とは、逆多重化器９０２から伝送される音質向上階層６３０での符号化情報をデコードする復号化ユニットと定義されうる。 The decoding method performed by the second fixed codebook decoding unit 808 is determined by the second fixed codebook search unit 631 on the encoding device side. The gain value difference decoding unit 807 and the second fixed codebook decoding unit 808 may be defined as decoding units that decode the encoded information in the sound quality enhancement layer 630 transmitted from the demultiplexer 902.

乗算器８０９は、利得値復号化部８０４によって復元された基本階層６００の固定コードブック利得値Ｇ_ｃを第１固定コードブック復号化部８０５によって出力された基本階層の固定コードブックに乗算して基本階層の固定コードブックベクトルを出力する。 The multiplier 809 multiplies the fixed codebook gain value G _c of the base layer 600 restored by the gain value decoding unit 804 to the fixed codebook of the first fixed codebook decoding unit 805 output the base layer by Output base layer fixed codebook vector.

乗算器８１０は、利得値差の復号化部８０７によって復元された音質向上階層６３０での固定コードブック利得値Ａを、第２固定コードブック復号化部８０８によって出力された音質向上階層の固定コードブックに乗算して音質向上階層の固定コードブックベクトルを出力する。 The multiplier 810 uses the fixed codebook gain value A in the sound quality enhancement layer 630 restored by the gain value difference decoding unit 807 as the fixed code of the sound quality improvement layer output from the second fixed codebook decoding unit 808. Multiplies the book and outputs a fixed codebook vector of the sound quality enhancement layer.

加算器８１１は、乗算器８０９から出力される基本階層の固定コードブックベクトルと乗算器８１０から出力される音質向上階層の固定コードブックベクトルとを加算する。これによって、復号化装置での固定コードブックパルスは、基本階層と音質向上階層の代数コードブックを累積させて多重大きさを有する代数コードブックパルス構造を有する。前記代数コードブックを累積させることは、固定コードブックのあらゆるパルスが同じ大きさを有する既存の固定コードブック構造で発生する短所を補完するためのものである。 The adder 811 adds the fixed codebook vector of the basic layer output from the multiplier 809 and the fixed codebook vector of the sound quality improvement layer output from the multiplier 810. Accordingly, the fixed codebook pulse in the decoding apparatus has an algebraic codebook pulse structure having a multiple size by accumulating the algebraic codebooks of the basic layer and the sound quality enhancement layer. Accumulating the algebraic codebook is intended to compensate for the disadvantages that occur with existing fixed codebook structures in which every pulse of the fixed codebook has the same magnitude.

選択スイッチ８１２は、加算器８１１から出力される信号と乗算器８０９から出力される基本階層の固定コードブックベクトルとを選択的に伝送する。すなわち、前記復号化装置が音質向上階層で動作しない場合に、選択スイッチ８１２は、乗算器８０９から出力される基本階層の固定コードブックベクトルを選択して伝送する。前記符号化装置が音質向上階層で動作する場合に、選択スイッチ８１２は加算器８１１から出力される信号を選択して伝送する。 The selection switch 812 selectively transmits the signal output from the adder 811 and the base layer fixed codebook vector output from the multiplier 809. That is, when the decoding apparatus does not operate in the sound quality enhancement layer, the selection switch 812 selects and transmits the fixed codebook vector of the basic layer output from the multiplier 809. When the encoding apparatus operates in the sound quality improvement layer, the selection switch 812 selects and transmits the signal output from the adder 811.

乗算器８１３は、適応コードブック復号化部８０６から出力されるデコードされた適応コードブックに利得値復号化部８０４から出力される適応コードブックの利得値Ｇ_ｐを乗算して適応コードブックベクトルを出力する。 Multiplier 813, an adaptive codebook vector is multiplied by a gain value G _p of the adaptive codebook output from the gain value decoding unit 804 to the decoded adaptive codebook output from the adaptive codebook decoding unit 806 Output.

加算器８１４は、選択スイッチ８１２により選択された固定コードブックベクトルと乗算器８１３から出力される適応コードブックベクトルとを加算して復元された励起信号を発生させる。 The adder 814 adds the fixed codebook vector selected by the selection switch 812 and the adaptive codebook vector output from the multiplier 813 to generate a restored excitation signal.

前述した乗算器８１０、加算器８１１及び選択スイッチ８１２は、前記復号化装置の動作環境によって前述した基本階層の符号化情報を復号化するユニットと音質向上階層の符号化情報を復号化するユニットとでそれぞれデコードされた信号を演算する演算ユニットと定義されうる。 The multiplier 810, the adder 811 and the selection switch 812 described above include a unit for decoding the encoding information of the base layer and a unit for decoding the encoding information of the sound quality improvement layer according to the operating environment of the decoding device. Can be defined as operation units for calculating the decoded signals.

合成フィルタ８１５は、ＬＰＣ係数復号化部８０３から提供される復元されたＬＰＣ係数を利用して、加算器８１４から提供される励起信号を合成して音声信号を復元する。 The synthesis filter 815 synthesizes the excitation signal provided from the adder 814 using the restored LPC coefficient provided from the LPC coefficient decoding unit 803 to restore the speech signal.

後処理部８１６は、合成フィルタ８１５から伝送される音声信号を復元する。すなわち、後処理部８１６は、音声信号を復元するために、ＬＰＣ係数復号化部８０３から提供されるＬＰＣ係数を利用して、合成フィルタ８１５から出力される信号をフィルタリングするためのハイパスフィルタを使用する。 The post-processing unit 816 restores the audio signal transmitted from the synthesis filter 815. That is, the post-processing unit 816 uses a high-pass filter for filtering the signal output from the synthesis filter 815 using the LPC coefficient provided from the LPC coefficient decoding unit 803 to restore the audio signal. To do.

前述した合成フィルタ８１５と後処理部８１６とは前記演算ユニットから出力される信号を、ＬＰＣ係数復号化部８０３から出力されるＬＰＣ係数と合成して音声信号を復元する復元ユニットと定義されうる。 The synthesis filter 815 and the post-processing unit 816 described above may be defined as a restoration unit that synthesizes the signal output from the arithmetic unit with the LPC coefficient output from the LPC coefficient decoding unit 803 to restore the speech signal.

図９は、図６の音声信号の符号化装置で基本階層の固定コードブック探索９０１により探索されたパルスの位置と、音質向上階層の固定コードブック探索９０５により探索されたパルスの位置と、に基づいた固定コードブックベクトルを利用して図８の音声信号復号化装置で復元されるパルスの大きさを説明するための図面である。 FIG. 9 shows the positions of pulses searched by the fixed codebook search 901 in the basic layer and the pulse positions searched by the fixed codebook search 905 in the sound quality improvement layer in the speech signal encoding apparatus of FIG. FIG. 9 is a diagram for explaining the magnitude of a pulse restored by the speech signal decoding apparatus of FIG. 8 using a fixed codebook vector based thereon.

図９を参照すれば、第１固定コードブック復号化部８０５で提供される固定コードブックベクトル９０２に、利得値復号化部８０４で提供される固定コードブック利得値Ｇ_ｃが、乗算器８０９によって乗算されて、利得値が乗算された基本階層固定コードブックベクトル９０４が生成される。 Referring to FIG. 9, the fixed codebook vector 902 provided in the first fixed codebook decoding unit 805, the fixed codebook gain value G _c, which is provided by the gain value decoding unit 804, the multiplier 809 A base layer fixed codebook vector 904 multiplied by the gain value is generated by multiplication.

第２固定コードブック復号化部８０８で提供される固定コードブックベクトル９０６に、利得値差の復号化部８０７で提供される利得値Ｇ_ＣＥが乗算器８１０によって乗算されて、利得値が乗算された音質向上階層固定コードブックベクトル９０８が生成される。加算器８１１は、音質向上階層固定コードブックベクトル９０８と基本階層固定コードブックベクトル９０４とを加算した固定コードブックベクトル９１０を生成する。 A fixed codebook vector 906 provided in the second fixed codebook decoding unit 808, a gain value G _CE provided by the decoding unit 807 of the gain difference is multiplied by a multiplier 810, a gain value is multiplied A sound quality improvement hierarchical fixed codebook vector 908 is generated. The adder 811 generates a fixed codebook vector 910 obtained by adding the sound quality improvement hierarchical fixed codebook vector 908 and the basic hierarchical fixed codebook vector 904.

図９のパルスの構造に示されたように、基本階層の固定コードブックベクトル９０４と音質向上階層の固定コードブックベクトル９０８とは、加算器８１１に入力されて、音質向上階層の最終固定コードブック９１０を生成する。最終音質向上階層固定コードブック９１０は、相異なる利得値を有する２つの固定コードブックベクトルが加えられて得られるために、多重大きさを有する固定コードブックが形成できてさらに良い音質が提供できる。 As shown in the pulse structure of FIG. 9, the fixed codebook vector 904 of the basic layer and the fixed codebook vector 908 of the sound quality enhancement layer are input to the adder 811 to be the final fixed codebook of the sound quality enhancement layer. 910 is generated. Since the final sound quality enhancement hierarchical fixed codebook 910 is obtained by adding two fixed codebook vectors having different gain values, a fixed codebook having multiple sizes can be formed, and better sound quality can be provided.

図１０は、本発明の望ましい他の実施例によるビット率拡張音声符号化方法のフローチャートである。 FIG. 10 is a flowchart of a bit rate extended speech encoding method according to another embodiment of the present invention.

第１００１段階で、音声信号の符号化装置は、図６の前処理ユニット６０２のように入力された音声信号を前処理する。第１００２段階で、音声信号の符号化装置は、前処理された音声信号からＬＰＣ係数を抽出し、抽出されたＬＰＣ係数の量子化情報を生成する。 In operation 1001, the audio signal encoding apparatus preprocesses an input audio signal as in the preprocessing unit 602 of FIG. In operation 1002, the speech signal encoding apparatus extracts LPC coefficients from the preprocessed speech signal and generates quantization information of the extracted LPC coefficients.

第１００３段階で、音声信号の符号化装置は、前記前処理された信号から合成フィルタ６０６を経てＬＰＣ係数の残差信号を検出する。第１００４段階で、音声信号の符号化装置は、検出された残差信号を図６の認知加重フィルタ６１０でのようにフィルタリングして、認知加重された信号を出力する。 In operation 1003, the speech signal encoding apparatus detects a residual signal of an LPC coefficient from the preprocessed signal through a synthesis filter 606. In operation 1004, the speech signal encoding apparatus filters the detected residual signal as in the perceptual weighting filter 610 of FIG. 6 and outputs a perceptual weighted signal.

第１００５段階で、音声信号の符号化装置は、認知加重された信号のピッチを図６のピッチ分析部６１２のように分析し、図６のピッチ寄与度の除去部６１５のように分析された結果を利用して、前記認知加重された信号からピッチ寄与度を除去して、適応コードブック利得値と適応コードブックインデックスとを生成する。 In operation 1005, the speech signal encoding apparatus analyzes the pitch of the cognitive weighted signal as in the pitch analysis unit 612 of FIG. 6 and the pitch contribution removal unit 615 of FIG. 6. The result is used to remove the pitch contribution from the cognitive weighted signal to generate an adaptive codebook gain value and an adaptive codebook index.

第１００６段階で、音声信号の符号化装置は、図６の基本階層６００の固定コードブック探索部６１７でのように、基本階層固定コードブックを探索して固定コードブック利得値と固定コードブックインデックスとを生成する。 In operation 1006, the speech signal encoding apparatus searches the base layer fixed codebook to search for a fixed codebook gain value and a fixed codebook index as in the fixed codebook search unit 617 of the base layer 600 of FIG. And generate

第１００７段階で、音声信号の符号化装置は、図６の利得値量子化器６２９でのように、前記検出された固定コードブック利得値と前記検出された適応コードブック利得値とを量子化する。 In operation 1007, the speech signal encoding apparatus quantizes the detected fixed codebook gain value and the detected adaptive codebook gain value as in the gain value quantizer 629 of FIG. To do.

第１００８段階で、音声信号の符号化装置は、ベクトル量子化されたＬＰＣ係数を利用して、基本階層６００で生成された固定コードブックベクトルと適応コードブックベクトルとの励起信号を図６の合成フィルタ６０６でのように合成する。 In operation 1008, the speech signal encoding apparatus uses the vector quantized LPC coefficients to generate the excitation signal of the fixed codebook vector and the adaptive codebook vector generated in the base layer 600, as shown in FIG. Composite as in filter 606.

第１００９段階で、音声信号の符号化装置は、基本階層６００での固定コードブック探索のための対象信号の影響、及び音質向上階層６３０の以前のＬＰＣ合成信号を除去することによって図６の固定コードブック探索部６３９でのような固定コードブック探索のための対象信号を生成する。すなわち、基本階層６００で検出された対象信号から、基本階層の固定コードブック寄与度と音質向上階層６３０で検出された以前のＬＰＣ合成信号とを除去して音質向上階層での対象信号を得る。 In operation 1009, the speech signal encoding apparatus removes the influence of the target signal for the fixed codebook search in the base layer 600 and the previous LPC synthesized signal in the sound quality improvement layer 630. A target signal for fixed codebook search as in the codebook search unit 639 is generated. That is, from the target signal detected in the base layer 600, the fixed codebook contribution degree in the base layer and the previous LPC synthesized signal detected in the sound quality improvement layer 630 are removed to obtain the target signal in the sound quality improvement layer.

第１０１０段階で、音声信号の符号化装置は、第１００９段階で検出された対象信号を利用して、音質向上階層６３０の固定コードブック探索を行って音質向上階層の固定コードブック利得値と音質向上階層の固定コードブックインデックスとをそれぞれ生成する。 In operation 1010, the speech signal encoding apparatus performs a fixed codebook search in the sound quality improvement layer 630 using the target signal detected in operation 1009, and the fixed codebook gain value and sound quality in the sound quality improvement layer. A fixed codebook index of the improvement hierarchy is generated respectively.

第１０１１段階で、音声信号の符号化装置は、基本階層の量子化された固定コードブックの利得値と音質向上階層の量子化されていない固定コードブック利得値間のログスケール差を量子化する。前述した音質向上階層での固定コードブック探索及び利得値量子化過程は、複数階層の音質向上階層が具備されることによって複数階層で行われうる。音質向上階層処理が複数階層で行われれば、それほど復元される音声信号の質が向上しうる。 In operation 1011, the speech signal encoding apparatus quantizes a log scale difference between a quantized fixed codebook gain value of a base layer and an unquantized fixed codebook gain value of a sound quality enhancement layer. . The fixed codebook search and gain value quantization process in the sound quality enhancement layer described above may be performed in a plurality of layers by providing a plurality of sound quality enhancement layers. If the sound quality improvement layer processing is performed in a plurality of layers, the quality of the restored audio signal can be improved.

第１０１２段階で、音声信号の符号化装置は、音質向上階層で生成された固定コードブックベクトル（または、励起信号）を図６の合成フィルタ６３４に通過させて合成された信号を出力する。 In operation 1012, the speech signal encoding apparatus passes the fixed codebook vector (or excitation signal) generated in the sound quality enhancement layer to the synthesis filter 634 in FIG. 6 and outputs a synthesized signal.

第１０１３段階で、音声信号の符号化装置は、前述した段階を通じて得た線形予測係数量子化情報、基本階層の固定コードブックインデックス、基本階層の適応コードブックインデックス、基本階層の固定コードブックの利得値、基本階層の適応コードブックの利得値、音質向上階層の固定コードブックインデックス及び前記利得値差の量子化情報を多重化してビットストリームを得、前記ビットストリームを音声信号復号化装置側に送出する。 In operation 1013, the speech signal encoding apparatus performs linear prediction coefficient quantization information obtained through the above-described steps, base layer fixed codebook index, base layer adaptive codebook index, base layer fixed codebook gain. Value, base layer adaptive codebook gain value, sound quality enhancement layer fixed codebook index, and gain value difference quantization information are multiplexed to obtain a bitstream, and the bitstream is sent to the audio signal decoding device side To do.

図１１は、本発明の望ましい他の実施例によるビット率拡張音声復号化方法のフローチャートである。 FIG. 11 is a flowchart of a bit rate enhanced speech decoding method according to another embodiment of the present invention.

第１１０１段階で、音声信号復号化装置は、図８の逆多重化器８０２のように受信されるビットストリームを各構成要素の情報に逆多重化する。 In operation 1101, the audio signal decoding apparatus demultiplexes the received bitstream into information of each component as in the demultiplexer 802 of FIG.

第１１０２段階で音声信号復号化装置は、前記逆多重化された信号をデコードする。すなわち、図８のＬＰＣ係数復号化部８０３、利得値復号化部８０４、第１固定コードブック復号化部８０５、適応コードブック復号化部８０６、利得値差の復号化部８０７、第２固定コードブック復号化部８０８のように前記逆多重化された信号をデコードする。 In operation 1102, the audio signal decoding apparatus decodes the demultiplexed signal. That is, the LPC coefficient decoding unit 803, gain value decoding unit 804, first fixed codebook decoding unit 805, adaptive codebook decoding unit 806, gain value difference decoding unit 807, and second fixed code in FIG. Like the book decoding unit 808, the demultiplexed signal is decoded.

第１１０３段階で、音声信号復号化装置は、音声信号復号化装置の動作条件によって音質向上階層の固定コードブック、または基本階層の固定コードブックを選択的に伝送し、利得値も選択的に伝送される。すなわち、音声信号復号化装置が音質向上階層で動作すれば、復元された音質向上階層の固定コードブックの利得値が乗算された音質向上階層の固定コードブックと基本階層の固定コードブックに基本階層の固定コードブックの利得値とが乗算された固定コードブックを加算し、その結果を伝送させる。一方、音声信号の符号化装置が音質向上階層で動作しなければ、音声信号復号化装置は復号化された基本階層の固定コードブックに、基本階層の固定コードブックの利得値を乗算して得られた固定コードブックを伝送させる。 In operation 1103, the audio signal decoding apparatus selectively transmits a fixed codebook of a sound quality enhancement layer or a fixed codebook of a basic layer according to the operating conditions of the audio signal decoding apparatus, and also selectively transmits a gain value. Is done. That is, if the speech signal decoding apparatus operates in the sound quality enhancement layer, the base layer is divided into the fixed codebook of the sound quality improvement layer multiplied by the gain value of the fixed codebook of the restored sound quality improvement layer and the fixed codebook of the base layer. The fixed codebook multiplied by the gain value of the fixed codebook is added, and the result is transmitted. On the other hand, if the speech signal encoding device does not operate in the sound quality enhancement layer, the speech signal decoding device obtains the decoded base layer fixed codebook by the gain value of the base layer fixed codebook. The fixed codebook is transmitted.

第１１０４段階で、音声信号復号化装置は第１１０２段階で復号化されたＬＰＣ係数を利用して、第１１０３段階で選択的に伝送された固定コードブックを合成する。 In operation 1104, the speech signal decoding apparatus synthesizes the fixed codebook selectively transmitted in operation 1103 using the LPC coefficients decoded in operation 1102.

第１１０５段階で、音声信号復号化装置は、後処理ユニット８１６のように後処理して復元された音声信号を生成する。 In operation 1105, the speech signal decoding apparatus generates a speech signal restored by post-processing as in the post-processing unit 816.

これまで本発明についてその望ましい実施例を中心として説明した。本発明が属する技術分野で当業者は本発明が本発明の本質的な特性から外れない範囲で変形された形態に具現されうることが理解できる。したがって、開示された実施例は限定的な観点でなく、説明的な観点で考慮せねばならない。本発明の範囲は前述した説明でなく特許請求の範囲に記載されており、それと同等な範囲内のあらゆる差異点は本発明に含まれたものと解釈されねばならない。 The present invention has been described above with a focus on preferred embodiments thereof. Those skilled in the art to which the present invention pertains can understand that the present invention can be embodied in a modified form without departing from the essential characteristics of the present invention. Accordingly, the disclosed embodiments are to be considered in an illustrative, not a limiting sense. The scope of the present invention is described in the scope of claims rather than the above description, and all differences within the equivalent scope should be construed as being included in the present invention.

本発明はＣＥＬＰアルゴリズムを使用し、基本階層と音質向上階層とを有する音声符号化及び復号化装置に適用可能である。 The present invention is applicable to a speech encoding / decoding device that uses a CELP algorithm and has a basic layer and a sound quality improvement layer.

本発明の望ましい一実施例によるビット率拡張音声符号化装置のブロック図である。1 is a block diagram of a bit rate extended speech encoding apparatus according to an embodiment of the present invention. 図１に示された基本階層固定コードブック探索部により探索されたパルスの位置と音質向上階層固定コードブック探索部により探索されたパルスの位置との例示図である。FIG. 2 is an exemplary diagram of a pulse position searched by a basic layer fixed codebook search unit shown in FIG. 1 and a pulse position searched by a sound quality improvement layer fixed codebook search unit. 本発明の望ましい一実施例によるビット率拡張音声復号化装置のブロック図である。1 is a block diagram of a bit rate extended speech decoding apparatus according to an embodiment of the present invention. 本発明の望ましい一実施例によるビット率拡張音声符号化方法のフローチャートである。3 is a flowchart of a bit rate extended speech encoding method according to an embodiment of the present invention. 本発明の望ましい一実施例によるビット率拡張音声復号化方法のフローチャートである。3 is a flowchart of a bit rate extended speech decoding method according to an embodiment of the present invention. 本発明の望ましい他の実施例によるビット率拡張音声符号化装置のブロック図である。FIG. 5 is a block diagram of a bit rate extended speech encoding apparatus according to another embodiment of the present invention. 図６のビット率拡張音声符号化装置で音質向上階層の利得値差の量子化器のブロック図である。FIG. 7 is a block diagram of a quantizer for gain value difference of a sound quality improvement layer in the bit rate extended speech encoding apparatus of FIG. 6. 本発明の望ましい他の実施例によるビット率拡張音声復号化装置のブロック図である。FIG. 6 is a block diagram of a bit rate extended speech decoding apparatus according to another embodiment of the present invention. 図８のビット率拡張音声復号化装置で基本階層固定コードブック探索により探索されたパルスの位置と音質向上階層固定コードブック探索により探索されたパルスの位置との例示図である。FIG. 9 is an exemplary diagram of a pulse position searched by a base layer fixed codebook search and a pulse position searched by a sound quality improvement layer fixed codebook search in the bit rate extended speech decoding apparatus of FIG. 8. 本発明の望ましい他の実施例によるビット率拡張音声符号化方法のフローチャートである。7 is a flowchart of a bit rate extended speech encoding method according to another embodiment of the present invention. 本発明の望ましい他の実施例によるビット率拡張音声復号化方法のフローチャートである。5 is a flowchart of a bit rate enhanced speech decoding method according to another embodiment of the present invention.

Explanation of symbols

１００基本階層
１０１、１０３、１０５、１０７、１０９、１１１、１１３、１１４、１１６、１１８、１１８’、１１８’’、１２０、１２２、１２５、１２７、１３５ライン
１０２前処理ユニット
１０４ＬＰＣ係数の抽出及びベクトル量子化器
１０６合成フィルタ
１０８減算器
１１０認知加重フィルタ
１１２ピッチ分析部
１１５ピッチ寄与度の除去部
１１７固定コードブック探索部
１１９固定コードブック
１２１第１乗算器
１２３加算器
１２４適応コードブック
１２６第２乗算器
１２９利得値量子化部
１３０音質向上階層
１３１固定コードブック探索部
１３２利得値
１３４利得値差の量子化器
１４０多重化器
Ｇ_ｃ、Ｇ_ｐ量子化された利得値 100 Base layer 101, 103, 105, 107, 109, 111, 113, 114, 116, 118, 118 ', 118'', 120, 122, 125, 127, 135 Line 102 Preprocessing unit 104 LPC coefficient extraction and Vector quantizer 106 Synthesis filter 108 Subtractor 110 Cognitive weighting filter 112 Pitch analysis unit 115 Pitch contribution removal unit 117 Fixed codebook search unit 119 Fixed codebook 121 First multiplier 123 Adder 124 Adaptive codebook 126 Second Multiplier 129 Gain value quantization unit 130 Sound quality enhancement layer 131 Fixed codebook search unit 132 Gain value 134 Gain value difference quantizer 140 Multiplexer _Gc, _Gp Quantized gain value

Claims

In an audio signal encoding device,
Filtering the input speech signal using linear predictive coding and generating an excitation signal of the filtered speech signal by fixed codebook search and adaptive codebook search;
Defined by the correlation d (n) between the target signal and the impulse response signal required for the fixed codebook search detected in the basic layer, the code of the pulse and the correlation d (n), Each pulse, each based on a parameter obtained by a fixed codebook search in the basic layer, including a correlation C corresponding to the size information of the correlation d (n) and an energy E of the impulse response signal A sound quality enhancement layer for searching a fixed codebook by searching an algebraic codebook in which a pulse code and a position of each pulse are associated with each other; and
A speech signal encoding apparatus including a multiplexer that multiplexes a signal generated in the base layer and a signal generated in the sound quality improvement layer and outputs the multiplexed signal.

The speech signal encoding apparatus according to claim 1, wherein the fixed codebook search in the basic layer and the fixed codebook search in the sound quality improvement layer are performed using an algebraic codebook.

The sound quality enhancement layer quantizes a difference between a first gain value obtained by a fixed codebook search performed by the basic layer and a second gain value obtained by a fixed codebook search in the sound quality enhancement layer. The speech signal encoding apparatus according to claim 1, further comprising a function.

The multiplexer includes quantization information of linear predictive coding coefficients generated in the base layer, a fixed codebook index in the base layer, an adaptive codebook index in the base layer, and a fixed codebook in the base layer. Quantization information of gain values and quantization information of adaptive codebook gain values in the base layer, fixed codebook index generated in the sound quality enhancement layer, and fixed codebook gain values and sound quality enhancement layer in the base layer The apparatus according to claim 1, wherein the quantized information for the difference value between the fixed codebook gain values is multiplexed.

If there are a plurality of sound quality enhancement layers, the multiplexer multiplexes quantized information related to a difference value between a fixed codebook index output from a plurality of sound quality enhancement layers and the fixed codebook gain value. The audio signal encoding apparatus according to claim 4 , wherein the apparatus is an audio signal encoding apparatus.

In an audio signal encoding device,
A base layer for performing linear predictive coding filtering on an input speech signal and generating an excitation signal corresponding to the filtered speech signal by fixed codebook search and adaptive codebook search;
Defined by the correlation d (n) between the target signal and the impulse response signal required for the fixed codebook search detected in the basic layer, the code of the pulse and the correlation d (n), Each pulse based on a parameter generated by a fixed codebook search in the base layer, including a correlation C corresponding to size information of the correlation d (n) and an energy E of an impulse response signal . A fixed codebook search unit that searches for a fixed codebook by searching an algebraic codebook that associates the code of each pulse and the position of each pulse ;
A difference between a first fixed codebook gain value generated by the fixed codebook search of the base layer and a second fixed codebook gain value output from the fixed codebook search unit is detected, and the detected difference is detected. A plurality of sound quality enhancement layers including a quantizer for gain value difference to be quantized,
An audio signal encoding apparatus including a multiplexer that multiplexes a signal generated in the base layer and a signal generated in the sound quality improvement layer.

In the audio signal decoding device for decoding an audio signal encoded by the encoding device according to any one of claims 1 to 6 , divided into a basic layer and at least one sound quality improvement layer,
A first decoding unit for decoding encoded information in a base layer of the encoded audio signal;
A second decoding unit for restoring encoded information in a sound quality enhancement layer of the encoded audio signal according to an operating environment of the audio signal decoding device;
An arithmetic unit for calculating a signal restored by the first decoding unit and a signal restored by the second decoding unit according to an operating environment of the audio signal decoding device;
An audio signal decoding apparatus comprising: an audio signal restoration unit that synthesizes a signal output from the arithmetic unit using the linear predictive coding coefficient output from the first decoding unit to restore an audio signal.

The first decoding unit is
A linear predictive coding coefficient composite unit for decoding quantization information of the linear predictive coding coefficient included in the coding information in the base layer;
A first fixed codebook decoding unit for decoding a fixed codebook index included in the encoding information in the base layer;
An adaptive codebook decoding unit for decoding an adaptive codebook index included in the encoding information in the base layer;
The speech signal decoding apparatus according to claim 7 , further comprising: a gain value decoding unit that decodes a fixed codebook gain value and an adaptive codebook gain value included in the encoding information in the base layer.

The second decoding unit is
A gain value difference decoding unit for decoding quantization information of a difference between fixed codebook gain values included in encoding information in the speech enhancement layer;
The audio signal decoding apparatus according to claim 8 , further comprising: a second fixed codebook decoding unit that decodes a fixed codebook index included in the encoded information in the sound quality enhancement layer.

The arithmetic unit is
A first adder for adding a difference between the decoded fixed codebook gain value output from the gain value decoding unit and the decoded gain value output from the decoding unit of the gain value difference;
A first selection switch for transmitting the decoded fixed codebook gain value output from the gain value decoding unit or the gain value output from the first adder according to an operating condition of the audio signal decoding device;
The decoded fixed codebook of the sound quality enhancement layer output from the second fixed codebook decoding unit and the decoded basic layer fixed codebook output from the first fixed codebook decoding unit are added. A second adder;
A second selection switch for transmitting the signal output from the second adder or the decoded fixed codebook output from the first fixed codebook decoding unit according to the operating condition of the audio signal decoding device;
A first multiplier for multiplying a gain value of the decoded adaptive codebook output from the adaptive codebook decoding unit and the decoded adaptive codebook output from the gain value decoding unit;
A second multiplier that multiplies the signal output from the first selection switch and the signal output from the second selection switch;
The speech signal decoding apparatus according to claim 9 , further comprising: a third adder that adds a signal output from the first multiplication branch and a signal output from the second multiplication branch.

The audio signal restoration unit includes:
A synthesis filter that synthesizes a signal output from the third adder using the linear predictive coding coefficient;
The speech signal decoding apparatus according to claim 10 , further comprising: a post-processing unit for obtaining the restored speech signal using the linear predictive coding coefficient and a signal output from the synthesis filter.

The second decoding unit is
A gain value difference decoding unit for decoding quantization information of a difference between fixed codebook gain values included in encoding information in the speech enhancement layer;
The audio signal decoding device according to claim 7 , further comprising: a fixed codebook decoding unit that decodes a fixed codebook index included in the encoding information in the sound quality enhancement layer.

The audio signal encoding method is
Extracting a linear predictive coding coefficient of a speech signal input in a basic layer, and generating an excitation signal corresponding to the input speech signal by a fixed codebook search and an adaptive codebook search;
In at least one sound quality enhancement layer, a correlation d (n) between the target signal and impulse response signal required for the fixed codebook search detected in the basic layer, a pulse code, and the correlation d ( n), the correlation degree C corresponding to the size information of the correlation degree d (n), and the energy E of the impulse response signal, and generated by searching the fixed codebook in the base layer Searching for a fixed codebook by searching an algebraic codebook that associates each pulse, the sign of each pulse and the position of each pulse based on a parametric variable;
Multiplexing a signal generated by the base layer and the sound quality enhancement layer.

The method of claim 13 , wherein the sound quality enhancement hierarchical processing step is performed in a plurality of steps.

The sound quality enhancement layer processing step quantizes a difference value between the fixed codebook gain value obtained by the fixed codebook search in the basic layer processing step and the gain value obtained by the fixed codebook search of the sound quality improvement layer. The audio signal encoding method according to claim 13 , wherein:

The parametric variable includes a first correlation between the target signal and the impulse response generated in the base layer processing step, a second correlation corresponding to the magnitude of the first correlation, and an energy of the impulse response signal. The audio signal encoding method according to claim 13 , wherein:

An audio signal decoding method for decoding an audio signal encoded into a base layer and at least one sound quality improvement layer by the audio signal encoding method according to any one of claims 13 to 16 ,
Decoding the encoded audio signal;
Selectively transmitting the codebook for the base layer and the codebook for the sound quality enhancement layer decoded in the decoding step according to the operation condition of the speech signal encoding;
Generating a restored speech signal by synthesizing the selectively transmitted codebook and the linear prediction coefficient decoded in the decoding step.

The decoding step includes
Claim 17, wherein the encoded audio signal demultiplexed into the encoded information related to the coding information and speech quality enhancement layer relating to the base layer, to decode the encoded information demultiplexing Audio signal decoding method.

The audio signal decoding method includes:
The difference between the gain value of the fixed codebook in the base layer decoded in the decoding step and the gain value of the fixed codebook included in the decoded sound quality enhancement layer is added to fix the fixed codebook in the sound quality enhancement layer The audio signal decoding method according to claim 18 , further comprising: restoring a gain value of the audio signal.