JP2005181354A

JP2005181354A - Device and method for decoding

Info

Publication number: JP2005181354A
Application number: JP2003417338A
Authority: JP
Inventors: Takehiro Tominaga; 丈博冨永; Nobuo Kato; 伸雄加藤
Original assignee: Sony Corp
Current assignee: Sony Corp
Priority date: 2003-12-15
Filing date: 2003-12-15
Publication date: 2005-07-07

Abstract

<P>PROBLEM TO BE SOLVED: To shorten the time needed for decoding processing by reducing the calculation loads of inverse quantization processing and inverse normalization processing after the decoding processing of an entropy-encoded code series. <P>SOLUTION: A decoding section 12 cuts variable-length codes out of the code series supplied from a shift register 20 by a maximum word length from the position that a counter 21 indicates, and an index corresponding to the variable-length codes is supplied from an index ROM 22 to a spectrum coefficient ROM 24 by using its output value. The spectrum coefficient ROM 24 stores conversion tables showing correspondence relation between quantization spectrum coefficients x_quant and spectrum coefficients spec corresponding to indexes by values of scale factors sf. The value at an index position is read out of a conversion table selected based upon the value of a scale factor sf supplied from a subordinate information decoding part 11 and outputted as a spectrum coefficient spec. <P>COPYRIGHT: (C)2005,JPO&NCIPI

Description

本発明は、複数の係数に対して正規化処理及び量子化処理を施した各シンボルに対して複数の可変長符号の何れかを割り当てることで符号化して得られた符号列を復号する復号装置及び方法に関する。 The present invention relates to a decoding apparatus for decoding a code string obtained by encoding by assigning any of a plurality of variable length codes to each symbol that has been subjected to normalization processing and quantization processing on a plurality of coefficients. And a method.

知覚情報モデルに基づく映像や音声等のデータ圧縮では、先ず情報のモデル化によるデータ圧縮を行い、その後にエントロピー符号化を行う、という２段階の処理により高能率な圧縮を実現することが多い。このうち、１段階目の処理においては、例えば知覚不可能なロスを許容することで、より高い圧縮率を実現することができる。また、２段階目の処理においては、出現頻度の高いシンボルに対して短い符号を割り当てて符号化結果の平均符号長を短くすることで、より高い圧縮率を実現することができる。このような２段階のデータ圧縮を行う技術としては、ＩＳＯによって標準化された映像・音声の圧縮規格であるＭＰＥＧ（Moving Picture Experts Group）等の標準技術が存在し、広く使用されている。 In data compression such as video and audio based on a perceptual information model, high-efficiency compression is often realized by a two-stage process in which data compression is first performed by information modeling and then entropy coding is performed. Among these, in the first stage process, for example, a higher compression ratio can be realized by allowing a non-perceptible loss. Further, in the second stage processing, a higher compression ratio can be realized by assigning a short code to a symbol having a high appearance frequency and shortening the average code length of the encoding result. As a technique for performing such two-stage data compression, standard techniques such as MPEG (Moving Picture Experts Group), which is a video / audio compression standard standardized by ISO, exist and are widely used.

ここで、人間の知覚は非線形特性を示すという特徴があるため、１段階目の処理として、対数を用いた正規化や非線形量子化が行われることが多い。このとき、正規化に用いられる対数値としては、２進数を用いた計算機で効率よく実装することを目的として、２の巾乗となるような係数が用いられることが多い。 Here, since human perception has a characteristic of exhibiting non-linear characteristics, normalization using a logarithm or non-linear quantization is often performed as the first stage processing. At this time, as a logarithmic value used for normalization, a coefficient that is a power of 2 is often used for the purpose of efficient implementation by a computer using a binary number.

２段階の処理でデータ圧縮された符号化結果を復号する場合、圧縮時の逆方向に処理を行うことになる。すなわち、先ずエントロピー符号を復号し、その後にモデル化に依存するデータの復号を行うことになる。つまり、エントロピー符号を復号した後、その復号結果に対して何らかの処理を行って最終的な出力結果を得ることとなる。 When decoding an encoded result that has been data-compressed in a two-stage process, the process is performed in the reverse direction at the time of compression. That is, the entropy code is first decoded, and then the data depending on the modeling is decoded. That is, after decoding the entropy code, some processing is performed on the decoding result to obtain a final output result.

ところで、エントロピー符号化された符号列を効率よく復号し、シンボル値を求める手法としては、現在までに種々提案されている（例えば特許文献１参照）。 By the way, various methods have been proposed so far for efficiently decoding an entropy-coded code string and obtaining a symbol value (see, for example, Patent Document 1).

特許第３２２９６９０号公報Japanese Patent No. 3229690

ここで、上述のように２段階のデータ圧縮を行っている場合、シンボル値そのものは最終的な復号結果ではないため、その後に上記１段階目の処理に対応した処理が必要になる。しかしながら、この処理コストが高かった場合、エントロピー符号化された符号列の復号処理が十分に高能率化されても、全体として十分なパフォーマンスが得られないという問題がある。 Here, when the two-stage data compression is performed as described above, since the symbol value itself is not the final decoding result, a process corresponding to the first-stage process is required thereafter. However, when this processing cost is high, there is a problem that sufficient performance cannot be obtained as a whole even if the decoding process of the entropy-coded code string is sufficiently efficient.

上記１段階目の処理に対応した処理の例として、上述したＭＰＥＧの音声圧縮として、ＭＰＥＧ１ Layer３（以下、ＭＰ３と記す。）の規格における復号処理の抜粋を以下に示す。なお、規格の詳細な内容については、ＩＳＯ／ＭＰＥＧ標準文章「約 1.5Mbit/s のデジタル記憶媒体用映画及び音声の符号化 -- CD11172-3（オーディオ第３部）」（ISO/IECJTCI/SC29,1992）に記載されている。 As an example of processing corresponding to the first stage processing, an extract of decoding processing in the MPEG1 Layer 3 (hereinafter referred to as MP3) standard as the above-described MPEG audio compression is shown below. As for the detailed contents of the standard, the ISO / MPEG standard text “Encoding of movies and audio for digital storage media of about 1.5 Mbit / s-CD11172-3 (Audio Part 3)” (ISO / IECJTCI / SC29 , 1992).

ＭＰ３では、符号列のうち特にサブ情報といわれる、フレーム毎に特有の情報を記録する領域に含まれるグローバルゲイン等の情報からスケールファクタsfを計算し、このスケールファクタsfから以下の式（１）に従ってゲインgainを算出する。 In MP3, a scale factor sf is calculated from information such as a global gain included in an area in which information unique to each frame, which is called sub-information, is called a sub-information, and the following equation (1) is calculated from the scale factor sf. According to the above, gain gain is calculated.

また、同様に符号列に含まれるエントロピー符号化された周波数領域のデータから量子化スペクトル係数x_quantを求め、この量子化スペクトル係数x_quantから以下の式（２）に示すような逆非線形量子化により正規化スペクトル係数x_invquantを求める。 Similarly, a quantized spectral coefficient x_quant is obtained from entropy-encoded frequency domain data included in the code string, and normalized by inverse nonlinear quantization as shown in the following equation (2) from the quantized spectral coefficient x_quant. Obtain the normalized spectral coefficient x_invquant.

そして、両者を用いて以下の式（３）に従ってスペクトル係数specを計算し、このスペクトル係数specに対してさらにフィルタバンク処理等を行うことで、最終的な復号結果（音声波形等）を得ている。 Then, using both of them, a spectral coefficient spec is calculated according to the following equation (3), and a filter bank process or the like is further performed on the spectral coefficient spec to obtain a final decoding result (speech waveform or the like). Yes.

このとき、ゲインgain及びゲインgainの元になるスケールファクタsfの値は、複数の正規化スペクトル係数x_invquantで共有される。なお、同一のスケールファクタsfの値を共有する周波数帯域は、スケールファクタバンドと称される。 At this time, the gain gain and the value of the scale factor sf that is the basis of the gain gain are shared by a plurality of normalized spectral coefficients x_invquant. Note that frequency bands sharing the same scale factor sf value are referred to as scale factor bands.

このように、ＭＰ３では、エントロピー符号化されたシンボルである量子化スペクトル係数x_quantを復号する他に、スケールファクタバンド毎に式（１）に示したゲインgainの算出を行い、その後式（２）に示した非線形量子化の復号と式（３）に示した乗算とを行う必要があるのが分かる。 As described above, in MP3, in addition to decoding the quantized spectral coefficient x_quant which is an entropy-coded symbol, the gain gain shown in Expression (1) is calculated for each scale factor band, and then Expression (2) It can be seen that the nonlinear quantization decoding shown in (1) and the multiplication shown in Equation (3) need to be performed.

したがって、式（１）〜（３）に示した処理の処理コストが高かった場合、エントロピー符号化された符号列の復号処理が十分に高能率化されても、全体として十分なパフォーマンスが得られないことになる。 Therefore, when the processing cost of the processes shown in the equations (1) to (3) is high, even if the decoding process of the entropy-encoded code string is sufficiently efficient, sufficient performance as a whole can be obtained. There will be no.

本発明は、このような従来の実情に鑑みて提案されたものであり、エントロピー符号化された符号列の復号処理後の逆量子化処理及び逆正規化処理における計算負荷を軽減し、復号処理に要する時間を短縮することが可能な復号装置及びその方法を提供することを目的とする。 The present invention has been proposed in view of such a conventional situation, and reduces the calculation load in the inverse quantization process and the inverse normalization process after the decoding process of the entropy-encoded code string. An object of the present invention is to provide a decoding apparatus and method capable of reducing the time required for the above.

上述した目的を達成するために、本発明に係る復号装置は、複数の係数に対して正規化処理及び量子化処理を施した各シンボルに対して複数の可変長符号の何れかを割り当てることで符号化して得られた符号列を復号する復号装置において、上記複数の可変長符号と、各可変長符号に対応するシンボルに対して予め逆量子化処理及び逆正規化処理を施した値とが対応付けられた変換テーブルが記憶された記憶手段と、上記符号列から、上記変換テーブル内の上記複数の可変長符号の何れかと一致する符号列部分を探索し、探索した符号列部分と一致した可変長符号に対応付けられた値を読み出すことで、上記探索した符号列部分に対応する係数を復元する復号手段とを備えるものである。 In order to achieve the above-described object, the decoding apparatus according to the present invention assigns one of a plurality of variable length codes to each symbol obtained by performing normalization processing and quantization processing on a plurality of coefficients. In the decoding apparatus that decodes the code string obtained by encoding, the plurality of variable length codes and values obtained by performing inverse quantization processing and inverse normalization processing on symbols corresponding to the variable length codes in advance. A code string portion that matches any of the plurality of variable length codes in the conversion table is searched from the storage means storing the associated conversion table and the code string, and matches the searched code string portion. Decoding means for restoring a coefficient corresponding to the searched code string portion by reading a value associated with the variable-length code is provided.

また、上述した目的を達成するために、本発明に係る復号方法は、複数の係数に対して正規化処理及び量子化処理を施した各シンボルに対して複数の可変長符号の何れかを割り当てることで符号化して得られた符号列を復号する復号方法において、上記複数の可変長符号と、各可変長符号に対応するシンボルに対して予め逆量子化処理及び逆正規化処理を施した値とが対応付けられた変換テーブルを用いて、上記符号列から、該変換テーブル内の上記複数の可変長符号の何れかと一致する符号列部分を探索し、探索した符号列部分と一致した可変長符号に対応付けられた値を読み出すことで、上記探索した符号列部分に対応する係数を復元する復号工程を有するものである。 In order to achieve the above-described object, the decoding method according to the present invention assigns one of a plurality of variable length codes to each symbol obtained by performing normalization processing and quantization processing on a plurality of coefficients. In the decoding method for decoding a code string obtained by encoding, a value obtained by performing inverse quantization processing and inverse normalization processing on the plurality of variable length codes and symbols corresponding to the variable length codes in advance. Is searched for a code string portion that matches any of the plurality of variable length codes in the conversion table from the code string, and a variable length that matches the searched code string portion. It has a decoding step of restoring the coefficient corresponding to the searched code string portion by reading the value associated with the code.

このような復号装置及びその方法では、符号列を復号したシンボルに対して逆量子化処理及び逆正規化処理を施す代わりに、各可変長符号に対応するシンボルに対して予め逆量子化処理及び逆正規化処理を施した値とが対応付けられた変換テーブルを用いて、符号列から切り出された符号列部分と一致した可変長符号に対応付けられた値を読み出す。 In such a decoding apparatus and its method, instead of performing the inverse quantization process and the inverse normalization process on the symbol obtained by decoding the code string, the inverse quantization process and the symbol corresponding to each variable length code are performed in advance. A value associated with a variable-length code that matches a code string portion cut out from a code string is read using a conversion table associated with a value subjected to denormalization processing.

本発明に係る復号装置及びその方法によれば、符号列を復号したシンボルに対して逆量子化処理及び逆正規化処理を施す代わりに、各可変長符号に対応するシンボルに対して予め逆量子化処理及び逆正規化処理を施した値とが対応付けられた変換テーブルを用いて、符号列から切り出された符号列部分と一致した可変長符号に対応付けられた値を読み出すため、符号列の復号処理後の逆量子化処理及び逆正規化処理における計算負荷を軽減し、復号処理に要する時間を短縮することが可能となる。また、計算負荷の軽減に伴い、計算に必要な回路規模を小さくすることができるため、部品のコストダウンを図ることができる。 According to the decoding apparatus and method of the present invention, instead of performing inverse quantization processing and inverse normalization processing on symbols obtained by decoding a code string, inverse quantization is performed in advance on symbols corresponding to each variable length code. In order to read out the value associated with the variable length code that matches the code string portion cut out from the code string using the conversion table in which the values subjected to the normalization process and the denormalization process are associated with each other, the code string It is possible to reduce the calculation load in the inverse quantization process and the inverse normalization process after the decoding process, and to shorten the time required for the decoding process. Further, as the calculation load is reduced, the circuit scale necessary for the calculation can be reduced, so that the cost of parts can be reduced.

以下、本発明を適用した具体的な実施の形態について、図面を参照しながら詳細に説明する。この実施の形態は、本発明を、高能率音声符号化技術の１つであるＭＰ３における復号器（オーディオデコーダ）に適用したものである。なお、本発明は音声データの圧縮のみならず映像データの圧縮にも適用できることは勿論である。 Hereinafter, specific embodiments to which the present invention is applied will be described in detail with reference to the drawings. In this embodiment, the present invention is applied to a decoder (audio decoder) in MP3 which is one of high-efficiency speech coding techniques. Of course, the present invention can be applied not only to audio data compression but also to video data compression.

このオーディオデコーダは、周波数情報を正規化・非線形量子化した値（量子化スペクトル係数x_quant）に対して可変長符号を割り当てた符号列を復号するものであり、復号した量子化スペクトル係数x_quantに対して前述した式（１）〜（３）に相当する処理を行ってスペクトル係数specを復号する。 This audio decoder decodes a code string in which a variable length code is assigned to a value (quantized spectrum coefficient x_quant) obtained by normalizing and nonlinearly quantizing frequency information, and for the decoded quantized spectrum coefficient x_quant. Then, the processing corresponding to the above-described equations (1) to (3) is performed to decode the spectral coefficient spec.

なお、ＭＰ３の規格上、スケールファクタsfとして約３６０種類、量子化スペクトル係数x_quantとして約８２００種類の値を使用することが可能であるが、以下では簡単のため、スケールファクタsfとして８種類の値（０，１，・・・，７）、量子化スペクトル係数x_quantとして４種類の値（０，１，２，３）のみが選択可能であるものとする。また、各量子化スペクトル係数x_quantには、以下の表１に示すように、１〜３ビットの可変長符号が割り当てられているものとする。 According to the MP3 standard, about 360 types of scale factors sf and about 8200 types of quantized spectral coefficients x_quant can be used. However, for simplicity, eight values are used as scale factors sf below. Assume that only four types of values (0, 1, 2, 3) can be selected as (0, 1,..., 7) and the quantized spectral coefficient x_quant. In addition, as shown in Table 1 below, it is assumed that 1-3 bit variable length codes are assigned to each quantized spectral coefficient x_quant.

本実施の形態におけるオーディオデコーダの概略構成を図１に示す。図１に示すオーディオデコーダ１において、信号分離部１０は、入力した圧縮データ（ＭＰ３ビットストリーム）をフレーム単位で分割し、フレームの情報を量子化スペクトル係数x_quantが含まれる符号列とそれ以外のサブ情報とに分離する。信号分離部１０は、分離したサブ情報をサブ情報復号部１１に供給すると共に、量子化スペクトル係数x_quantが含まれる符号列を復号部１２に供給する。 FIG. 1 shows a schematic configuration of the audio decoder in the present embodiment. In the audio decoder 1 shown in FIG. 1, the signal separation unit 10 divides the input compressed data (MP3 bit stream) in units of frames, and the frame information is divided into a code string including the quantized spectral coefficient x_quant and other sub-codes. Separated into information. The signal separation unit 10 supplies the separated sub information to the sub information decoding unit 11, and also supplies the code sequence including the quantized spectral coefficient x_quant to the decoding unit 12.

サブ情報復号部１１は、サブ情報のうちスケールファクタsfの値を復号し、このスケールファクタsfを復号部１２に供給する。なお、実際のサブ情報にはスケールファクタsf以外の値も含まれるが、簡単のためここでは省略する。 The sub information decoding unit 11 decodes the value of the scale factor sf in the sub information and supplies the scale factor sf to the decoding unit 12. The actual sub-information includes values other than the scale factor sf, but are omitted here for simplicity.

復号部１２は、量子化スペクトル係数x_quantが含まれる符号列から量子化スペクトル係数x_quantを復号し、この量子化スペクトル係数x_quantに対して逆非線形量子化処理及び逆正規化処理を施すことによりスペクトル係数specを復元する。 The decoding unit 12 decodes the quantized spectral coefficient x_quant from the code string including the quantized spectral coefficient x_quant, and performs the inverse nonlinear quantization process and the inverse normalization process on the quantized spectral coefficient x_quant to thereby obtain the spectral coefficient. Restore spec.

直交変換部１３は、復元されたスペクトル係数specで構成される周波数軸上の信号を直交変換により時間軸上の信号に変換し、時間信号処理部１４は、この時間軸上の信号に対してサブ情報に基づいた変換を行い、復号済みデータ（ＰＣＭデータ）を出力する。 The orthogonal transform unit 13 transforms the signal on the frequency axis constituted by the restored spectral coefficient spec into a signal on the time axis by orthogonal transform, and the time signal processing unit 14 performs the processing on the signal on the time axis. Conversion based on the sub information is performed, and decoded data (PCM data) is output.

このオーディオデコーダ１のうち、復号部１２の内部構成を図２に示す。なお、この図２には、信号分離部１０及びサブ情報復号部１１についても併せて示している。図２に示すように、復号部１２は、符号列を入力する毎にシフト動作を行うシフトレジスタ２０と、外部から与えられる信号のうち次に復号を行うべき場所を記憶し、そこから外部から与えられる最大語長分のビット数を切り出すことができるようにシフトレジスタ２０を制御するためのカウンタ２１と、シフトレジスタ２０の値から後述するインデックスを出力するインデックスＲＯＭ２２と、インデックスから符号長を出力する符号長ＲＯＭ２３と、インデックス及びスケールファクタsfからスペクトル係数specを求め、このスペクトル係数specを直交変換部１３（図１）に供給するスペクトル係数ＲＯＭ２４とから構成される。 FIG. 2 shows an internal configuration of the decoding unit 12 in the audio decoder 1. In FIG. 2, the signal separation unit 10 and the sub information decoding unit 11 are also shown. As shown in FIG. 2, the decoding unit 12 stores a shift register 20 that performs a shift operation every time a code string is input, and a location where decoding is to be performed next out of signals supplied from the outside, and from there externally A counter 21 for controlling the shift register 20 so as to cut out the number of bits corresponding to a given maximum word length, an index ROM 22 for outputting an index described later from the value of the shift register 20, and a code length from the index Code length ROM 23, and a spectral coefficient ROM 24 for obtaining a spectral coefficient spec from the index and scale factor sf and supplying the spectral coefficient spec to the orthogonal transform unit 13 (FIG. 1).

以下、この復号部１２においてスペクトル係数specが出力されるまでの各ブロックの動作について説明する。 Hereinafter, the operation of each block until the spectral coefficient spec is output in the decoding unit 12 will be described.

先ず、シフトレジスタ２０は、信号分離部１０から供給された符号列のうち、カウンタ２１が示す位置から、可変長符号として最大の長さを持つもののビット数（最大語長）分、具体的には３ビット分を切り出す。簡単のため、本実施の形態では最大語長は常に３であるものとするが、この値が可変である場合には、最大語長の情報を例えばサブ情報復号部１１から与えるようにすることも可能である。 First, the shift register 20 has the number of bits (maximum word length) of the code string supplied from the signal separation unit 10 having the maximum length as the variable length code from the position indicated by the counter 21, specifically, Cuts out 3 bits. For simplicity, it is assumed that the maximum word length is always 3 in the present embodiment, but when this value is variable, information of the maximum word length is given from, for example, the sub information decoding unit 11 Is also possible.

次に、このシフトレジスタ２０の出力値を用いてインデックスＲＯＭ２２がアクセスされる。このインデックスＲＯＭ２２の内容を以下の表２に示す。 Next, the index ROM 22 is accessed using the output value of the shift register 20. The contents of the index ROM 22 are shown in Table 2 below.

表２に示すように、４種類の各可変長符号に対して、可変長符号をそのまま含み、且つ可変長符号の先頭ビットを最上位ビットとした３ビット（最大符号長）の符号列がとり得る全ての値に対して、同じインデックス値が割り当てられている。これにより、シフトレジスタ２０から可変長符号に対応したインデックスを取得することができる。なお、ここでは便宜上インデックスの値として量子化スペクトル係数x_quantと同じ値を使用したが、スペクトル係数ＲＯＭ２４との整合性が取れる範囲で、この値を自由に選ぶことができる。 As shown in Table 2, for each of the four types of variable length codes, a code string of 3 bits (maximum code length) including the variable length code as it is and the first bit of the variable length code as the most significant bit is taken. The same index value is assigned to all the obtained values. Thereby, an index corresponding to the variable length code can be acquired from the shift register 20. Here, for convenience, the same value as the quantized spectral coefficient x_quant is used as the index value, but this value can be freely selected within a range where consistency with the spectral coefficient ROM 24 can be obtained.

続いて、インデックスＲＯＭ２２の出力を用いて符号長ＲＯＭ２３がアクセスされる。この符号長ＲＯＭ２３の内容を以下の表３に示す。 Subsequently, the code length ROM 23 is accessed using the output of the index ROM 22. The contents of the code length ROM 23 are shown in Table 3 below.

表３に示すように、４種類の各可変長符号に対応するインデックス値に対して、実際の符号長が対応付けられている。この符号長を用いてカウンタ２１の値を増加させることにより、次回の符号列の読み出し開始位置を次の可変長符号の開始位置に正しく合わせることが可能となる。 As shown in Table 3, the actual code length is associated with the index values corresponding to the four types of variable length codes. By increasing the value of the counter 21 using this code length, it becomes possible to correctly match the read start position of the next code string with the start position of the next variable length code.

スペクトル係数ＲＯＭ２４は、スケールファクタsfの値毎に作成された８つの変換テーブルから構成される。スケールファクタsfの値が０，１，２，３である場合における変換テーブルを以下の表４乃至表７に示す。 The spectrum coefficient ROM 24 is composed of eight conversion tables created for each value of the scale factor sf. Tables 4 to 7 below show conversion tables when the values of the scale factor sf are 0, 1, 2, and 3.

表４乃至表７から分かるように、各変換テーブルには、それぞれスケールファクタsfがある値であった場合における、インデックスの値に相当する量子化スペクトル係数x_quantとスペクトル係数specとの対応関係が示されている。８つの変換テーブルのうち実際に使用されるものは、サブ情報復号部１１から供給されたスケールファクタsfの値を用いて選択される。そして、選択されたテーブルからインデックスの位置に記録された値が読み出され、その値がスペクトル係数specとして出力される。 As can be seen from Tables 4 to 7, each conversion table shows the correspondence between the quantized spectral coefficient x_quant corresponding to the index value and the spectral coefficient spec when the scale factor sf is a certain value. Has been. Of the eight conversion tables, the one actually used is selected using the value of the scale factor sf supplied from the sub information decoding unit 11. Then, the value recorded at the index position is read from the selected table, and the value is output as the spectrum coefficient spec.

なお、スペクトル係数ＲＯＭ２４の選択及びアクセスをソフトウェアで実装する場合には、１次元目をスケールファクタsf、２次元目をインデックスＲＯＭ２２の出力値とした２次元配列をアクセスすることによって行ってもよい。 When the selection and access of the spectral coefficient ROM 24 is implemented by software, it may be performed by accessing a two-dimensional array in which the first dimension is the scale factor sf and the second dimension is the output value of the index ROM 22.

復号部１２は、以上の動作を信号分離部１０からの入力が終了するまで繰り返すことにより、周波数情報を正規化・非線形量子化した値（量子化スペクトル係数x_quant）に対して可変長符号を割り当てた符号列を復号し、スペクトル係数specを求めることができる。 The decoding unit 12 assigns a variable-length code to a value (quantized spectrum coefficient x_quant) obtained by normalizing and nonlinearly quantizing the frequency information by repeating the above operation until the input from the signal separating unit 10 is completed. The spectral coefficient spec can be obtained by decoding the obtained code string.

なお、実際の高能率音声符号化技術では、上述したようにスケールファクタsfの値としてより多くの種類の値が選択可能であるため、スペクトル係数ＲＯＭ２４のサイズが非常に大きなものとなってしまう。 In the actual high-efficiency speech coding technique, as described above, since more types of values can be selected as the value of the scale factor sf, the size of the spectrum coefficient ROM 24 becomes very large.

そこで、実際には以下に説明するように正規化係数の周期性を利用することで、スペクトル係数ＲＯＭ２４のサイズをある程度の大きさに抑えることが可能である。 Therefore, in practice, as described below, the size of the spectrum coefficient ROM 24 can be suppressed to some extent by using the periodicity of the normalization coefficient.

上述したように、通常、正規化係数としては２の巾乗となるような値が用いられるため、ゲインgainの値を２進数領域で見ると周期性をもって変化することになる。 As described above, since a value that is a power of 2 is normally used as the normalization coefficient, the gain gain value changes with periodicity when viewed in the binary number domain.

すなわち、ゲインgainとスケールファクタsfとの関係は前述した式（３）のようになるが、このときスケールファクタsfを４で割った余りをｓｆ％４と表記すると、以下の式（４）のように変形することができる。 That is, the relationship between the gain gain and the scale factor sf is as shown in Equation (3) described above. At this time, when the remainder obtained by dividing the scale factor sf by 4 is expressed as sf% 4, the following equation (4) is obtained. Can be deformed.

ここで、式（４）における(ｓｆ−ｓｆ％４)は常に４の倍数であるから、２＾((ｓｆ−ｓｆ％４)／４)は、常に２の整数乗の値となる。つまり、この値を掛け合わせることは２の整数乗の値を掛け合わせることに等しく、２進数を用いた計算機であれば、シフトや浮動小数点表現における指数部の操作だけで容易に実現できる計算である。 Here, since (sf−sf% 4) in the expression (4) is always a multiple of 4, 2 ^ ((sf−sf% 4) / 4) is always an integer power of 2. In other words, multiplying this value is equivalent to multiplying the value of 2 to the power of an integer, and if it is a computer using a binary number, it is a calculation that can be easily realized only by operation of the exponent part in the shift or floating point expression. is there.

したがって、図２において、サブ情報の復号結果であるスケールファクタsfの値そのものをスペクトル係数ＲＯＭ２４の選択に使用するのではなく、ｓｆ％４を用いて選択するようにし、スペクトル係数ＲＯＭ２４から読み出した値をそのまま出力とせず、その出力に対して２＾((ｓｆ−ｓｆ％４)／４)倍する処理をシフト演算などを用いて行えば、計算量を大幅に増やすことなく、スペクトル係数ＲＯＭ２４のサイズを小さく保つことが可能となる。 Therefore, in FIG. 2, the value of the scale factor sf that is the decoding result of the sub information is not used for selecting the spectral coefficient ROM 24, but is selected using sf% 4, and the value read from the spectral coefficient ROM 24 is used. If the process of multiplying the output by 2 ^ ((sf-sf% 4) / 4) using the shift operation or the like is performed without shifting the output as it is, the calculation amount of the spectrum coefficient ROM 24 is not significantly increased. The size can be kept small.

以上説明したように、本実施の形態におけるオーディオデコーダ１によれば、スペクトル係数specを求める際に、最も負荷が大きい乗算、指数計算を実質的になくせるため、実行時の計算負荷を大幅に軽減することができ、復号処理に要する時間を短縮することができる。具体的には、本件発明者らによるソフトウェアでの実装において、量子化スペクトル係数x_quantからスペクトル係数specを求める部分の処理時間は、従来の方法と比較して約１／３の時間に短縮することができた。 As described above, according to the audio decoder 1 of the present embodiment, when the spectral coefficient spec is obtained, multiplication and exponent calculation with the largest load can be substantially eliminated, so that the calculation load during execution is greatly increased. The time required for the decoding process can be shortened. Specifically, in the software implementation by the present inventors, the processing time of the part for obtaining the spectral coefficient spec from the quantized spectral coefficient x_quant is shortened to about 時間 time compared to the conventional method. I was able to.

また、計算負荷の軽減に伴い、計算に必要な回路規模を小さくすることができるため、例えば限られたメモリサイズでの実行が要求される組み込み機器において、より小さいメモリサイズで復号処理を行うことが可能となり、部品のコストダウンを図ることができる。 In addition, since the circuit scale required for calculation can be reduced as the calculation load is reduced, for example, in an embedded device that requires execution with a limited memory size, decoding processing is performed with a smaller memory size. This makes it possible to reduce the cost of parts.

なお、本発明は上述した実施の形態のみに限定されるものではなく、本発明の要旨を逸脱しない範囲において種々の変更が可能であることは勿論である。 It should be noted that the present invention is not limited to the above-described embodiments, and various modifications can be made without departing from the scope of the present invention.

例えば、上述した実施の形態では、ＭＰ３のように、逆正規化処理の際にのみパラメータを用いるオーディオデコーダに適用した例について説明したが、ＭＰ３に限定されるものではなく、逆量子化処理及び逆正規化処理の際にパラメータを用いる他のオーディオデコーダに適用することも可能である。 For example, in the above-described embodiment, an example is described in which the present invention is applied to an audio decoder that uses parameters only during denormalization processing, such as MP3, but is not limited to MP3. The present invention can also be applied to other audio decoders that use parameters in the denormalization process.

一例として、以下のような復号処理を行うオーディオデコーダを想定する。すなわち、先ずスケールファクタsfから以下の式（５）に従ってゲインgainを算出する。 As an example, an audio decoder that performs the following decoding process is assumed. That is, first, the gain gain is calculated from the scale factor sf according to the following equation (5).

また、エントロピー符号化された周波数領域のデータから量子化スペクトル係数x_quantを求め、この量子化スペクトル係数x_quantから以下の式（６）に示すような逆非線形量子化により正規化スペクトル係数x_invquantを求める。ここで、式（６）におけるmax_quantは、量子化スペクトル係数x_quant毎の量子化ステップから求まる値である。 Also, a quantized spectral coefficient x_quant is obtained from entropy-encoded frequency domain data, and a normalized spectral coefficient x_invquant is obtained from the quantized spectral coefficient x_quant by inverse nonlinear quantization as shown in the following equation (6). Here, max_quant in equation (6) is a value obtained from the quantization step for each quantization spectral coefficient x_quant.

そして、両者を用いて以下の式（７）に従ってスペクトル係数specを計算し、このスペクトル係数specに対してさらにフィルタバンク処理等を行うことで、最終的な復号結果（音声波形等）を得る。 Then, using both of them, a spectrum coefficient spec is calculated according to the following equation (7), and a filter bank process or the like is further performed on the spectrum coefficient spec to obtain a final decoding result (speech waveform or the like).

したがって、このようにスケールファクタsf及びmax_quantという２つのパラメータが存在するオーディオデコーダにおいては、インデックスの値に相当する量子化スペクトル係数x_quantとスペクトル係数specとの対応関係が示された変換テーブルをこれら２つのパラメータ毎にスペクトル係数ＲＯＭ２４に用意すればよい。 Therefore, in an audio decoder in which two parameters such as the scale factor sf and max_quant exist in this way, these two conversion tables indicating the correspondence between the quantized spectral coefficient x_quant corresponding to the index value and the spectral coefficient spec are displayed. What is necessary is just to prepare in the spectrum coefficient ROM 24 for every one parameter.

さらに、上述した実施の形態と同様に、正規化係数の周期性を利用し、図２においてサブ情報の復号結果であるスケールファクタsfの値そのものをスペクトル係数ＲＯＭ２４の選択に使用するのではなく、ｓｆ％３を用いて選択するようにし、スペクトル係数ＲＯＭ２４から読み出した値をそのまま出力とせず、その出力に対して２＾((ｓｆ−ｓｆ％３)／３)倍する処理をシフト演算などを用いて行えば、計算量を大幅に増やすことなく、スペクトル係数ＲＯＭ２４のサイズを小さく保つことが可能となる。 Further, as in the above-described embodiment, the periodicity of the normalization coefficient is used, and the value of the scale factor sf that is the decoding result of the sub information in FIG. 2 is not used for selection of the spectrum coefficient ROM 24. The value read out from the spectral coefficient ROM 24 is not output as it is, but the process of multiplying the output by 2 ^ ((sf-sf% 3) / 3) is performed as a shift operation. If used, the size of the spectrum coefficient ROM 24 can be kept small without significantly increasing the amount of calculation.

本実施の形態におけるオーディオデコーダの概略構成を示す図である。It is a figure which shows schematic structure of the audio decoder in this Embodiment. 同オーディオデコーダの復号部における内部構成を示す図である。It is a figure which shows the internal structure in the decoding part of the audio decoder.

Explanation of symbols

１オーディオデコーダ、１０信号分離部、１１サブ情報復号部、１２復号部、１３直交変換部、１４時間信号処理部、２０シフトレジスタ、２１カウンタ、２２インデックスＲＯＭ、２３符号長ＲＯＭ、２４スペクトル係数ＲＯＭ DESCRIPTION OF SYMBOLS 1 Audio decoder, 10 Signal separation part, 11 Sub information decoding part, 12 Decoding part, 13 Orthogonal transformation part, 14 Time signal processing part, 20 Shift register, 21 Counter, 22 Index ROM, 23 Code length ROM, 24 Spectral coefficient ROM

Claims

In a decoding device that decodes a code string obtained by encoding by assigning any of a plurality of variable length codes to each symbol subjected to normalization processing and quantization processing on a plurality of coefficients,
Storage means for storing a conversion table in which the plurality of variable length codes and the values corresponding to the symbols corresponding to the variable length codes are previously subjected to inverse quantization processing and inverse normalization processing;
From the code string, searching for a code string portion that matches any of the plurality of variable length codes in the conversion table, and reading a value associated with the variable length code that matches the searched code string part, A decoding device comprising: decoding means for restoring a coefficient corresponding to the searched code string portion.

The decoding apparatus according to claim 1, wherein the conversion table is created for each parameter used for the denormalization process, or the dequantization process and the denormalization process.

The normalization coefficient used in the denormalization process is obtained according to the following formula 1, where the normalization coefficient is gain and the scale factor used in the denormalization process is sf:

The decoding apparatus according to claim 1, wherein the conversion table is created for each value of n according to at least the periodicity of the normalization coefficient.

In the conversion table, a plurality of variable length codes and a remainder value obtained by dividing the scale factor by the value of n are regarded as the scale factor, and a symbol corresponding to each variable length code is dequantized in advance. And a value subjected to denormalization processing are associated with each other,
The decoding means reads a value associated with the variable-length code that matches the searched code string portion, sets the scale factor to sf, and divides the scale factor by the value of n as sf% n. 4. The coefficient corresponding to the searched code string portion is restored by multiplying the read value by 2 ^ ((sf−sf% n) / n). Decoding device.

In a decoding method for decoding a code string obtained by encoding by assigning any of a plurality of variable length codes to each symbol subjected to normalization processing and quantization processing on a plurality of coefficients,
From the code string, using the conversion table in which the plurality of variable-length codes and the values corresponding to the symbols corresponding to the variable-length codes are previously subjected to inverse quantization processing and inverse normalization processing, The searched code string is searched by searching for a code string portion that matches any of the plurality of variable length codes in the conversion table and reading a value associated with the variable length code that matches the searched code string portion. A decoding method comprising: a decoding step of restoring a coefficient corresponding to the portion.

6. The decoding method according to claim 5, wherein the conversion table is created for each parameter used for the denormalization process or the dequantization process and the denormalization process.

The normalization coefficient used in the denormalization process is obtained according to the following formula 2 when the normalization coefficient is gain and the scale factor used in the denormalization process is sf.

The decoding method according to claim 5, wherein the conversion table is created for each value of n according to at least the periodicity of the normalization coefficient.

In the conversion table, a plurality of variable length codes and a remainder value obtained by dividing the scale factor by the value of n are regarded as the scale factor, and a symbol corresponding to each variable length code is dequantized in advance. And a value subjected to denormalization processing are associated with each other,
The decoding means reads a value associated with the variable length code that matches the searched code string part, sets the scale factor to sf, and divides the scale factor by the value of n as sf% n. The coefficient corresponding to the searched code string portion is restored by multiplying the read value by 2 ^ ((sf-sf% n) / n). Decryption method.