JP2008289085A

JP2008289085A - Decoding method, decoder, decoding apparatus, encoding method, encoder, program and recording medium

Info

Publication number: JP2008289085A
Application number: JP2007134457A
Authority: JP
Inventors: Yuusuke Hiwazaki; 祐介日和▲崎▼; Naka Omuro; 仲大室; Takeshi Mori; 岳至森; Akitoshi Kataoka; 章俊片岡; Shigeaki Sasaki; 茂明佐々木
Original assignee: Nippon Telegraph and Telephone Corp
Current assignee: Nippon Telegraph and Telephone Corp
Priority date: 2007-05-21
Filing date: 2007-05-21
Publication date: 2008-11-27
Anticipated expiration: 2027-05-21
Also published as: JP4638895B2

Abstract

<P>PROBLEM TO BE SOLVED: To provide a decoding technology and an encoding technology for satisfying the maintenance of encoding quality and reduction in computational complexity in encoding while reducing a memory capacity to be used. <P>SOLUTION: This decoding method includes an spreading code decomposing step, a weighted form decoding step, a weighting removal calculating step, a gain decoding step and a multiplication step. In the spreading code decomposing step, an input code is decomposed into a form code and a gain code. In the weighted form decoding step, a weighted form code book is used to convert the form code into a weighted form vector. In the weighting removal calculating step, a weight of the weighted form vector is removed and a form vector is outputted. In the gain decoding step, a gain code book is used to convert the gain code into a gain. In the multiplication step, the form vector and the gain are multiplied to output a decoded signal. <P>COPYRIGHT: (C)2009,JPO&INPIT

Description

本発明は、符号を復号する復号方法、当該符号を符号化する符号化方法、これらの方法を利用した復号器、復号装置、符号化装置、プログラムおよび記録媒体に関する。 The present invention relates to a decoding method for decoding a code, an encoding method for encoding the code, a decoder, a decoding device, an encoding device, a program, and a recording medium using these methods.

従来から用いられている電話帯域の音声信号を符号化する音声符号化方法として、Ｇ.７１１（非特許文献１）に用いられる非線型波形圧縮符号化（μ則・Ａ則ＰＣＭ）や、Ｇ. ７２６（非特許文献２）などに用いられる差分予測波形圧縮符号化（ＡＤＰＣＭ）などの波形符号化方式がある。公衆電話網及びインターネットを用いた音声通信（ＶｏＩＰ）では、ほぼこの符号化方式が用いられている。 Non-linear waveform compression coding (μ-law / A-law PCM) used in G.711 (Non-Patent Document 1), G-711 (Non-Patent Document 1), G-711 There is a waveform coding method such as differential prediction waveform compression coding (ADPCM) used in 726 (Non-Patent Document 2) and the like. In voice communication (VoIP) using a public telephone network and the Internet, this encoding method is almost used.

一方、携帯電話などの回線の伝送帯域に制限がある場合には、線形予測分析に基づく符号化方式が主流であり、この線形予測分析によって得られる包絡情報を元に雑音を変形して符号化する手法が用いられている。しかし、線形予測分析方式では、符号化処理時間単位毎に演算量の多い自己相関関数を求める必要がある。また、符号選択時には、この包絡情報を符号化処理時間単位毎に反映して符号を選択する必要があり、符号化に要する演算量は上述した波形符号化方式の数十倍となる。また、波形符号化や線形予測分析に基づく符号化方式以外にも高音質で圧縮効率の良い符号化方式は多数存在する。 On the other hand, when the transmission band of a line such as a cellular phone is limited, the coding method based on linear prediction analysis is the mainstream, and noise is transformed and encoded based on the envelope information obtained by this linear prediction analysis. Is used. However, in the linear prediction analysis method, it is necessary to obtain an autocorrelation function having a large amount of calculation for each encoding processing time unit. Further, when selecting a code, it is necessary to select a code by reflecting this envelope information for each encoding processing time unit, and the amount of calculation required for encoding is several tens of times that of the above-described waveform encoding method. In addition to the coding methods based on waveform coding and linear prediction analysis, there are many coding methods with high sound quality and good compression efficiency.

こうして符号化方式が多数存在する場合、方式の違いによって相互接続性が保証されていない。よって異なる符号化方式を搭載した複数の端末装置と通信を行う場合、自らの通信端末上で対応する複数の符号化器および復号器を動作させ、通信相手の端末装置に実装されている符号化方式に応じて符号化方式を使い分ける必要がある。しかし、使用できる演算量が制限される端末装置では、演算量の大きい符号化器を複数同時に動作させることは不可能である。これに対し、波形符号化方式はどのようなＶｏＩＰ会議端末にも一般的に実装されている。以上より、結局はＧ.７１１やＧ.７２６のような波形符号化方式を用いざるを得ない。 Thus, when there are a large number of encoding methods, the interoperability is not guaranteed due to the difference in the methods. Therefore, when communicating with a plurality of terminal devices equipped with different encoding methods, the corresponding encoders and decoders are operated on their own communication terminals, and the encoding implemented in the terminal device of the communication partner It is necessary to use different encoding methods depending on the method. However, in a terminal device in which the amount of computation that can be used is limited, it is impossible to simultaneously operate a plurality of encoders having a large amount of computation. On the other hand, the waveform coding method is generally implemented in any VoIP conference terminal. From the above, eventually, a waveform encoding method such as G.711 or G.726 must be used.

μ則・Ａ則ＰＣＭやＡＤＰＣＭによる符号化方式は振幅の非線形圧縮を用いるため、再生音声に重畳する符号化雑音は音声全体のパワと相関が強く、入力音声レベルに依存せずに復号音声のＳＮ比を一定にできるという利点がある。しかし、この符号化雑音は白色雑音となる。従来Ｇ．７１１やＧ．７２６への入力音声は、ＩＲＳ特性等に代表される高域成分が強調された従来の電話機から出力される信号の周波数特性に変更されることが想定されているため、このように高域成分が強調されている信号によれば白色雑音が顕著に知覚されることはない。ここでのＩＲＳ特性（非特許文献３）とは図１に示すような緩やかな高域通過フィルタ型の周波数特性を指す。 Since the coding method based on μ-law / A-law PCM and ADPCM uses nonlinear compression of amplitude, the coding noise superimposed on the reproduced speech has a strong correlation with the power of the entire speech and does not depend on the input speech level. There is an advantage that the S / N ratio can be made constant. However, this coding noise becomes white noise. Conventional G.G. 711 and G.G. Since the input voice to 726 is assumed to be changed to the frequency characteristics of a signal output from a conventional telephone in which high frequency components typified by IRS characteristics are emphasized, the high frequency components are According to a signal in which is emphasized, white noise is not perceived remarkably. Here, the IRS characteristic (Non-patent Document 3) indicates a gentle high-pass filter type frequency characteristic as shown in FIG.

しかし、ＶｏＩＰなどの通信において高域成分が強調された周波数特性を持つマイクが使用されることは稀である。そのため、音声信号が持つ低域へのパワの集中が是正されることなく符号化され（エンコーダミスマッチ）、低域側のパワ増加の影響で高域側のＳＮ比が悪化し、復号側で雑音が顕著に知覚されてしまうという問題が生じる。例えば平坦な周波数特性を持つマイクを使用して音声を収音すると、符号化対象の信号も低域（〜１ｋＨｚ程度）にパワの集中したものとなり、高域において入力音声レベルに対する符号化雑音レベルが相対的に大きくなり、復号側で雑音が知覚されやすくなる。 However, it is rare that a microphone having a frequency characteristic in which a high frequency component is emphasized is used in communication such as VoIP. For this reason, the power signal is encoded without correcting the power concentration in the low frequency range (encoder mismatch), the SN ratio on the high frequency side deteriorates due to the increase in power on the low frequency side, and noise occurs on the decoding side. This causes a problem that the image is perceived prominently. For example, if a microphone with flat frequency characteristics is used to pick up speech, the signal to be encoded is also concentrated in the low frequency range (about 1 kHz), and the encoding noise level relative to the input speech level in the high frequency range. Becomes relatively large, and noise is easily perceived on the decoding side.

上記の問題を解決するため、出願人は、波形符号化のエンコーダミスマッチによって生じた量子化雑音を低演算量かつ高能率に低減する符号化方法および復号方法を実現している。また、これを実現するにあたり、ビットストリームをスケーラブル構成にすることで基本符号では従来端末とのビットストリームの互換性を保ち、相互接続性を高める手段を提供している（特許文献１）。具体的には、エンコーダミスマッチによる品質劣化を避けるためには、Ｇ．７１１あるいはＧ．７２６を基本段として用いる多段構成の符号化器を用いており、基本段の雑音を低減する２段目には、線形予測分析に基づく符号化方式よりも大幅に低演算量で動作する符号化方式を用いていた。 In order to solve the above problem, the applicant has realized an encoding method and a decoding method that reduce quantization noise caused by encoder mismatch in waveform encoding to a low computational complexity and high efficiency. In order to realize this, the basic code provides means for maintaining the bit stream compatibility with the conventional terminal and enhancing the interconnectivity by making the bit stream scalable (Patent Document 1). Specifically, in order to avoid quality degradation due to encoder mismatch, G. 711 or G.I. A multi-stage encoder using 726 as the basic stage is used, and the second stage for reducing the noise of the basic stage is an encoding that operates with a much lower amount of computation than the encoding method based on linear prediction analysis. The method was used.

特許文献１の符号化器では、あらかじめ重みを付与した高域重み付き形状符号帳と高域重み付きパワ逆数表などの複数の符号帳を用いて高品質に再生でき、かつ低演算量に符号化する機能を実現した。
ITU-T (Telecommunication Standardization Sector, International Telecommunication Union), Geneva, Switzerland. ITU-T G.711-Pulse code modulation (PCM) of voice frequencies, Nov.1988. ITU-T (Telecommunication Standardization Sector, International Telecommunication Union), Geneva, Switzerland. ITU-T G.726-40,32,24,16 kbit/s adaptive differential pulse code modulation (ADPCM), Dec. 1990. ITU-T (Telecommunication Standardization Sector, International Telecommunication Union), Geneva, Switzerland. ITU-T P.830 Annex D-modified IRS send and receive characteristics, Feb. 1996. 特開２００６−１１９３０１号公報 In the encoder of Patent Document 1, it is possible to reproduce with high quality using a plurality of codebooks such as a high-frequency weighted shape codebook pre-weighted and a high-frequency weighted power reciprocal table, and code with low computational complexity. The function to be realized.
ITU-T (Telecommunication Standardization Sector, International Telecommunication Union), Geneva, Switzerland.ITU-T G.711-Pulse code modulation (PCM) of voice frequencies, Nov.1988. ITU-T (Telecommunication Standardization Sector, International Telecommunication Union), Geneva, Switzerland.ITU-T G.726-40,32,24,16 kbit / s adaptive differential pulse code modulation (ADPCM), Dec. 1990. ITU-T (Telecommunication Standardization Sector, International Telecommunication Union), Geneva, Switzerland.ITU-T P.830 Annex D-modified IRS send and receive characteristics, Feb. 1996. JP 2006-119301 A

上述のように、従来は、符号化処理でも復号処理でも、線形予測分析結果などに基づいて重みを変更するために、重みの付いていない同じ符号帳を用いていた。特許文献１では、符号化処理用に、あらかじめ重みを付けた符号帳を用意することで、符号化の品質を保ちながら演算量を低減した。しかし、特許文献１の方法では、復号処理には重みの付いていない符号帳が必要なので、重みを付けた符号帳を記録するためのメモリを確保しなければならなかった。 As described above, conventionally, in order to change the weight based on the linear prediction analysis result or the like in both the encoding process and the decoding process, the same codebook without the weight is used. In Patent Document 1, the amount of calculation is reduced while maintaining the quality of encoding by preparing a codebook that is pre-weighted for encoding processing. However, in the method of Patent Document 1, since a codebook with no weight is required for the decoding process, it is necessary to secure a memory for recording the weighted codebook.

本発明は、メモリの使用量を特許文献１よりも低減しながら、符号化の品質を従来と同等に保つこと、符号化の演算量は特許文献１と同等とすることを満足する復号技術、符号化技術を提供することを目的とする。 The present invention provides a decoding technique that satisfies the requirement that the encoding quality is kept equal to the conventional one while the amount of memory used is lower than that of Patent Literature 1, and that the amount of computation of the encoding is equivalent to that of Patent Literature 1. An object is to provide an encoding technique.

本発明の復号方法は、拡張符号分解ステップ、重み付き形状復号ステップ、重み付け除去演算ステップ、利得復号ステップ、乗算ステップを有する。拡張符号分解ステップは、入力された符号を、形状符号と利得符号に分解する。重み付き形状復号ステップは、重み付き形状符号帳を用いて、形状符号を重み付き形状ベクトルに変換する。重み付け除去演算ステップは、重み付き形状ベクトルの重みを除去し、形状ベクトルを出力する。利得復号ステップは、利得符号帳を用いて、利得符号を利得に変換する。乗算ステップは、形状ベクトルと利得とを乗算して復号信号を出力する。なお、重み付け除去演算ステップでは、符号化するときに用いる重み行列Ｗの逆行列Ｕを用いて重みつき形状ベクトルの重みを除去すればよい。 The decoding method of the present invention includes an extended code decomposition step, a weighted shape decoding step, a weighting removal calculation step, a gain decoding step, and a multiplication step. In the extended code decomposition step, the input code is decomposed into a shape code and a gain code. The weighted shape decoding step converts the shape code into a weighted shape vector using a weighted shape codebook. The weighting removal calculating step removes the weight of the weighted shape vector and outputs the shape vector. The gain decoding step converts the gain code into a gain using a gain codebook. The multiplication step multiplies the shape vector and the gain to output a decoded signal. In the weight removal operation step, the weight of the weighted shape vector may be removed using the inverse matrix U of the weight matrix W used for encoding.

また、本発明の符号化方法は、重み付けステップ、形状計算ステップ、利得計算ステップ、多重ステップを有する。形状計算ステップは、初期化サブステップ、内積サブステップ、理想利得計算サブステップ、距離計算サブステップ、確認サブステップを有する。そして、内積サブステップ、理想利得計算サブステップ、距離計算サブステップ、確認サブステップを、ｎが前記重み付き形状符号帳内の重み付き形状ベクトルの数Ｎまで繰り返し、繰り返し終了時のｉ_ｏｐｔを形状符号Ｉ_ｓとし、形状符号Ｉ_ｓと繰り返し終了時の最適な理想利得ｇ^〜 _ｏｐｔを出力とする。利得計算ステップは、ｇ_ｘを前記利得符号帳内のｘ番目に値が小さい利得とするときに、利得符号帳内の値の小さい利得から順番に、ｇ^〜 _ｏｐｔ＜（ｇ_ｘ＋ｇ_ｘ＋１）／２を満たす最初のｘを探索し、ｘを利得符号Ｉ_ｇとして出力する。 The encoding method of the present invention includes a weighting step, a shape calculating step, a gain calculating step, and a multiplexing step. The shape calculation step includes an initialization substep, an inner product substep, an ideal gain calculation substep, a distance calculation substep, and a confirmation substep. Then, the inner product sub-step, ideal gain calculation sub-step, distance calculation sub-step, and confirmation sub-step are repeated until n is the number N of weighted shape vectors in the weighted shape codebook, and i _opt at the end of the iteration is shaped It is assumed that the code is _Is , and the shape code _Is and the optimum ideal gain g ^to _opt at the end of the repetition are output. In the gain calculation step, when g _x is the gain having the x-th smallest value in the gain codebook, the order of g ^to _opt <(g _x + g _{x + 1} ) / The first x satisfying 2 is searched, and x is output as the gain code _Ig .

本発明の復号方法によれば、復号に重み付き形状符号帳を用いることができるので、符号化のときに用いた重み付き形状符号帳を復号のときにも利用できる。よって、重み付きの形状符号帳をメモリに記録する際には、重み付きでない形状符号帳を記録する必要がない。双方向通信時には符号化器と復号器の両方を同時に備える必要があるため、特許文献１の符号化方式と組み合わせれば、符号化器と復号器の符号帳が共通となるため、現実に必要とされるメモリ使用量を大幅に低減しながら、符号化の品質を保つこと、符号化の演算量を低減することを満足する復号技術、符号化技術を提供できる。 According to the decoding method of the present invention, since a weighted shape codebook can be used for decoding, the weighted shape codebook used for encoding can also be used for decoding. Therefore, when recording a weighted shape codebook in the memory, there is no need to record a non-weighted shape codebook. Since it is necessary to provide both an encoder and a decoder at the same time in bidirectional communication, the codebook of the encoder and the decoder becomes common when combined with the encoding method of Patent Document 1, so it is actually necessary. It is possible to provide a decoding technique and an encoding technique that satisfy the requirements of maintaining the quality of encoding and reducing the amount of calculation of encoding while greatly reducing the amount of memory used.

［第１実施形態］
［符号化装置］
図２に第１実施形態の符号化装置の機能構成例を示す。第１実施形態の符号化装置は、基本符号化器９１０、基本復号器９２０、加算部９３０、拡張符号化器１００から構成される。基本符号化器９１０は、Ｇ.７１１やＧ. ７２６などの従来の波形符号化方式の符号化器を用いればよい。基本復号器９２０は、基本符号化器９１０に対応する復号器である。基本符号化器９１０は、入力信号ｓを符号化し、基本符号Ｉ_ｂを出力する。基本復号器９２０は、基本符号Ｉ_ｂを復号する。加算部９３０は、復号された信号と入力信号との差（基本符号化の量子化雑音）を求める。拡張符号化器１００は、基本符号化の量子化雑音（基本雑音信号ｅ）を符号化する。拡張符号化器は、実時間処理を行う場合であれば、８サンプル（１ｍｓ）〜１６０サンプル（２０ｍｓ）の短時間の処理フレームごとに処理を行う。このときの処理フレームのサンプル数をＫとした場合、基本雑音信号ｅは、Ｋ次元のベクトルで表現できる。符号化装置への入力信号は、例えば８ｋＨｚでサンプリングされた３．４ｋＨｚ帯域（電話帯域）の音声デジタル信号である。実時間処理を行わないのであれば、メモリの許す範囲内で一括処理してもよい。 [First Embodiment]
[Encoding device]
FIG. 2 shows a functional configuration example of the encoding apparatus according to the first embodiment. The encoding apparatus according to the first embodiment includes a basic encoder 910, a basic decoder 920, an adder 930, and an extended encoder 100. The basic encoder 910 may be a conventional waveform encoding encoder such as G.711 or G.726. The basic decoder 920 is a decoder corresponding to the basic encoder 910. Basic encoder 910 encodes the input signal s, and outputs the base code I _b. Elementary decoders 920 decodes the base code _{I b.} The adding unit 930 obtains a difference (basic coding quantization noise) between the decoded signal and the input signal. The extension encoder 100 encodes quantization noise (basic noise signal e) of basic encoding. If real-time processing is performed, the extension encoder performs processing for each short processing frame of 8 samples (1 ms) to 160 samples (20 ms). If the number of samples of the processing frame at this time is K, the basic noise signal e can be expressed by a K-dimensional vector. The input signal to the encoding device is an audio digital signal in the 3.4 kHz band (telephone band) sampled at 8 kHz, for example. If real-time processing is not performed, batch processing may be performed within the range allowed by the memory.

本発明は、図２の符号化装置の中の特に拡張符号化器１００に関するので、以下では拡張符号化器１００について説明する。図３は、拡張符号化器１００の処理フローの例である。拡張符号化器１００は、重み付け部１１０、形状計算部１２０、利得計算部１３０、多重化部１４０、重み付き形状符号帳１５０、重み付き形状パワ逆数帳１６０、利得符号帳１７０から構成される。重み付き形状符号帳１５０には、Ｎ個の重み付き形状ベクトルＷｃ_１，…，Ｗｃ_Ｎが記録されている。重み付き形状ベクトルは、それぞれＫ＋ｒサンプルの重み付き形状信号からなるＫ＋ｒ次元のベクトルである。ここで、ｒは重み付けフィルタのタップ数であり、後述する式（２）の重み付けの場合は１である。重み付き形状パワ逆数帳１６０には、重み付き形状ベクトルのノルムの逆数（１／‖Ｗｃ_１‖^２），…，（１／‖Ｗｃ_Ｎ‖^２）が記録されている。利得符号帳１７０には、利得ｇ_１，…，ｇ_Ｍが記録されている。重み付け部１１０は、基本雑音信号（拡張符号化器１００に対する入力信号）ｅに重みを付与し、重み付き雑音信号Ｗｅを出力する（Ｓ１１０）。形状計算部１２０は、重み付き雑音信号Ｗｅとの距離が最小となるように、重み付き形状符号帳１５０内の重み付き形状ベクトルＷｃ_ｎ（ｎは１〜Ｎの整数、Ｎは重み付き形状符号帳に格納されている重み付き形状ベクトルの個数）を選定するとともに最適な理想利得ｇ^〜 _ｏｐｔを求め、当該重み付き形状ベクトルを示す形状符号Ｉ_ｓと最適な理想利得ｇ^〜 _ｏｐｔとを出力する（Ｓ１２０）。利得計算部１３０は、最適な理想利得ｇ^〜 _ｏｐｔとの距離が最小となるように、利得符号帳１７０内の利得ｇ_ｍ（ｍは１〜Ｍの整数、Ｍは利得符号帳に格納されている利得の個数）を選定し、当該利得を示す利得符号Ｉ_ｇを出力する（Ｓ１３０）。多重化部１４０は、形状符号Ｉ_ｓと利得符号Ｉ_ｇとを多重化して、符号Ｉ_ｅを出力する（Ｓ１４０）。以下では、重み付け部１１０、形状計算部１２０、利得計算部１３０について詳細に説明する。 Since the present invention particularly relates to the extension encoder 100 in the encoding apparatus of FIG. 2, the extension encoder 100 will be described below. FIG. 3 is an example of a processing flow of the extension encoder 100. The extended encoder 100 includes a weighting unit 110, a shape calculation unit 120, a gain calculation unit 130, a multiplexing unit 140, a weighted shape codebook 150, a weighted shape power reciprocal book 160, and a gain codebook 170. In the weighted shape codebook 150, N weighted shape vectors Wc ₁ ,..., Wc _N are recorded. Each weighted shape vector is a K + r-dimensional vector composed of weighted shape signals of K + r samples. Here, r is the number of taps of the weighting filter, and is 1 in the case of weighting according to equation (2) described later. In the weighted shape power reciprocal book 160, reciprocals of norms of weighted shape vectors (1 / ‖Wc ₁ ‖ ² ), ..., (1 / ‖Wc _N ‖ ² ) are recorded. Gains g ₁ ,..., G _M are recorded in gain codebook 170. The weighting unit 110 assigns a weight to the basic noise signal (input signal to the extension encoder 100) e and outputs a weighted noise signal We (S110). Shape calculation unit 120, as a distance between the weighted noise signal We is minimized, weighted shape vector Wc n _(n is an integer 1~N in weighted shape codebook 0.99, N is the weighted coded shape determine the optimum ideal gain g ^~ _opt with selecting the number) of the weighted shape vectors stored in the book, and outputs the coded shape I _s and optimal ideal gain g ^~ _opt indicating the weighted shape vector (S120). The gain calculator 130 stores the gain g _m in the gain codebook 170 (m is an integer from 1 to M, and M is stored in the gain codebook so that the distance ^{from the} optimal ideal gain g ^to _opt is minimized. select a gain number of) that are, for outputting a gain code _{I g} indicating the gain (S130). Multiplexer 140, and a coded shape _{I s} and the gain code _{I g} are multiplexed, and outputs the code _{I e} (S140). Hereinafter, the weighting unit 110, the shape calculation unit 120, and the gain calculation unit 130 will be described in detail.

重み付け部１１０
重み付け部１１０は、基本雑音信号（拡張符号化器１００に対する入力信号）ｅに重みを付与し、重み付き雑音信号Ｗｅを出力する（Ｓ１１０）。具体的には、例えば、

と表現されるＦＩＲフィルタを用いる方法がある。このフィルタは、数学的には（Ｋ＋ｒ）行Ｋ列の行列として次のように表現できる。

式（１）の場合は１タップのフィルタであるため、ｒ＝１である。重みを付与する演算はこの行列Ｗを基本雑音信号ｅに乗じ、重み付き雑音信号Ｗｅを出力する演算である。すなわち畳み込み演算である。なお、例えば、ｂの値として0.550107181を用いた場合のフィルタの周波数特性を図４に示す。図４（Ａ）は振幅の周波数特性、図４（Ｂ）は位相の周波数特性を示している。このようなフィルタ（重み付け）を用いることによって、基本雑音信号ｅの低域成分は大幅に減衰され、高域を重視するため、拡張符号化器１００では高域の雑音を低減できる。なお、入力信号が音声でない場合には、入力信号の特性に合わせたフィルタを用意すればよい。また、このフィルタはｒが１よりも大きい高次のフィルタとしてもよい。 Weighting unit 110
The weighting unit 110 assigns a weight to the basic noise signal (input signal to the extension encoder 100) e and outputs a weighted noise signal We (S110). Specifically, for example,

There is a method using an FIR filter expressed as: This filter can be expressed mathematically as a matrix of (K + r) rows and K columns as follows.

In the case of Expression (1), since it is a one-tap filter, r = 1. The operation for assigning a weight is an operation for multiplying the basic noise signal e by this matrix W and outputting a weighted noise signal We. That is, a convolution operation. For example, FIG. 4 shows the frequency characteristics of the filter when 0.550107181 is used as the value of b. 4A shows the frequency characteristics of the amplitude, and FIG. 4B shows the frequency characteristics of the phase. By using such a filter (weighting), the low frequency component of the basic noise signal e is greatly attenuated and the high frequency is emphasized. Therefore, the extended encoder 100 can reduce high frequency noise. If the input signal is not voice, a filter that matches the characteristics of the input signal may be prepared. Further, this filter may be a higher-order filter in which r is larger than 1.

形状計算部１２０
形状計算部１２０は、重み付き雑音信号Ｗｅと重み付き形状符号帳１５０内の重み付き形状ベクトルＷｃ_ｎ（ｎは１〜Ｎの整数）に最適な理想利得ｇ^〜 _ｏｐｔを乗じたベクトルとの距離Ｄ^〜が最小となるように、重み付き形状ベクトルＷｃ_ｎと最適な理想利得ｇ^〜 _ｏｐｔを選定する（Ｓ１２０）。ただし、理想利得とは、量子化される前の利得（計算によって求めた利得）をさしている。また、最適な理想利得とは、各重み付き形状ベクトルＷｃ_ｎと重み付き雑音信号Ｗｅとの距離Ｄ^〜を最小にできる理想利得である。 Shape calculator 120
Shape calculation unit 120, the distance between the vector (n is an integer of 1 to N) weighted shape vector _Wc n weighted noise signal We and the weighted shape codebook 150 multiplied by the optimal ideal gain ^g _{~ opt} to as D ^~ is minimized, selecting the weighted shape vector Wc _n and optimal ideal gain ^g _{~ opt} (S120). However, the ideal gain refers to the gain before quantization (gain obtained by calculation). Further, the optimal ideal gain, is ideal gain can be minimized distance D ^~ of the respective weighted shape vector Wc _n and weighted noise signal We.

ここで、距離Ｄ^〜は、次のように表現できる。
Ｄ^〜＝‖Ｗｅ−ｇ^〜 _ｏｐｔＷｃ_ｎ‖^２（３）
距離Ｄ^〜を最小にする理想利得ｇ^〜 _ｏｐｔは、距離Ｄ^〜を理想利得ｇ^〜 _ｏｐｔで偏微分した値を０にするので、

が成り立つ。なお、式中のｔは、行列ないしはベクトルの転置を表す。この式をｇ^〜 _ｏｐｔについて解くと、

となる。ｇ^〜 _ｏｐｔを式（３）に代入すると、次式となる。

右辺の第１項‖Ｗｅ‖^２は、Ｗｃ_ｎに対して不変であるため、定数項である。したがって、右辺第２項を最大にすることが、距離Ｄ^〜を最小とすることになる。そこで、拡張符号化器１００では、距離ｄ_ｓを次式のように定義し、距離ｄ_ｓが最大となる重み付き形状ベクトルＷｃ_ｎと最適な理想利得ｇ^〜 _ｏｐｔを選定する。

Here, the distance D ^~ can be expressed as follows.
^{^{_{D ~ = ‖We-g ~ opt}}} Wc n ‖ ² (3)
Distance ^{D ~} ideal gain ^g _~ to minimize _opt, since the distance obtained by partially differentiating ^{D ~} the ideal gain ^g _{~ opt} value of 0

Holds. Note that t in the equation represents transposition of a matrix or a vector. Solving this equation for g ^~ _opt ,

It becomes. When the g ^~ _opt into equation (3), the following equation.

The first term ‖We‖ ² on the right side are the invariant to Wc _n, constant terms. Therefore, maximizing the second term on the right side becomes the distance D ^~ to be minimized. Therefore, the extension encoder 100, the distance _{d s} is defined as: the distance _{d s} to select the most ideal gain ^g _{~ opt} and weighted shape vector Wc _n which maximizes.

次に具体的な形状計算部１２０の構成の例を示す。形状計算部１２０は、図２に示すように初期化手段１２１、内積手段１２２、理想利得計算手段１２３、距離計算手段１２４、確認手段１２５を備えている。各手段の処理は、図３のＳ１２０の内部に示されており、次のようになる。 Next, a specific example of the configuration of the shape calculation unit 120 is shown. As shown in FIG. 2, the shape calculation unit 120 includes an initialization unit 121, an inner product unit 122, an ideal gain calculation unit 123, a distance calculation unit 124, and a confirmation unit 125. The processing of each means is shown in S120 of FIG. 3 and is as follows.

初期化手段１２１は、ｎ＝１、ｄ_ｓｍａｘ＝０とする（Ｓ１２１）。内積手段１２２は、重み付き形状符号帳１５０内のｎ番目の重み付き形状ベクトルＷｃ_ｎと重み付き雑音信号Ｗｅとの内積結果（Ｗｃ_ｎ）^ｔ（Ｗｅ）を、スカラ変数ｑ_ｎとして記録する（Ｓ１２２）。理想利得計算手段は、重み付き形状パワ逆数帳１６０内のｎ番目の要素（１／‖Ｗｃ_ｎ‖^２）と、変数ｑ_ｎとの積を、理想利得ｇ^〜 _ｎとして記録する（Ｓ１２３）。距離計算手段１２４は、理想利得ｇ^〜 _ｎと変数ｑ_ｎの積を、距離ｄ_ｓとして記録する（Ｓ１２４）。なお、ｑ_ｎ＝（Ｗｃ_ｎ）^ｔ（Ｗｅ）は重み付き形状ベクトルと重み付き雑音信号の内積であり、式（５）の分子と同じであるため、ｄ_ｓは、理想利得ｇ^〜 _ｎと変数ｑ_ｎの積として求めることができる。確認手段１２５は、ｄ_ｓｍａｘ＜ｄ_ｓならば、ｄ_ｓをｄ_ｓｍａｘ、ｎを最適な符号ｉ_ｏｐｔ、ｇ^〜 _ｎを最適な理想利得ｇ^〜 _ｏｐｔとする（Ｓ１２５１、Ｓ１２５２）。形状計算部１２０は、ｎがＮ（重み付き形状符号帳の要素の数）よりも小さければ、ｎ＝ｎ＋１としてステップＳ１２２に戻る（Ｓ１２６１、Ｓ１２６２）。ｎ＝Ｎならば、繰り返し終了時のｉ_ｏｐｔを形状符号Ｉ_ｓとし、形状符号Ｉ_ｓと繰り返し終了時の最適な理想利得ｇ^〜 _ｏｐｔを出力とする（Ｓ１２６３）。 The initialization unit 121 sets n = 1 and d _smax = 0 (S121). The inner product means 122 records the inner product result (Wc _n ) ^t (We) of the _nth weighted shape vector Wcn in the weighted shape codebook 150 and the weighted noise signal We as a scalar variable q _n ( S122). Ideal gain calculating means, the n-th element of the weighted shape power reciprocal Book 160 (1 / ‖Wc _n ‖ ^2), the product of the variables _{q n,} is recorded as an ideal gain ^g _{~ n} (S123). The distance calculation unit 124 records the product of the ideal gains g ^to _n and the variable q _n as the distance d _s (S124). Note that q _n = (Wc _n ) ^t (We) is the inner product of the weighted shape vector and the weighted noise signal, and is the same as the numerator of Expression (5), so d _s is the ideal gain g ^to _n it can be obtained as the product of the variable q _n. If d _smax <d _s , the confirmation unit 125 sets d _s to d _smax , n to the optimal code i _opt , and g ^to _n to the optimal ideal gain g ^to _opt (S 1251, S 1252). If n is smaller than N (the number of elements in the weighted shape codebook), the shape calculation unit 120 sets n = n + 1 and returns to step S122 (S1261, S1262). If n = N, the _{i opt} at repeat end a shape code _{I s,} and outputs the coded shape _{I s} and optimal ideal gain ^g _{~ opt} for repeated at the end (S1263).

形状計算部１２０は、このように距離ｄ_ｓが理想利得ｇ^〜 _ｎと変数ｑ_ｎの積として求めることができることを利用しているので、重み付き形状ベクトルＷｃ_ｎと最適な理想利得ｇ^〜 _ｏｐｔを選定するための演算量を特許文献１よりも低減できる。 Shape calculation unit 120, the use of the fact that it is possible in this way the distance _{d s} is determined as the product of the ideal gain ^g _{~ n} and variables _{q n,} weighted shape vector Wc _n and optimal ideal gain ^g _{~ opt} The amount of calculation for selecting can be reduced as compared with Patent Document 1.

利得計算部１３０
利得計算部１３０は、最適な理想利得ｇ^〜 _ｏｐｔとの距離Ｄが最小となるように、利得符号帳１７０内の利得ｇ_ｍ（ｍは１〜Ｍの整数）を選定し、当該利得を示す利得符号Ｉ_ｇを出力する。（Ｓ１３０）。ここで、距離Ｄは、
Ｄ＝‖Ｗｅ−ｇ_ｍＷｃ_ｎ‖^２（８）
である。そして、距離ｄ_ｇを
ｄ_ｇ＝‖ｇ^〜 _ｏｐｔ−ｇ_ｍ‖^２（９）
とし、距離ｄ_ｇが最小となる利得ｇ_ｍを選定する。 Gain calculation unit 130
The gain calculation unit 130 selects the gain g _m (m is an integer of 1 to M) in the gain codebook 170 so that the distance D ^{from the} optimum ideal gain g ^to _opt is minimized, and indicates the gain. The gain code _Ig is output. (S130). Here, the distance D is
D = _‖We-g m Wc _n ‖ ² (8)
It is. Then, the distance d _{g is changed} to d _g = ‖ g ^to _opt ⁻ g _m ‖ ² (9)
And a gain g _m that minimizes the distance d _g is selected.

利得計算部１３０は、探索手段１３５を備えている。探索手段１３５は、利得符号帳１７０内のｘ番目に値が小さい利得ｇ_ｘと、ｘ＋１番目に値が小さい利得ｇ_ｘ＋１を用いて、利得ｇ_ｍ（ｍは１〜Ｍの整数）を選定する。例えば、利得符号帳内の値の小さい利得から順番に、
ｇ^〜 _ｏｐｔ＜（ｇ_ｘ＋ｇ_ｘ＋１）／２（１０）
を満足するかを確認する。そして、最初に式（１０）を満足するｘを探索する。 The gain calculation unit 130 includes search means 135. Search means 135 selects gain g _m (m is an integer from ₁ to M) using gain g _x having the smallest value in x in gain codebook 170 and gain g _{x + 1} having the smallest value in _{x + 1.} . For example, in order from the smallest gain in the gain codebook,
g ^to _opt <(g _x + g _{x + 1} ) / 2 (10)
Check if you are satisfied. First, x that satisfies Expression (10) is searched.

具体的な探索処理の例は次のとおりである。探索手段１３５は、ｘ＝１とする（Ｓ１３１）。式（１０）を満足するかを確認する（Ｓ１３２）。ステップＳ１３２がＮｏならば、ｘ＝ｘ＋１とする（Ｓ１３３）。ｘがＭ（Ｍは、利得符号帳１７０内の利得ｇ_ｍの数）よりも小さいかを確認する（Ｓ１３４）。ステップＳ１３４がＹｅｓの場合はステップＳ１３２に戻る。ステップＳ１３２がＹｅｓの場合とステップＳ１３４がＮｏの場合は、ｘを利得符号Ｉ_ｇとして出力する（Ｓ１３５）。
利得計算部１３０は、このように式（１０）を利用して計算するので、演算量を特許文献１よりも低減できる。 A specific example of the search process is as follows. The search means 135 sets x = 1 (S131). It is confirmed whether the expression (10) is satisfied (S132). If step S132 is No, x = x + 1 is set (S133). x is M (M is the number of gain _{g m} in the gain codebook 170) checks whether less than (S134). If step S134 is Yes, the process returns to step S132. Step S132 is the case with step S134: Yes If No, and outputs a x as a gain code _{I g} (S135).
Since the gain calculation unit 130 calculates using the formula (10) in this way, the amount of calculation can be reduced as compared with Patent Document 1.

［復号装置］
図５に第１実施形態の復号装置の構成例を示す。復号装置は、基本復号器９２０と加算部９４０と拡張復号器２００から構成される。基本復号器９２０は、基本符号化器９１０に対応する復号器である。基本復号器９２０は、基本符号Ｉ_ｂを再生基本信号Ｓ＾_ｂに復号する。拡張復号器２００は、拡張符号Ｉ_ｅを再生雑音信号ｅ＾に復号する。加算部９４０は、再生基本信号Ｓ＾_ｂと再生雑音信号ｅ＾を加算し、再生信号ｓ＾を出力する。 [Decoding device]
FIG. 5 shows a configuration example of the decoding apparatus according to the first embodiment. The decoding apparatus includes a basic decoder 920, an adder 940, and an extended decoder 200. The basic decoder 920 is a decoder corresponding to the basic encoder 910. The basic decoder 920 decodes the basic code _Ib into the reproduction basic signal S ^ _b . The extended decoder 200 decodes the extended code I _e into a reproduced noise signal e ^. The adder 940 adds the reproduction basic signal S ^ _b and the reproduction noise signal e ^ and outputs a reproduction signal s ^.

本発明は、図５の復号装置の中の特に拡張復号器２００に関するので、以下では拡張復号器２００について説明する。図６は、拡張復号器２００の処理フローの例である。拡張復号器２００は、分解部２１０、重み付き形状復号部２２０、重み付け除去演算部２３０、利得復号部２４０、乗算部２５０、重み付き形状符号帳１５０、利得符号帳１７０で構成される。第１実施形態の拡張復号器２００では、復号用の形状符号帳にも重み付きの符号帳を用いることができる点が特許文献１と大きく異なる点である。重み付きの符号帳を用いることができるので、符号化に用いた重み付き形状符号帳１５０を復号にも使用できる。 Since the present invention particularly relates to the extended decoder 200 in the decoding apparatus of FIG. 5, the extended decoder 200 will be described below. FIG. 6 is an example of a processing flow of the extended decoder 200. The extended decoder 200 includes a decomposition unit 210, a weighted shape decoding unit 220, a weighting removal calculation unit 230, a gain decoding unit 240, a multiplication unit 250, a weighted shape codebook 150, and a gain codebook 170. The extended decoder 200 according to the first embodiment is significantly different from Patent Document 1 in that a weighted codebook can be used for the shape codebook for decoding. Since a weighted codebook can be used, the weighted shape codebook 150 used for encoding can also be used for decoding.

分解部２１０は、入力された符号Ｉ_ｅを、形状符号Ｉ_ｓと利得符号Ｉ_ｇに分解する（Ｓ２１０）。重み付き形状復号部２２０は、重み付き形状符号帳１５０を用いて、形状符号Ｉ_ｓを重み付き形状ベクトルＷｃ_ｎ（ｎは１〜Ｎの整数、Ｎは重み付き形状符号帳に格納されている重み付き形状ベクトルの個数）に変換する。重み付け除去演算部２３０は、重み付き形状ベクトルＷｃ_ｎの重みＷを除去し、形状ベクトルｃ_ｎを出力する（Ｓ２３０）。利得復号部２４０は、利得符号帳１７０を用いて、利得符号Ｉ_ｇを利得ｇ_ｍ（ｍは１〜Ｍの整数、Ｍは利得符号帳に格納されている利得の個数）に変換する。乗算部２５０は、形状ベクトルｃ_ｎと利得ｇ_ｍとを乗算して、復号信号（再生雑音信号）ｅ＾を出力する（Ｓ２５０）。 Decomposition unit 210 decomposes the input code _{I e,} the coded shape _{I s} and the gain code _{I g} (S210). Weighted shape decoding unit 220 uses the weighted shape codebook 150, integers coded shape I _s weighted shape vector Wc n _(n a is 1 to N, N is stored in the weighted shape codebook Number of weighted shape vectors). Weighting removal operation unit 230 removes the weight W of a weighted shape vector Wc _n, and outputs the shape vector _{c n} (S230). Gain decoding section 240, by using a gain codebook 170, and converts the gain code I _g gain g _{m (m} is an integer of 1 to M, M is the number of gain stored in the gain codebook) to. Multiplication section 250 multiplies the shape vector _{c n} and the gain _{g m,} the decoded signal (reproduced noise signal) e ^ (S250).

拡張復号器２００が重み付きの符号帳を用いることができる理由は、重み付け除去演算部２３０にある。例えば、重み付け除去演算部２３０では、符号化のときに用いた重みを付与する行列（フィルタ）の逆行列（逆フィルタ）を用いればよい。例えば、式（２）の行列Ｗの逆行列Ｕは、

と表現されるＫ行（Ｋ＋ｒ）列の行列である。式（１１）の場合は、1タップのフィルタであるため、ｒ＝１である。重み付け除去は、重み付き形状ベクトルにこの逆行列Ｕを乗算し、形状ベクトルを出力することである。 The reason why the extended decoder 200 can use a weighted codebook resides in the weight removal operation unit 230. For example, the weight removal arithmetic unit 230 may use an inverse matrix (inverse filter) of a matrix (filter) to which weights used for encoding are applied. For example, the inverse matrix U of the matrix W in equation (2) is

Is a matrix of K rows (K + r) columns. In the case of Expression (11), since it is a 1-tap filter, r = 1. The weight removal is to multiply the weighted shape vector by this inverse matrix U and output the shape vector.

式（１１）の逆行列を用いて重み付き形状ベクトルＷｃ_ｎの重みＷを除去し、形状ベクトルｃ_ｎを出力するために、重み付け除去演算部２３０は、初期化手段２３１、加算手段２３２、除算手段２３３、更新手段２３４を備えている。初期化手段２３１は、ｋ＝１、ｆ_ａ＝０とする（Ｓ２３１）。加算手段は、重み付き形状ベクトルＷｃ_ｎのｋ番目の要素ｆ_ｋをｆ_ａに加算し、新しいｆ_ａとする（Ｓ２３２）。除算手段２３３は、形状ベクトルのｎ番目の要素ｐ_ｋを、ｐ_ｋ＝ｆ_ａ／ｂとし、記録する（Ｓ２３３）。更新手段は、ｋがＫよりも小さい場合には、ｋに１を加算し、新しいｋとする（Ｓ２３４１、Ｓ２３４２）。ｋがＫ（Ｋは形状ベクトルのベクトル長）に等しい場合には、処理を終了する。このようにステップＳ２３２〜Ｓ２３４２を、ｐ_１，…，ｐ_Ｋ（ただし、Ｋは形状ベクトルのベクトル長）のすべてを求めるまで繰り返す。 Removing the weight W of a weighted shape vector Wc _n by using the inverse matrix of Equation (11), in order to output the shape vector _{c n,} the weighting removal operation unit 230, the initialization unit 231, addition unit 232, division Means 233 and update means 234 are provided. The initialization unit 231 sets k = 1 and f _a = 0 (S231). Addition means, the k th element _{f k} of the weighted shape vector Wc _n is added to _{f a,} the new _{f a} (S232). The dividing unit 233 records the n-th element p _k of the shape vector as p _k = f _a / b (S233). When k is smaller than K, the updating unit adds 1 to k to obtain a new k (S2341, S2342). If k is equal to K (K is the vector length of the shape vector), the process ends. In this way, steps S232 to S2342 are repeated until all of p ₁ ,..., P _K (where K is the vector length of the shape vector) is obtained.

第１実施形態の復号装置は、このように重み付け除去演算部２３０があるため、符号化器と同じ重み付き形状符号帳１５０を用いることができる。そして、符号化に用いる重み付き形状符号帳を復号にも用いれば、符号化装置と復号装置の両方を備える双方向通信器の符号帳のためのメモリの使用量を、大幅に低減（具体的には、ほぼ半減）できる。 Since the decoding apparatus according to the first embodiment includes the weight removal operation unit 230 as described above, the same weighted shape codebook 150 as that of the encoder can be used. If the weighted shape codebook used for encoding is also used for decoding, the amount of memory used for the codebook of the bidirectional communication device including both the encoding device and the decoding device is significantly reduced (specifically Can be almost halved).

また、一般的に復号に要する演算量は符号探索を行う符号化時に要する演算量より著しく小さい。第１実施形態の場合、符号化に要する演算量と復号に要する演算量は、ほぼＮ：１の関係にある。つまり、復号のための演算量が微妙に増えたとしても、符号化を含めた全体の演算量にはほとんど影響がない。上述のように第１実施形態の符号化装置は、品質を特許文献１と同等に保ちながら演算量を減らしている。したがって、全体的には、メモリの使用量を大幅に減らしながら、演算量も品質も同等を保つことができる。 In general, the amount of calculation required for decoding is significantly smaller than the amount of calculation required for encoding for code search. In the case of the first embodiment, the amount of calculation required for encoding and the amount of calculation required for decoding are substantially in a relationship of N: 1. That is, even if the calculation amount for decoding increases slightly, there is almost no influence on the total calculation amount including encoding. As described above, the encoding device of the first embodiment reduces the amount of calculation while maintaining the quality equivalent to that of Patent Document 1. Therefore, overall, the amount of calculation and the quality can be kept equal while greatly reducing the amount of memory used.

また、図７には、本発明の効果を示すために、基本符号部にＧ．７１１を用いて第１実施形態の符号化装置と復号装置を用いた場合の再生信号（復号信号）のスペクトル解析例を示す。図７（Ａ）は原音声（破線）とその音声を、Ｇ．７１１を用いて符号化して復号した再生音（実線）、図７（Ｂ）は原音声（破線）とその音声を第１実施形態の符号化装置を用いて符号化して復号した再生音（破線）のスペクトル解析結果である。Ｇ．７１１単体を用いた場合では、現音に存在する高域の調波構造が量子化雑音に埋もれているが、第１実施形態を用いれば高域（２５００ＫＨｚ以上）の調波構造が再現されていることが分かる。この結果は、特許文献１と同等である。したがって、符号化と復号の品質を維持しながら演算量を少なくできる。 FIG. 7 shows G. in the basic code part in order to show the effect of the present invention. 7A shows an example of spectrum analysis of a reproduction signal (decoded signal) when the encoding apparatus and decoding apparatus of the first embodiment are used. FIG. 7A shows the original voice (dashed line) and its voice. Reproduced sound (solid line) encoded using 711 and decoded (solid line), FIG. 7B shows the original sound (broken line) and the reproduced sound obtained by encoding and decoding the sound using the encoding device of the first embodiment (broken line). ) Spectral analysis results. G. When the 711 unit is used, the high-frequency harmonic structure existing in the current sound is buried in the quantization noise. However, if the first embodiment is used, the high-frequency (2,500 KHz or higher) harmonic structure is reproduced. I understand that. This result is equivalent to Patent Document 1. Therefore, the amount of calculation can be reduced while maintaining the quality of encoding and decoding.

［第２実施形態］
［符号化装置］
図８に第２実施形態の符号化装置の機能構成例を示す。第１実施形態の利得計算部１３０は、利得符号帳１７０から利得符号Ｉ_ｇを探索した。第２実施形態の利得計算部３３０は、計算により利得符号Ｉ_ｇを求める。そこで、第２実施形態の拡張符号化器３００は、利得計算部１３０と利得符号帳１７０の代わりに、利得計算部３３０を備えている。また、利得計算部３３０は、量子化手段３３５を有している。その他の構成は、図２と同じである。また、図９に第２実施形態の拡張符号化器３００の処理フローの例を示す。図３に示した第１実施形態の拡張符号化器１００の処理フローと、ステップＳ１１０、Ｓ１２０、Ｓ１４０は同じである。以下では、第１実施形態との違いである利得計算部３３０について説明する。 [Second Embodiment]
[Encoding device]
FIG. 8 shows a functional configuration example of the encoding apparatus according to the second embodiment. Gain calculating section 130 of the first embodiment, it explored the gain code _{I g} from the gain codebook 170. Gain calculator 330 of the second embodiment determines the gain code I _g by calculation. Therefore, the extension encoder 300 according to the second embodiment includes a gain calculation unit 330 instead of the gain calculation unit 130 and the gain codebook 170. The gain calculation unit 330 includes a quantization unit 335. Other configurations are the same as those in FIG. FIG. 9 shows an example of the processing flow of the extension encoder 300 of the second embodiment. The processing flow of the extension encoder 100 of the first embodiment shown in FIG. 3 is the same as steps S110, S120, and S140. Below, the gain calculation part 330 which is the difference with 1st Embodiment is demonstrated.

利得計算部３３０の量子化手段３３５は、射影関数ｆ（ｘ）を用いて最適な理想利得ｇ^〜 _ｏｐｔから、利得符号Ｉ_ｇを求める。射影関数ｆ（ｘ）は、直線、曲線、あるいは直線と曲線を組み合わせた連続関数である。量子化手段３３５は、まず射影関数の演算ｆ（ｇ^〜 _ｏｐｔ）を行う（Ｓ３３１）。次に、四捨五入演算を行い次のように利得符号Ｉ_ｇを求める（Ｓ３３２）。
Ｉ_ｇ＝ｒｏｕｎｄ（ｆ（ｇ^〜 _ｏｐｔ））（１２）
利得計算部３３０は、求めた利得符号Ｉ_ｇを出力する（Ｓ３３３）。 Quantization means of the gain calculator 330 335, the optimum ideal gain ^g _{~ opt} using a projection function f (x), determining the gain code _{I g.} The projection function f (x) is a straight line, a curve, or a continuous function that combines a straight line and a curve. The quantization means 335 first performs a projection function calculation f (g ^to _opt ) (S331). Next, the rounding operation as follows seek gain code _{I g} (S332).
_{^{_{I g = round (f (g}}} ~ opt)) (12)
Gain calculating unit 330 outputs a gain code _{I g} obtained (S333).

射影関数の具体例としては、次式のような双曲線がある。

そして、例えば、利得符号の数Ｍが６４の場合は、
ａ＝１３００
ｂ＝−１
ｃ＝１．０７
ｄ＝１９．２７
とすればよい。なお、射影関数は、連続関数であればよく、ｘの値に応じて複数の関数を切り替えてもよい。
利得計算部３３０は、このように射影関数を利用して計算するので、演算量を特許文献１よりも低減できる。また、利得符号帳が必要ないので、メモリ使用量を低減できる。 As a specific example of the projection function, there is a hyperbola as follows.

For example, when the number M of gain codes is 64,
a = 1300
b = -1
c = 1.07
d = 19.27
And it is sufficient. The projection function may be a continuous function, and a plurality of functions may be switched according to the value of x.
Since the gain calculation unit 330 calculates using the projection function in this way, the amount of calculation can be reduced as compared with Patent Document 1. Further, since no gain codebook is required, the amount of memory used can be reduced.

［復号装置］
図１０に第２実施形態の復号装置の構成例を示す。第１実施形態の利得復号部２４０は、利得符号帳１７０を用いて利得ｇ_ｍを求めた。第２実施形態の利得復号部４４０は、計算により利得ｇ_ｍを求める。そこで、第２実施形態の拡張復号器４００は、利得復号部２４０と利得符号帳１７０の代わりに、利得復号部４４０を備えている。その他の構成は、図５と同じである。また、図１１に第２実施形態の拡張復号器４００の処理フローの例を示す。図６に示した第１実施形態の拡張復号器２００の処理フローと、ステップＳ２１０、Ｓ２２０、Ｓ２３０、Ｓ２５０は同じである。以下では、第１実施形態との違いである利得復号部４４０について説明する。 [Decoding device]
FIG. 10 shows a configuration example of the decoding apparatus according to the second embodiment. Gain decoding unit 240 of the first embodiment, to determine the gain _{g m} using a gain codebook 170. The gain decoding unit 440 of the second embodiment obtains the gain g _m by calculation. Therefore, the extended decoder 400 according to the second embodiment includes a gain decoding unit 440 instead of the gain decoding unit 240 and the gain codebook 170. Other configurations are the same as those in FIG. FIG. 11 shows an example of the processing flow of the extended decoder 400 of the second embodiment. The processing flow of the extended decoder 200 of the first embodiment shown in FIG. 6 is the same as steps S210, S220, S230, and S250. Below, the gain decoding part 440 which is a difference with 1st Embodiment is demonstrated.

利得復号部４４０は、符号化で用いた射影関数の逆関数ｆ^−１（ｙ）を用いて、利得符号Ｉ_ｇから利得ｇ_ｍを求める。具体的には、利得復号部４４０は、
ｇ_ｍ＝ｆ^−１（Ｉ_ｇ）（１４）
の演算によって利得ｇ_ｍを求める（Ｓ４４０）。 Gain decoding section 440, inverse projection functions used in coding ^f -1 with (y), determining the gain _{g m} from the gain code _{I g.} Specifically, the gain decoding unit 440
g _m = f ⁻¹ (I _g ) (14)
The gain g _m is obtained by the calculation of (S440).

射影関数として式（１３）の双曲線を用いた場合であれば、逆関数は、

このときのａ、ｂ、ｃ、ｄは、利得計算部３３０と同じ値を用いる。 If the hyperbola of equation (13) is used as the projection function, the inverse function is

At this time, the same values as those of the gain calculation unit 330 are used for a, b, c, and d.

このような構成と処理方法であれば、第１実施形態と同程度の演算量で、メモリ使用量を低減できる。また、符号化と復号の品質にかかわる処理は第１実施形態と同じなので、図７に示したスペクトル解析結果と同じ結果が期待できる。したがって、したがって、符号化と復号の品質を維持しながら演算量を少なくできる。 With such a configuration and processing method, the memory usage can be reduced with the same amount of computation as in the first embodiment. Further, since the processing related to the quality of encoding and decoding is the same as that in the first embodiment, the same result as the spectrum analysis result shown in FIG. 7 can be expected. Therefore, the amount of computation can be reduced while maintaining the quality of encoding and decoding.

［変形例］
本発明の復号方法の大切なポイントの１つは、ステップＳ２２０、Ｓ２３０である。また、これらのステップを実行するために必要な構成部は、重み付き形状復号部２２０、重み付き形状符号帳１５０、重み付け除去演算部２３０である。その他の構成部は、第１実施形態の復号装置（図５）や第２実施形態の復号装置（図１０）に限定する必要はない。図１２に第２実施形態の変形例の復号装置の機能構成例を示す。この構成では、図１０の乗算部２５０の代わりに除算部６５０があり、加算部９４０の代わりに加算部６４０がある。除算部６５０は、再生基本信号ｓ＾_ｂを利得ｇ_ｍで除算する。加算部６４０は、除算部６５０の出力ｓ＾_ｂ／ｇ_ｍと形状ベクトルｃ_ｎを加算し、再生信号ｅ＾’を得る。 [Modification]
One important point of the decoding method of the present invention is steps S220 and S230. Further, components necessary for executing these steps are a weighted shape decoding unit 220, a weighted shape codebook 150, and a weighting removal calculating unit 230. The other components need not be limited to the decoding device of the first embodiment (FIG. 5) or the decoding device of the second embodiment (FIG. 10). FIG. 12 shows a functional configuration example of a decoding device according to a modification of the second embodiment. In this configuration, there is a division unit 650 instead of the multiplication unit 250 in FIG. 10, and an addition unit 640 instead of the addition unit 940. The division unit 650 divides the reproduction basic signal ＾ _b by the gain g _m . Adding section 640 adds the outputs _s ^ b / _{g m} and shape vector _{c n} of the divider 650 to obtain a reproduction signal e ^ '.

図１０の復号装置の再生信号ｅ＾は、形状ベクトルｃ_ｎと利得ｇ_ｍとの積と再生基本信号ｓ＾_ｂとの和である。したがって、再生信号ｅ＾と再生信号ｅ＾’とは、
ｅ＾’＝ｅ＾／ｇ_ｍ（１６）
の関係となる。つまり、再生信号ｅ＾’は、ボリュームは異なるが波形は再生信号ｅ＾と同じ信号である。符号化や復号の処理では再生される信号のボリュームは他の処理で調整されるものであり、波形が再生されていれば品質上は問題ない。したがって、図１２に示した構成でも第２実施形態の復号装置と同等の効果を得ることができる。 ^ Reproduced signal e of the decoding device in FIG. 10 is the sum of the product of the shape vector c _n and the gain g _m and the reproduction fundamental signal s ^ _b. Therefore, the reproduction signal e ^ and the reproduction signal e ^ '
_{e ^ '= e ^ / g} m (16)
It becomes the relationship. That is, the reproduction signal e ^ 'is the same signal as the reproduction signal e ^ although the volume is different. In the encoding and decoding processes, the volume of the reproduced signal is adjusted by other processes, and there is no problem in quality if the waveform is reproduced. Therefore, even with the configuration shown in FIG. 12, the same effect as that of the decoding device of the second embodiment can be obtained.

このように、図６や図１１に示したステップＳ２２０、Ｓ２３０有する復号方法、図５や図１０に示した重み付き形状復号部２２０、重み付き形状符号帳１５０、重み付け除去演算部２３０を備える復号装置であれば、本発明の効果を得ることができる。 As described above, the decoding method including steps S220 and S230 illustrated in FIGS. 6 and 11, the decoding including the weighted shape decoding unit 220, the weighted shape codebook 150, and the weighting removal calculating unit 230 illustrated in FIGS. 5 and 10. If it is an apparatus, the effect of this invention can be acquired.

図１３に、コンピュータの機能構成例を示す。なお、本発明の符号化方法や復号方法は、コンピュータの記録部２０２０に、上記方法の各ステップを実行させるプログラムを読み込ませ、制御部２０１０、入力部２０３０、出力部２０４０などに動作させることで実施できる。また、コンピュータに読み込ませる方法としては、プログラムをコンピュータ読み取り可能な記録媒体に記録しておき、記録媒体からコンピュータに読み込ませる方法、サーバ等に記録されたプログラムを、電気通信回線等を通じてコンピュータに読み込ませる方法などがある。 FIG. 13 shows a functional configuration example of a computer. In the encoding method and decoding method of the present invention, a program for causing the recording unit 2020 of the computer to execute each step of the above method is read and operated by the control unit 2010, the input unit 2030, the output unit 2040, and the like. Can be implemented. In addition, as a method of causing the computer to read, the program is recorded on a computer-readable recording medium, and the program recorded on the server or the like is read into the computer through a telecommunication line or the like. There is a method to make it.

電話網などに用いられているＩＲＳ周波数特性を説明するための特性曲線図。The characteristic curve figure for demonstrating the IRS frequency characteristic used for the telephone network. 第１実施形態の符号化装置の機能構成例を示す図。The figure which shows the function structural example of the encoding apparatus of 1st Embodiment. 第１実施形態の符号化装置の拡張符号化器の処理フローの例を示す図。The figure which shows the example of the processing flow of the extended encoder of the encoding apparatus of 1st Embodiment. 重み付け部の周波数特性を示す図。The figure which shows the frequency characteristic of a weighting part. 第１実施形態の復号装置の機能構成例を示す図。The figure which shows the function structural example of the decoding apparatus of 1st Embodiment. 第１実施形態の復号装置の拡張復号器の処理フローの例を示す図。The figure which shows the example of the processing flow of the extended decoder of the decoding apparatus of 1st Embodiment. 基本符号部にＧ．７１１を用い、第１実施形態の符号化装置、復号装置を用いた場合の復号結果のスペクトル解析例を示す図。G. 7 is a diagram illustrating a spectrum analysis example of a decoding result when the encoding device and the decoding device according to the first embodiment are used. 第２実施形態の符号化装置の機能構成例を示す図。The figure which shows the function structural example of the encoding apparatus of 2nd Embodiment. 第２実施形態の符号化装置の拡張符号化器の処理フローの例を示す図。The figure which shows the example of the processing flow of the extended encoder of the encoding apparatus of 2nd Embodiment. 第２実施形態の復号装置の構成例を示す図。The figure which shows the structural example of the decoding apparatus of 2nd Embodiment. 第２実施形態の復号装置の拡張復号器の処理フローの例を示す図。The figure which shows the example of the processing flow of the extended decoder of the decoding apparatus of 2nd Embodiment. 第２実施形態の変形例の復号装置の機能構成例を示す図。The figure which shows the function structural example of the decoding apparatus of the modification of 2nd Embodiment. コンピュータの機能構成例を示す図。The figure which shows the function structural example of a computer.

Explanation of symbols

１００拡張符号化器１１０重み付け部
１２０形状計算部１２１初期化手段
１２２内積手段１２３理想利得計算手段
１２４距離計算手段１２５確認手段
１３０利得計算部１３５探索手段
１４０多重化部１５０重み付き形状符号帳
１６０形状パワ逆数帳１７０利得符号帳
２００拡張復号器２１０分解部
２２０形状復号部２３０除去演算部
２３１初期化手段２３２加算手段
２３３除算手段２３４更新手段
２４０利得復号部２５０乗算部 DESCRIPTION OF SYMBOLS 100 Extended encoder 110 Weighting part 120 Shape calculation part 121 Initialization means 122 Inner product means 123 Ideal gain calculation means 124 Distance calculation means 125 Confirmation means 130 Gain calculation part 135 Search means 140 Multiplexing part 150 Weighted shape codebook 160 Shape Power reciprocal book 170 Gain code book 200 Extended decoder 210 Decomposition unit 220 Shape decoding unit 230 Removal operation unit 231 Initialization unit 232 Addition unit 233 Division unit 234 Update unit 240 Gain decoding unit 250 Multiplication unit

Claims

A decoding method for decoding a shape code indicating a shape vector,
A weighted shape decoding step of converting the shape code into a weighted shape vector using a weighted shape codebook;
A weighting removal calculating step of removing weights of the weighted shape vector and outputting the shape vector;
A decryption method.

A decoding method for decoding a code composed of a shape code indicating a shape vector and a gain code indicating a gain,
A decomposing step of decomposing the input code into a shape code and a gain code;
A weighted shape decoding step of converting the shape code into a weighted shape vector using a weighted shape codebook;
A weighting removal calculating step of removing weights of the weighted shape vector and outputting the shape vector;
A gain decoding step of converting the gain code into a gain;
A multiplication step of multiplying the shape vector and the gain and outputting a decoded signal;
A decryption method.

The decoding method according to claim 1 or 2, comprising:
The decoding method according to claim 1, wherein the weighted shape codebook is the same as the weighted shape codebook used for encoding.

A decoding method according to any one of claims 1 to 3,
The weighting removal calculating step is a step of outputting, as the shape vector, a result obtained by multiplying the weight of the weighted shape vector by the inverse matrix U of the weight matrix W used for encoding. .

The decoding method according to claim 4, wherein
The inverse matrix U is

A decoding method characterized by being expressed as:

The decoding method according to claim 5, wherein
The weighting removal calculating step includes:
an initialization sub-step with k = 1 and f _a = 0;
An adding substep of component k f _k of the weighted shape vector is added to f _a, the new f _a,
A division sub-step in which the k-th element p _k of the shape vector is set to p _k = f _a / b;
an update step of adding 1 to k to give a new k;
Have
The decoding method, wherein the addition sub-step, the division sub-step, and the update step are repeated until all of p ₁ ,..., P _K (where K is a vector length of a shape vector) is obtained.

A code encoded by the first encoding method (hereinafter referred to as a “basic code”) and a code obtained by encoding an error generated by the encoding by the first encoding method by the second encoding method ( Hereinafter, a decoding method for decoding “quality extension code”),
The quality extension decoding step of decoding a quality extension code and outputting a reproduction quality extension signal includes the steps of the decoding method according to any one of claims 1 to 6,
A basic decoding step of decoding a basic code and outputting a reproduced basic signal;
A decoding method comprising: an addition step of adding the reproduction basic signal and the reproduction quality extension signal to output a reproduction signal.

A decoder for decoding a shape code indicating a shape vector,
A weighted shape codebook that associates shape codes with weighted shape vectors;
A weighted shape decoding unit that converts the shape code into a weighted shape vector using the weighted shape codebook;
Removing a weight of the weighted shape vector and outputting a shape vector;
A decoder.

A decoder for decoding a code composed of a shape code indicating a shape vector and a gain code indicating a gain,
A decomposition unit that decomposes the input code into a shape code and a gain code;
A weighted shape codebook that associates shape codes with weighted shape vectors;
A weighted shape decoding unit that converts the shape code into a weighted shape vector using the weighted shape codebook;
Removing a weight of the weighted shape vector and outputting a shape vector;
A gain codebook that associates the gain code with the gain;
A gain decoding unit that converts the gain code into a gain using the gain codebook;
A multiplier that multiplies the shape vector and the gain to output a reproduction quality extension signal;
A decoder.

The decoder according to claim 8 or 9, comprising:
The decoder according to claim 1, wherein the weighted shape codebook is the same as the weighted shape codebook used for encoding.

A decoder according to any one of claims 8 to 10,
The weight removal operation unit outputs a result obtained by multiplying the weight of the weighted shape vector by the inverse matrix U of the weight matrix W used for encoding, as the shape vector. .

The decoder according to claim 11, comprising:
The inverse matrix U is

Decoder that can be expressed as

The decoder according to claim 12, comprising:
The weight removal operation unit
initialization means for k = 1 and f _a = 0;
Adding means for the k-th element f _k of the weighted shape vector is added to f _a, the new f _a,
A dividing means for setting the k-th element p _k of the shape vector to p _k = f _a / b;
an updating means for adding 1 to k and setting it as a new k;
It said updating means and said adding means and said dividing means has _{a, p 1, ..., p K} ( although, K is the shape vector length of the vector), characterized in that to Kaee treated repeatedly until finding all Decoder.

A code encoded by the first encoding method (hereinafter referred to as a “basic code”) and a code obtained by encoding an error generated by the encoding by the first encoding method by the second encoding method ( Hereinafter, it is a decoding apparatus that decodes “quality extension code”),
A decoder according to any one of claims 8 to 13, comprising a decoder according to any one of claims 8 to 13, as a quality extension decoder that decodes a quality extension code and outputs a reproduction quality extension signal.
A basic decoder for decoding a basic code and outputting a reproduced basic signal;
A decoding apparatus comprising: an adder that adds the reproduction basic signal and the reproduction quality extension signal and outputs a reproduction signal.

An encoding method for converting an input signal into a code composed of a shape code indicating a shape vector and a gain code indicating a gain,
A weighting step of giving a weight to the input signal e and outputting a weighted noise signal We;
As the distance between the weighted noise signal We is minimized to obtain the optimal ideal gain g ^~ _opt with selecting the weighted shape vector Wc _n weighted in shape codebook, the shape representing the weighted shape vector A shape calculation step for outputting a code _Is and an optimum ideal gain g ^to _opt ;
As the distance between the optimal ideal gain g ^~ _opt is minimized, and the gain calculation step of selecting a gain g _m in the gain codebook, and outputs the gain code I _g indicating the gain,
A multiplexing step of multiplexing the shape code I _s and the gain code _Ig and outputting the code I _e ,
The shape calculation step includes:
an initialization sub-step with n = 1 and d _smax = 0;
An inner product sub-step for recording an inner product result (Wc _n ) ^t (We) of an _nth weighted shape vector Wcn in the weighted shape codebook and a weighted noise signal We as a variable q _n ;
An ideal gain calculation sub-step for recording the product of the _nth element (1 / ‖Wc _n ‖ ² ) in the weighted shape power reciprocal book and the variable q _n as ideal gains g ^to _n ;
A distance calculation substep for recording a product of the ideal gains g ^to _n and the variable q _n as a distance d _s ;
If d _smax <d _s , a confirmation sub-step with d _s as d _smax , n as the optimal code i _opt , and g ^through _n as the optimal ideal gains g ^through _opt ,
Have
Repeating the inner product substep, the ideal gain calculation substep, the distance calculation substep, and the confirmation substep until n is the number N of weighted shape vectors in the weighted shape codebook,
The repetition at the end of the _{i opt} and shape sign _{I s,}
A coding method characterized in that the shape code _Is and the optimum ideal gain g ^to _opt at the end of repetition are output.

An encoding method for converting an input signal into a code composed of a shape code indicating a shape vector and a gain code indicating a gain,
A weighting step of giving a weight to the input signal e and outputting a weighted noise signal We;
As the distance between the weighted noise signal We is minimized, seeking the ideal gain g ^~ _opt for suitable with selecting the weighted shape vector Wc _n weighted in shape codebook, the shape representing the weighted shape vector A shape calculation step for outputting a code _Is and an optimum ideal gain g ^to _opt ;
As the distance between the optimal ideal gain g ^~ _opt is minimized, and the gain calculation step of selecting a gain g _m in the gain codebook, and outputs the gain code I _g indicating the gain,
A multiplexing step of multiplexing the shape code I _s and the gain code _Ig and outputting the code I _e ,
The gain calculating step includes:
Let g _x be the gain of the xth smallest value in the gain codebook,
In order from a small gain of the value of the gain codebook ^searches for the first x satisfying _{_{_{g ~ opt <(g x +}}} g x + 1) / 2,
An encoding method, wherein x is output as a gain code _Ig .

An encoder that converts an input signal into a code composed of a shape code indicating a shape vector and a gain code indicating a gain,
A weighted shape codebook that associates shape codes with weighted shape vectors;
A weighted shape power reciprocal book that records the reciprocal of the weighted shape vector power;
A gain codebook that associates the gain code with the gain;
A weighting unit that gives a weight to the input signal e and outputs a weighted noise signal We;
As the distance between the weighted noise signal We is minimized to obtain the optimal ideal gain g ^~ _opt with selecting the weighted shape vector Wc _n weighted in shape codebook, the shape representing the weighted shape vector A shape calculation unit for outputting a code _Is and an optimum ideal gain g ^to _opt ;
As the distance between the optimal ideal gain g ^~ _opt is minimized, and the gain calculation unit that selects the gain g _m in the gain codebook, and outputs the gain code I _g indicating the gain,
A multiplexing unit that multiplexes the shape code _Is and the gain code _Ig and outputs the code _Ie ;
The shape calculator is
initialization means for n = 1 and d _smax = 0,
Inner product means for recording the inner product result (Wc _n ) ^t (We) of the _nth weighted shape vector Wcn in the weighted shape codebook and the weighted noise signal We as a variable q _n ;
And n-th element of the weighted shapes power reciprocal within book (1 / ‖Wc _n ‖ ^2), the product of the variables q _n, and the ideal gain calculation means for recording as an ideal gain g ^~ _n,
A distance calculation means for recording a product of the ideal gains g ^to _n and the variable q _n as a distance d _s ;
If d _smax <d _s , confirmation means for d _s to be d _smax , n to be an optimal code i _opt , and g ^to _n to be optimal ideal gains g ^to _opt
Have
Causing the inner product means, the ideal gain calculation means, the distance calculation means, and the confirmation means to repeatedly perform processing until n becomes the number N of weighted shape vectors in the weighted shape codebook,
The repetition at the end of the _{i opt} and shape sign _{I s,}
An encoder characterized by having a shape code _Is and an optimum ideal gain g ^to _opt at the end of repetition as outputs.

An encoder that converts an input signal into a code composed of a shape code indicating a shape vector and a gain code indicating a gain,
A weighted shape codebook that associates shape codes with weighted shape vectors;
A weighted shape power reciprocal book that records the reciprocal of the weighted shape vector power;
A gain codebook that associates the gain code with the gain;
A weighting unit that gives a weight to the input signal e and outputs a weighted noise signal We;
As the distance between the weighted noise signal We is minimized to obtain the optimal ideal gain g ^~ _opt with selecting the weighted shape vector Wc _n weighted in shape codebook, the shape representing the weighted shape vector A shape calculation unit for outputting a code _Is and an optimum ideal gain g ^to _opt ;
As the distance between the optimal ideal gain g ^~ _opt is minimized, and the gain calculation unit that selects the gain g _m in the gain codebook, and outputs the gain code I _g indicating the gain,
A multiplexing unit that multiplexes the shape code _Is and the gain code _Ig and outputs the code _Ie ;
The gain calculator is
Let g _x be the gain of the xth smallest value in the gain codebook,
In order from a small gain of the value of the gain codebook ^searches for the first x satisfying _{_{_{g ~ opt <(g x +}}} g x + 1) / 2,
encoder and outputs a x as a gain code I _g.

The program which makes a computer operate | move each step of the method in any one of Claim 1 to 7, 15, 16.

A computer-readable recording medium on which the program according to claim 19 is recorded.