JP5235168B2

JP5235168B2 - Encoding method, decoding method, encoding device, decoding device, encoding program, decoding program

Info

Publication number: JP5235168B2
Application number: JP2009148887A
Authority: JP
Inventors: 公孝堤; 茂明佐々木; 祐介日和▲崎▼; 勝宏福井
Original assignee: Nippon Telegraph and Telephone Corp
Current assignee: Nippon Telegraph and Telephone Corp
Priority date: 2009-06-23
Filing date: 2009-06-23
Publication date: 2013-07-10
Anticipated expiration: 2029-06-23
Also published as: JP2011007870A

Description

この発明は、音声信号等の入力された信号をベクトル量子化により符号化する技術及び圧縮された符号を復号する技術に関する。 The present invention relates to a technique for encoding an input signal such as a speech signal by vector quantization and a technique for decoding a compressed code.

カスタネットの楽音のように時間的にレベルが急激に変動する音響信号を周波数領域におけるベクトル量子化により符号化すると、入力した音響信号中のレベルが小さい時間区間で、入力信号に対する符号化歪が大きくなり、これが聴感を著しく損なう雑音として知覚される。このような雑音はプリエコーと呼ばれている。特許文献１に記載された符号化方法は、時間領域で符号化を行う第一符号化方法と、周波数領域で符号化を行う第二符号化方法とを備え、入力信号の性質に応じて第一符号化方法と第二符号化方法の何れかを選択して符号化していた（例えば、特許文献１参照。）。 When an acoustic signal whose level varies rapidly in time, such as a musical tone of castanets, is encoded by vector quantization in the frequency domain, the encoding distortion to the input signal is reduced in a time interval where the level in the input acoustic signal is small. This is perceived as noise that significantly impairs hearing. Such noise is called pre-echo. The encoding method described in Patent Literature 1 includes a first encoding method that performs encoding in the time domain and a second encoding method that performs encoding in the frequency domain. Either one encoding method or the second encoding method is selected and encoded (for example, refer to Patent Document 1).

入力信号の性質に応じて、時間領域で符号化を行う第一符号化方法と周波数領域で符号化を行う第二符号化方法とを切り替えることにより、プリエコーの発生を低減することができる。これにより、入力信号の特徴や性質に応じた、効率の良い高品質な復号信号を得ることができた。 The occurrence of pre-echo can be reduced by switching between the first encoding method for encoding in the time domain and the second encoding method for encoding in the frequency domain, depending on the nature of the input signal. As a result, it was possible to obtain an efficient and high-quality decoded signal according to the characteristics and properties of the input signal.

特許第３３１７４７０号公報Japanese Patent No. 3317470

特許文献１に記載された符号化器によれば、入力信号の性質に応じて最適な符号化方法で符号化できるが、第一符号化方法及びと第二符号化方法で用いる量子化方法が異なり、複数のコードブックを保持する必要があり、多くの記憶領域が必要となるという課題があった。 According to the encoder described in Patent Document 1, encoding can be performed with an optimal encoding method according to the nature of the input signal, but the quantization method used in the first encoding method and the second encoding method is On the other hand, there is a problem that it is necessary to hold a plurality of codebooks, and a lot of storage areas are required.

上記の課題を解決するために、符号化においては、入力された音響信号を一定時間長に対応するサンプルをまとめて構成されるフレーム単位で処理を行う。時間領域符号化方法と周波数領域符号化方法とのうち、復号した際により品質が高い音響信号を得ることができる符号化方法を選択して、その選択結果である信号識別情報を出力する。時間領域符号化方法が選択された場合には、フレーム単位で構成された音響信号を第一コードブックを用いてベクトル量子化により符号化して、信号量子化符号インデックスを出力する。周波数領域符号化方法が選択された場合には、フレーム単位で構成された音響信号を周波数領域信号に時間周波数変換し、第一コードブックを用いたベクトル量子化により符号化して、その符号化結果である信号量子化符号インデックスを出力する。 In order to solve the above problem, in encoding, an input acoustic signal is processed in units of frames configured by collecting samples corresponding to a predetermined time length. Of the time domain encoding method and the frequency domain encoding method, an encoding method capable of obtaining an acoustic signal with higher quality when decoded is selected, and signal identification information as a selection result is output. When the time domain encoding method is selected, an acoustic signal configured in units of frames is encoded by vector quantization using the first codebook, and a signal quantization code index is output. When a frequency domain encoding method is selected, an acoustic signal composed of frames is time-frequency converted into a frequency domain signal, encoded by vector quantization using the first codebook, and the encoding result The signal quantization code index is output.

復号においては、符号分離部から出力される信号量子化符号インデックス及び信号識別情報を用いる。信号識別情報が時間領域符号化方法を示す場合には、信号量子化符号インデックスを第一コードブックを用いて逆ベクトル量子化により復号し、時間領域復号音響信号を得る。信号識別情報が周波数領域符号化方法を示す場合には、信号量子化符号インデックスを第一コードブックを用いて逆ベクトル量子化により復号した後、周波数時間変換を行い、時間領域復号信号に変換する。最後に時間領域復号信号を重畳加算することにより、復号信号を出力する。 In decoding, the signal quantization code index and signal identification information output from the code separation unit are used. When the signal identification information indicates a time domain encoding method, the signal quantization code index is decoded by inverse vector quantization using the first codebook to obtain a time domain decoded acoustic signal. If the signal identification information indicates a frequency domain encoding method, the signal quantization code index is decoded by inverse vector quantization using the first codebook, and then frequency-time converted to convert it to a time domain decoded signal. . Finally, the decoded signal is output by superimposing and adding the time domain decoded signal.

時間領域符号化方法と周波数領域符号化方法とで同一のコードブックを用いることにより、従来よりも記憶領域を小さくすることができる。 By using the same codebook for the time domain encoding method and the frequency domain encoding method, the storage area can be made smaller than before.

第一実施形態の符号化装置の例の機能ブロック図。The functional block diagram of the example of the encoding apparatus of 1st embodiment. 第一実施形態の符号化方法の例の流れ図。The flowchart of the example of the encoding method of 1st embodiment. 第一実施形態の復号装置の例の機能ブロック図。The functional block diagram of the example of the decoding apparatus of 1st embodiment. 第一実施形態の復号方法の例の流れ図。The flowchart of the example of the decoding method of 1st embodiment. 第二実施形態の符号化装置の例の機能ブロック図。The functional block diagram of the example of the encoding apparatus of 2nd embodiment. 第二実施形態の符号化方法の例の流れ図。The flowchart of the example of the encoding method of 2nd embodiment. 第二実施形態の復号装置の例の機能ブロック図。The functional block diagram of the example of the decoding apparatus of 2nd embodiment. 第二実施形態の復号方法の例の流れ図。The flowchart of the example of the decoding method of 2nd embodiment. 第三実施形態の符号化装置の例の機能ブロック図。The functional block diagram of the example of the encoding apparatus of 3rd embodiment. 第三実施形態の符号化方法の例の流れ図。The flowchart of the example of the encoding method of 3rd embodiment. 第三実施形態の復号装置の例の機能ブロック図。The functional block diagram of the example of the decoding apparatus of 3rd embodiment. 第三実施形態の復号方法の例の流れ図。The flowchart of the example of the decoding method of 3rd embodiment. サブバンド分割およびサブバンド合成の模式図。The schematic diagram of subband division and subband synthesis. 動的ビット割当の例の流れ図。6 is a flowchart of an example of dynamic bit allocation. 動的ビット割当の模式図。Schematic diagram of dynamic bit allocation. 動的ビット割当に用いるビット割当テーブルの図。The figure of the bit allocation table used for dynamic bit allocation.

以下、図面を参照してこの発明の実施形態を詳細に説明する。 Hereinafter, embodiments of the present invention will be described in detail with reference to the drawings.

［第一実施形態］
≪符号化≫
図１に第一実施形態の符号化装置の機能ブロックを例示する。図２に第一実施形態の符号化方法の流れ図を例示する。 [First embodiment]
<< Encoding >>
FIG. 1 illustrates functional blocks of the encoding apparatus according to the first embodiment. FIG. 2 illustrates a flowchart of the encoding method of the first embodiment.

第一実施形態の符号化装置は、図１に例示するように、フレーム構成部１、信号識別部２、スイッチ３１、時間領域符号化部４、周波数領域符号化部５、コードブック記憶部６、符号多重化部７を含む。 As illustrated in FIG. 1, the encoding apparatus according to the first embodiment includes a frame configuration unit 1, a signal identification unit 2, a switch 31, a time domain encoding unit 4, a frequency domain encoding unit 5, and a codebook storage unit 6. The code multiplexer 7 is included.

入力された音響信号は、フレーム構成部１に入力される。入力された音響信号は例えばマイクロフォン等のセンサからディジタル形式で取得した音響信号サンプルである。フレーム構成部１は、入力された音響信号を一定時間長に対応するサンプル数まとめて、フレーム単位の音響信号を構成し、フレームごとの音響信号を信号識別部２に送る（ステップＣ１）。例えば、サンプリング周波数を３２ｋＨｚとしたとき、５ｍｓｅｃに対応するフレームは１６０サンプルから構成される。 The input acoustic signal is input to the frame configuration unit 1. The input acoustic signal is an acoustic signal sample acquired in a digital format from a sensor such as a microphone. The frame construction unit 1 collects the input acoustic signals by the number of samples corresponding to a certain time length to construct an acoustic signal for each frame, and sends the acoustic signal for each frame to the signal identification unit 2 (step C1). For example, when the sampling frequency is 32 kHz, a frame corresponding to 5 msec is composed of 160 samples.

信号識別部２は、時間領域符号化方法と周波数領域符号化方法とのうち、復号した際により品質が高い音響信号を得ることができる符号化方法を選択し（ステップＣ２）その選択結果である信号識別情報が時間領域符号化方法を表す場合、スイッチ３１をフレーム単位の入力音響信号が時間領域符号化部４に送られるよう切り替えた上で、信号識別情報を符号多重化部７に送る（ステップＣ３）。一方、信号識別情報が周波数領域符号化方法を表す場合は、スイッチ３１をフレーム単位の入力音響信号が周波数領域符号化部５に送られるよう切り替えた上で、信号識別情報を符号多重化部７に送る（ステップＣ４）。 The signal identification unit 2 selects an encoding method capable of obtaining a higher quality acoustic signal when decoded from among the time domain encoding method and the frequency domain encoding method (step C2), and the selection result. When the signal identification information represents the time domain encoding method, the switch 31 is switched so that the input acoustic signal in units of frames is sent to the time domain encoding unit 4, and then the signal identification information is sent to the code multiplexing unit 7 ( Step C3). On the other hand, when the signal identification information represents the frequency domain encoding method, the switch 31 is switched so that the input acoustic signal in units of frames is sent to the frequency domain encoding unit 5, and then the signal identification information is encoded by the code multiplexing unit 7. (Step C4).

信号識別部２は、復号した際により品質が高い音響信号を得ることができる符号化方法を選択するために、例えばフレームごとの音響信号から特徴量を計算して、その特徴量と所定の閾値とを比較して、その大小により時間領域符号化方法と周波数領域符号化方法の何れかを選択する。識別に用いる特徴量として、例えばフレーム内のサンプルの値の絶対値の最大値と、そのフレーム内のサンプルの値の絶対値の平均値との差分などを利用することができる。この場合、特徴量が所定の閾値よりも大きい場合には、そのフレームにおいて音響信号が急激な時間変化をすると判断し、時間領域符号化方法を選択する。一方、特徴量が所定の閾値以下である場合には、周波数領域符号化方法を選択する。所定の閾値は、求める性能や仕様に基づいて適宜決定される。 In order to select an encoding method capable of obtaining an acoustic signal with higher quality when decoded, the signal identification unit 2 calculates a feature amount from the acoustic signal for each frame, for example, and calculates the feature amount and a predetermined threshold value. And the time domain coding method or the frequency domain coding method is selected depending on the size. As the feature amount used for identification, for example, the difference between the maximum absolute value of the sample values in the frame and the average absolute value of the sample values in the frame can be used. In this case, when the feature amount is larger than the predetermined threshold, it is determined that the acoustic signal undergoes a rapid time change in the frame, and the time domain encoding method is selected. On the other hand, if the feature amount is equal to or less than a predetermined threshold, a frequency domain encoding method is selected. The predetermined threshold is appropriately determined based on the required performance and specifications.

信号識別部２において時間領域符号化方法が選択された場合には、時間領域符号化部４のベクトル量子化部４１が、フレームごとの音響信号を第一コードブックを用いて、ベクトル量子化により符号化し（ステップＣ５）、その符号化結果である信号量子化符号インデックスを符号多重化部７に出力する。 When the time domain coding method is selected in the signal identification unit 2, the vector quantization unit 41 of the time domain coding unit 4 uses the first codebook to generate an acoustic signal for each frame by vector quantization. Encoding is performed (step C5), and a signal quantization code index as a result of the encoding is output to the code multiplexing unit 7.

信号識別部２において周波数領域符号化方法が選択された場合には、周波数領域符号化部５の時間周波数変換部５１は、フレームごとの音響信号を周波数領域信号に変換する（ステップＣ７）。例えば、修正離散コサイン変換（以下、ＭＤＣＴ）や離散コサイン変換（以下、ＤＣＴ）により周波数領域信号に変換する。 When the frequency domain encoding method is selected in the signal identification unit 2, the time-frequency conversion unit 51 of the frequency domain encoding unit 5 converts the acoustic signal for each frame into a frequency domain signal (step C7). For example, the signal is converted into a frequency domain signal by modified discrete cosine transform (hereinafter referred to as MDCT) or discrete cosine transform (hereinafter referred to as DCT).

周波数領域符号化部５のベクトル量子化部５２が、変換された周波数領域信号を入力ベクトルとみなし、第一コードブックを用いてベクトル量子化により符号化して（ステップＣ８）、その符号化結果である信号量子化符号インデックスを符号多重化部７に出力する。 The vector quantization unit 52 of the frequency domain encoding unit 5 regards the converted frequency domain signal as an input vector, encodes it by vector quantization using the first codebook (step C8), A certain signal quantization code index is output to the code multiplexer 7.

時間領域符号化部４のベクトル量子化部４１及び周波数領域符号化部５のベクトル量子化部５２は共に、コードブック記憶部６に記憶された同一の第一コードブックを用いてベクトル量子化を行う。このように、符号化方法によらず同一のコードブック（この実施形態では第一コードブック）を用いることにより、必要な記憶領域を従来よりも小さくすることができる。 Both the vector quantization unit 41 of the time domain encoding unit 4 and the vector quantization unit 52 of the frequency domain encoding unit 5 perform vector quantization using the same first codebook stored in the codebook storage unit 6. Do. In this way, by using the same code book (in this embodiment, the first code book) regardless of the encoding method, the necessary storage area can be made smaller than before.

符号多重化部７は、信号識別情報及び信号量子化符号インデックスを所定の順序に並べて１つのデータセットとして出力する（ステップＣ１０）。パケットとしてＩＰ通信網等に伝送する場合には、必要なヘッダ情報をデータセットに付加して伝送する。 The code multiplexing unit 7 arranges the signal identification information and the signal quantization code index in a predetermined order and outputs them as one data set (step C10). When the packet is transmitted to an IP communication network or the like, necessary header information is added to the data set and transmitted.

≪復号≫
図３は第一実施形態の復号装置の機能ブロックを例示する。図４は第一実施形態の復号方法の流れ図を例示する。 << Decryption >>
FIG. 3 illustrates functional blocks of the decoding device according to the first embodiment. FIG. 4 illustrates a flowchart of the decoding method of the first embodiment.

第一実施形態の復号装置は、図１に例示するように、符号分離部８、時間領域復号部９、周波数領域復号部１０、コードブック記憶部１１、重畳加算部１２、スイッチ３１，３２、信号識別部１４を含む。 As illustrated in FIG. 1, the decoding device according to the first embodiment includes a code separation unit 8, a time domain decoding unit 9, a frequency domain decoding unit 10, a codebook storage unit 11, a superposition addition unit 12, switches 31 and 32, A signal identification unit 14 is included.

符号分離部８は、入力されたデータセットから所定の位置から所定のビット数を読み込み、信号識別情報及び信号量子化符号インデックスを分離する（ステップＤ１）。信号識別情報は、信号識別部１４に送られる。信号量子化符号インデックスは、スイッチ３２の切り替え状態に応じて、時間領域復号部９又は周波数領域復号部１０に送られる。符号分離部８が特許請求の範囲の入力部に対応し、ステップＤ１が特許請求の範囲の入力ステップに対応する。 The code separation unit 8 reads a predetermined number of bits from a predetermined position from the input data set, and separates the signal identification information and the signal quantization code index (step D1). The signal identification information is sent to the signal identification unit 14. The signal quantization code index is sent to the time domain decoding unit 9 or the frequency domain decoding unit 10 according to the switching state of the switch 32. The code separation unit 8 corresponds to the input unit in the claims, and step D1 corresponds to the input step in the claims.

信号識別部１４は、信号識別情報が時間領域復号方法と周波数領域復号方法の何れを示すか識別し、スイッチ３２を切り替える（ステップＤ２）。 The signal identification unit 14 identifies whether the signal identification information indicates a time domain decoding method or a frequency domain decoding method, and switches the switch 32 (step D2).

信号識別情報が時間領域符号化方法を示す場合には、信号識別部１４は、符号分離部８と時間領域復号部９とを接続するようにスイッチ３２を切り替え、時間領域復号部９と重畳加算部１２とを接続するようにスイッチ３３を切り替える。次に、時間領域復号部９が信号量子化符号インデックスを復号する（ステップＤ３）。 When the signal identification information indicates a time domain encoding method, the signal identification unit 14 switches the switch 32 so as to connect the code separation unit 8 and the time domain decoding unit 9, and overlaps the time domain decoding unit 9. The switch 33 is switched so as to connect the unit 12. Next, the time domain decoding unit 9 decodes the signal quantization code index (step D3).

具体的には、時間領域復号部９の逆ベクトル量子化部９１が、コードブック記憶部１１に記憶された第一コードブックを用いて、逆ベクトル量子化により信号量子化符号インデックスを復号し（ステップＤ３）、フレームごとの時間領域信号を出力する。フレーム単位の時間領域信号は重畳加算部１２に送られる。 Specifically, the inverse vector quantization unit 91 of the time domain decoding unit 9 decodes the signal quantization code index by inverse vector quantization using the first codebook stored in the codebook storage unit 11 ( Step D3), outputting a time domain signal for each frame. The time domain signal in frame units is sent to the superposition addition unit 12.

一方、信号識別情報が周波数領域符号化方法を示す場合には、信号識別部１４は、符号分離部８と周波数領域復号部１０とを接続するようにスイッチ３２を切り替え、周波数領域復号部１０と重畳加算部１２とを接続するようにスイッチ３３を切り替える。次に、周波数領域復号部１０が信号量子化符号インデックスを復号する（ステップＤ４，Ｄ５）。 On the other hand, when the signal identification information indicates a frequency domain encoding method, the signal identification unit 14 switches the switch 32 so as to connect the code separation unit 8 and the frequency domain decoding unit 10, The switch 33 is switched so as to connect the superposition addition unit 12. Next, the frequency domain decoding unit 10 decodes the signal quantization code index (steps D4 and D5).

具体的には、周波数領域復号部１０の逆ベクトル量子化部１０１が、コードブック記憶部１１に記憶された第一コードブックを用いて逆ベクトル量子化により信号量子化符号インデックスを復号し（ステップＤ４）、フレームごとの周波数領域信号を周波数時間変換部１０２に送る。 Specifically, the inverse vector quantization unit 101 of the frequency domain decoding unit 10 decodes the signal quantization code index by inverse vector quantization using the first codebook stored in the codebook storage unit 11 (step D4), a frequency domain signal for each frame is sent to the frequency time conversion unit 102.

周波数時間変換部１０２は、フレームごとの周波数領域信号を、符号化の際に時間周波数変換部５１（図１）が行った時間周波数変換に対応する周波数時間変換により、フレームごとの時間領域信号に変換する（ステップＤ５）。例えば、時間周波数変換としてＭＤＣＴを用いた場合には、逆ＭＤＣＴを実行する。周波数時間変換により出力されたフレームごとの時間領域信号は、重畳加算部１２に送られる。 The frequency time conversion unit 102 converts the frequency domain signal for each frame into a time domain signal for each frame by frequency time conversion corresponding to the time frequency conversion performed by the time frequency conversion unit 51 (FIG. 1) at the time of encoding. Conversion is performed (step D5). For example, when MDCT is used as time frequency conversion, inverse MDCT is executed. The time domain signal for each frame output by the frequency time conversion is sent to the superposition addition unit 12.

時間領域復号部９の逆ベクトル量子化部９１及び周波数領域復号部１０の逆ベクトル量子化部１０１は共に、コードブック記憶部１１に記憶された同一の第一コードブックを用いて逆ベクトル量子化を行う。このように、復号方法によらず同一のコードブック（この実施形態では第一コードブック）を用いることにより、必要な記憶領域を従来よりも小さくすることができる。 Both the inverse vector quantization unit 91 of the time domain decoding unit 9 and the inverse vector quantization unit 101 of the frequency domain decoding unit 10 use the same first codebook stored in the codebook storage unit 11 to perform inverse vector quantization. I do. In this way, by using the same code book (in this embodiment, the first code book) regardless of the decoding method, the necessary storage area can be made smaller than before.

なお、コードブック記憶部１１に記憶された第一コードブックは、符号化装置のコードブック記憶部６に記憶された第一コードブックと同一のコードブックである。 The first code book stored in the code book storage unit 11 is the same code book as the first code book stored in the code book storage unit 6 of the encoding device.

重畳加算部１２は、フレームごとの時間領域信号を重畳加算して、連続した時間領域信号を生成する（ステップＤ６）。すなわち、フレームごとの時間領域信号に、例えばハミング窓等の窓関数を掛けて得られる信号をフレーム長の半分づつ重ねて加算をすることにより、連続した時間領域信号を生成する。 The superposition adding unit 12 superimposes and adds time domain signals for each frame to generate a continuous time domain signal (step D6). That is, a continuous time-domain signal is generated by adding a signal obtained by multiplying the time-domain signal for each frame by a window function such as a Hamming window, for example, by overlapping half the frame length.

［第二実施形態］
第二実施形態は、フレームを分割したサブフレーム単位でベクトル量子化、逆ベクトル量子化を行う部分で、第一実施形態とは異なる。他の部分は、第一実施形態と同様である。以下、第一実施形態と異なる部分を重点的に説明し、第一実施形態と同様である部分については同一の符号を付けて重複説明を略する。 [Second Embodiment]
The second embodiment is a part that performs vector quantization and inverse vector quantization in units of subframes obtained by dividing a frame, and is different from the first embodiment. Other parts are the same as in the first embodiment. Hereinafter, parts different from the first embodiment will be described with emphasis, and parts that are the same as those of the first embodiment will be denoted by the same reference numerals, and redundant description will be omitted.

≪符号化≫
図５に第二実施形態の符号化装置の機能ブロックを例示する。図６に第二実施形態の符号化方法の流れ図を例示する。 << Encoding >>
FIG. 5 illustrates functional blocks of the encoding apparatus according to the second embodiment. FIG. 6 illustrates a flowchart of the encoding method of the second embodiment.

第二実施形態の時間領域符号化部４は、ベクトル量子化部４１に加えて、サブフレーム分割部４２を含む。また、第二実施形態の周波数領域符号化部５は、時間周波数変換部５１及びベクトル量子化部５２に加えて、サブバンド分割部５３を含む。
入力した音響信号を、フレーム構成部１でフレーム単位の音響信号とし、信号識別部２において、信号識別情報を決定し、出力する(ステップＣ１，Ｃ２，Ｃ３，Ｃ４)。以上は、第一実施形態と同じである。 The time domain encoding unit 4 of the second embodiment includes a subframe dividing unit 42 in addition to the vector quantization unit 41. Further, the frequency domain encoding unit 5 of the second embodiment includes a subband division unit 53 in addition to the time frequency conversion unit 51 and the vector quantization unit 52.
The input acoustic signal is converted into an acoustic signal for each frame by the frame construction unit 1, and signal identification information is determined and output by the signal identification unit 2 (steps C1, C2, C3, and C4). The above is the same as in the first embodiment.

信号識別情報が時間領域符号化方法を表す場合には、時間領域符号化部４のサブフレーム分割部４２は、図１３（Ａ）に例示するように、フレーム構成部１から出力されたフレーム単位の音響信号を、サブフレームに分割する（ステップＣ１１）。例えば、１フレームのサンプル数を１６０、サブフレームあるいはサブバンドの数を１０とした場合には、１サブフレームのサンプル数は１６である。 When the signal identification information represents a time domain encoding method, the subframe division unit 42 of the time domain encoding unit 4 performs frame unit output from the frame configuration unit 1 as illustrated in FIG. Are divided into subframes (step C11). For example, when the number of samples in one frame is 160 and the number of subframes or subbands is 10, the number of samples in one subframe is 16.

サブフレーム単位で構成された音響信号は、ベクトル量子化部４１に送られる。ベクトル量子化部４１は、サブフレーム単位で構成された音響信号をサブフレーム毎に第二コードブックを用いてベクトル量子化により符号化して、その符号化結果である信号量子化符号インデックスを出力する（ステップＣ５）。 The acoustic signal configured in units of subframes is sent to the vector quantization unit 41. The vector quantization unit 41 encodes an acoustic signal configured in units of subframes by vector quantization using a second codebook for each subframe, and outputs a signal quantization code index as a result of the encoding. (Step C5).

信号識別情報が周波数領域符号化を表す場合には、時間周波数変換部５１がフレーム単位の入力音響信号を時間周波数変換して、フレーム単位の周波数領域信号を出力し、サブバンド分割部５３が、フレーム単位の周波数領域信号を、サブバンド単位に分割し（ステップＣ１２）、ベクトル量子化部５２に送る。ベクトル量子化部５２は、サブバンドで分割された周波数領域信号をサブバンド毎に、第二コードブックを用いたベクトル量子化により符号化して、信号量子化符号インデックスを出力する。 When the signal identification information represents frequency domain coding, the time-frequency conversion unit 51 performs time-frequency conversion on the input acoustic signal in units of frames and outputs a frequency unit signal in units of frames, and the subband division unit 53 The frequency domain signal in frame units is divided into subband units (step C12) and sent to the vector quantization unit 52. The vector quantization unit 52 encodes the frequency domain signal divided by the subbands for each subband by vector quantization using the second codebook, and outputs a signal quantization code index.

時間領域符号化部４のベクトル量子化部４１及び周波数領域符号化部５のベクトル量子化部５２は共に、コードブック記憶部１５に記憶された同一の第二コードブックを用いてベクトル量子化を行う。 Both the vector quantization unit 41 of the time domain encoding unit 4 and the vector quantization unit 52 of the frequency domain encoding unit 5 perform vector quantization using the same second codebook stored in the codebook storage unit 15. Do.

≪復号≫
図７に第二実施形態の復号装置の機能ブロックを例示する。図８に第二実施形態の復号方法の流れ図を例示する。 << Decryption >>
FIG. 7 illustrates functional blocks of the decoding device according to the second embodiment. FIG. 8 illustrates a flowchart of the decoding method according to the second embodiment.

第二実施形態の時間領域復号部９は、逆ベクトル量子化部９１に加えて、サブフレーム合成部９２を含む。第二実施形態の周波数領域復号部１０は、逆ベクトル量子化部１０１及び周波数時間変換部１０２に加えて、サブバンド合成部１０３を含む。 The time domain decoding unit 9 of the second embodiment includes a subframe synthesis unit 92 in addition to the inverse vector quantization unit 91. The frequency domain decoding unit 10 of the second embodiment includes a subband synthesis unit 103 in addition to the inverse vector quantization unit 101 and the frequency time conversion unit 102.

信号識別情報が時間領域符号化方法を示す場合には、時間領域復号部９の逆ベクトル量子化部９１は、コードブック記憶部１６に記憶された第二コードブックを用いて逆ベクトル量子化により信号量子化符号インデックスの復号を行い、サブフレーム単位の時間領域信号を生成する（ステップＤ３）。 When the signal identification information indicates a time domain encoding method, the inverse vector quantization unit 91 of the time domain decoding unit 9 performs inverse vector quantization using the second codebook stored in the codebook storage unit 16. The signal quantization code index is decoded to generate a time domain signal in subframe units (step D3).

そして、サブフレーム合成部９２は、図１３（Ｂ）に例示するように、サブフレームごとの時間領域信号を時系列順に並べてフレーム単位の時間領域信号を生成し（ステップＤ７）、重畳加算部１２に送る。 Then, as illustrated in FIG. 13B, the subframe synthesis unit 92 arranges the time domain signals for each subframe in time series order to generate a time domain signal in units of frames (step D7), and the superposition addition unit 12 Send to.

信号識別情報が周波数領域符号化方法を示す場合には、周波数領域復号部１０の逆ベクトル量子化部１０１は、コードブック記憶部１６に記憶された第二コードブックを用いて逆ベクトル量子化により信号量子化インデックスを復号し、サブフレーム単位の周波数領域信号を生成する（ステップＤ４）。 When the signal identification information indicates a frequency domain coding method, the inverse vector quantization unit 101 of the frequency domain decoding unit 10 performs inverse vector quantization using the second codebook stored in the codebook storage unit 16. The signal quantization index is decoded to generate a frequency domain signal in subframe units (step D4).

そして、サブバンド合成部１０３は、サブフレーム単位の周波数領域信号を合成して、フレーム単位の周波数領域信号を生成し（ステップＤ８）、時間周波数変換部１０２に送る。 Then, the subband synthesizing unit 103 synthesizes the frequency domain signals in units of subframes, generates a frequency domain signal in units of frames (step D8), and sends it to the time frequency conversion unit 102.

時間領域復号部９の逆ベクトル量子化部９１及び周波数領域復号部１０の逆ベクトル量子化部１０１は共に、コードブック記憶部１６に記憶された同一の第二コードブックを用いて逆ベクトル量子化を行う。 Both the inverse vector quantization unit 91 of the time domain decoding unit 9 and the inverse vector quantization unit 101 of the frequency domain decoding unit 10 use the same second codebook stored in the codebook storage unit 16 to perform inverse vector quantization. I do.

なお、コードブック記憶部１６に記憶された第二コードブックは、符号化装置のコードブック記憶部１５に記憶された第二コードブックと同一のコードブックである。 Note that the second code book stored in the code book storage unit 16 is the same code book as the second code book stored in the code book storage unit 15 of the encoding device.

フレームよりも短いサブフレーム、サブバンドごとにベクトル量子化を行うことにより、コードブック記憶部１５，１６に記憶された第二コードブックを構成するコードベクトルの次元を下げることができ、記憶領域をより小さくすることができる。 By performing vector quantization for each subframe and subband shorter than the frame, the dimension of the code vector constituting the second codebook stored in the codebook storage units 15 and 16 can be reduced, and the storage area can be reduced. It can be made smaller.

［第三実施形態］
第三実施形態は、サブフレームあるいはサブバンドごとに割り当てるビット数を動的に変化させる点が第二実施形態とは異なる。他の部分は第二実施形態と同様である。以下、第二実施形態と異なる部分を重点的に説明し、第二実施形態と同様である部分については同一の符号を付けて重複説明を省略する。 [Third embodiment]
The third embodiment is different from the second embodiment in that the number of bits allocated for each subframe or subband is dynamically changed. Other parts are the same as in the second embodiment. Hereinafter, parts different from those of the second embodiment will be described with emphasis, and parts that are the same as those of the second embodiment will be denoted by the same reference numerals and redundant description thereof will be omitted.

≪符号化≫
図９に第三実施形態の符号化装置の機能ブロックを例示する。図１０に第三実施形態の符号化方法の流れ図を例示する。 << Encoding >>
FIG. 9 illustrates functional blocks of the encoding apparatus according to the third embodiment. FIG. 10 illustrates a flowchart of the encoding method of the third embodiment.

第三実施形態の符号化装置は動的ビット割当部１３を含む。 The encoding device of the third embodiment includes a dynamic bit allocation unit 13.

サブフレーム分割部４２又はサブバンド分割部５３で分割されたサブフレームあるいはサブバンド単位の音響信号は、動的ビット割当部１３に送られる。 The subframe or subband unit acoustic signal divided by the subframe division unit 42 or the subband division unit 53 is sent to the dynamic bit allocation unit 13.

動的ビット割当部１３は、サブフレーム又はサブバンドごとに異なる数のビットを各サブフレーム又はサブバンドに割り当て（ステップＣ１３，Ｃ１４）、その割当結果である割当ビット情報をベクトル量子化部４１又はベクトル量子化部５２、符号多重化部７に送る。例えば、聴感的重要度の大きいサブフレーム、サブバンドほど多くのビットが割当られるような動的ビット割当を行う。または、平均振幅が大きなサブフレーム、サブバンドほど多くのビットを割り当てる。ここで、平均振幅とは、サブフレーム、サブバンドを構成するサンプルの値の平均値である。 The dynamic bit allocation unit 13 allocates a different number of bits for each subframe or subband to each subframe or subband (steps C13 and C14), and allocates bit information as a result of the allocation to the vector quantization unit 41 or The data is sent to the vector quantization unit 52 and the code multiplexing unit 7. For example, dynamic bit allocation is performed such that more bits are allocated to subframes and subbands having higher auditory importance. Alternatively, more bits are assigned to subframes and subbands having a larger average amplitude. Here, the average amplitude is an average value of values of samples constituting the subframe and the subband.

以下、動的ビット割当の一例を説明する。
サブバンド単位で構成される周波数領域信号をＳ^（ｗ）［ｋ］（ｗ＝０，…，Ｗ−１、ｋ＝０，…，Ｌ’−１）として、第ｗサブバンド周波数領域信号列Ｓ^（ｗ）［ｋ］の平均振幅指標Ａ^〜［ｗ］（ｗ＝０，…，Ｗ−１）（以下、「第ｗサブバンド平均振幅指標」と呼ぶ。）を次式にしたがって算出する。ここで、Ｗをサブバンドの数、Ｌ’をサブバンドを構成するサンプル数とした。ｒｏｕｎｄ（ｘ）は、ｘの小数点以下を切り捨てて整数化する関数である。 Hereinafter, an example of dynamic bit allocation will be described.
A frequency domain signal composed of subbands is denoted as S ^(w) [k] (w = 0,..., W−1, k = 0,..., L′−1), and the wth subband frequency domain signal sequence. S ^(w) [k] average amplitude index A ^to [w] (w = 0,..., W−1) (hereinafter referred to as “wth subband average amplitude index”) is calculated according to the following equation. . Here, W is the number of subbands, and L ′ is the number of samples constituting the subband. round (x) is a function that rounds off the decimal part of x to make it an integer.

次に、以下のような手順で第ｗサブバンドの割当ビット情報Ｂ［ｗ］（ｗ＝０，…，Ｗ−１）を算出する。まず、第ｗサブバンド平均振幅指標Ａ^〜［ｗ］から次式で決まる第ｗサブバンド聴覚的重要度ｉｐ［ｗ］（ｗ＝０，…，Ｗ−１）を算出する。
ｉｐ［ｗ］＝Ａ^〜［ｗ］／２ Next, allocated bit information B [w] (w = 0,..., W−1) of the w-th subband is calculated in the following procedure. First, w-th subband auditory importance ip [w] (w = 0,..., W−1) determined by the following equation is calculated from the w-th subband average amplitude index A ^to [w].
ip [w] = A ^to [w] / 2

次に第ｗサブバンド聴覚的重要度ｉｐ［ｗ］とビット割当テーブルＲを利用した二分探索法により、第ｗサブバンドの割当ビット情報Ｂ［ｗ］を出力する。動的ビット割当では、“ｗａｔｅｒＬｅｖｅｌ”を以下の式に基づく二分探索により選択し、“ｗａｔｅｒＬｅｖｅｌ λ”と第ｗサブバンド聴覚的重要度ｉｐ［ｗ］を利用して次式の第ｗサブバンド割当ビット情報Ｂ［ｗ］を算出する。ビット割当テーブルＲは、互いに異なるＲ個の自然数ｂ_０，ｂ_１，…，ｂ_Ｒ−１が登録されたテーブルである。ｂ_０＜ｂ_１＜…＜ｂ_Ｒ−１とする。 Next, the assigned bit information B [w] of the wth subband is output by the binary search method using the wth subband auditory importance degree ip [w] and the bit assignment table R. In dynamic bit allocation, “water Level” is selected by a binary search based on the following equation, and “water Level λ” and the w th subband auditory importance degree ip [w] are used. Band allocation bit information B [w] is calculated. Bit allocation table R is different R-number of natural numbers _b _0, b 1 together, _{..., b R-1} is a table registered. b ₀ <b ₁ <... <b _R−1 .

具体的には、例えば図１４に示す方法で算出すればよい。まず、パラメータ（ｍａｘＩＰ，ｍｉｎＩＰ，λ，ｉ）を初期値に設定する（Ｓ５０３５１）。一時的なＢ［ｗ］の値として、Ｂｔ［ｗ］（ｗ＝１，…，Ｗ−１）を計算し、計算されたＢｔ［ｗ］の合計Ｓｕｍ＿Ｂｔ＝Σ_０ ^Ｗ−１Ｂｔ［ｗ］を求める（Ｓ５０３５２）。Ｓｕｍ＿Ｂｔが割当可能な総ビット数（total_bit_budget）を越えるかを確認する（Ｓ５０３５３）。 Specifically, for example, the calculation may be performed by the method shown in FIG. First, parameters (maxIP, minIP, λ, i) are set to initial values (S50351). Bt [w] (w = 1,..., W−1) is calculated as a temporary B [w] value, and the sum of the calculated Bt [w] Sum_Bt = Σ ₀ ^W−1 Bt [w] Is obtained (S50352). It is confirmed whether Sum_Bt exceeds the allocatable total number of bits (total_bit_budget) (S50353).

ステップＳ５０３５３がＹｅｓの場合、パラメータ（ｍｉｎＩＰ，λ，ｉ）を変更する（Ｓ５０３５４）。ステップＳ５０３５３がＮｏの場合、Ｂ^ｉ［ｗ］をＢｔ［ｗ］とし、パラメータ（ｍａｘＩＰ，λ，ｉ）を変更する（Ｓ５０３５５）。 If step S50353 is Yes, the parameter (minIP, λ, i) is changed (S50354). When step S50353 is No, B ⁱ [w] is set to Bt [w], and the parameters (maxIP, λ, i) are changed (S50355).

ｉが事前に定めた定数未満かを確認する（Ｓ５０３５６）。ステップＳ５０３５６がＹｅｓの場合、ステップＳ５０３５２に戻る。ステップＳ５０３５６がＮｏの場合、Ｂ^ｉ［ｗ］を、第ｗサブバンドの割当ビット情報Ｂ［ｗ］として出力する。また、事前に決定した回数の反復探索が終了した時点で上記のＢ［ｗ］の式の評価を行う。 It is confirmed whether i is less than a predetermined constant (S50356). If step S50356 is Yes, the process returns to step S50352. When Step S50356 is No, B ⁱ [w] is output as allocated bit information B [w] of the w-th subband. In addition, when the iterative search of a predetermined number of times is completed, the above expression B [w] is evaluated.

反復処理を終了する収束条件を別途定義して処理を終了してもよい。例えば、割当ビット数の合計が割当可能な総ビット数（total_bit_budget）に等しくなるような場合に処理を終了するなどの方法が考えられる。最終的なビット数の合計が割当可能な総ビット数を越える場合には、例えばｉｐ［ｗ］が小さいサブバンドから順に、上式で選択したビット数よりテーブル一つ分小さいビットを割り当てることにより削減し、割当ビット数の合計が総ビット数の合計よりも小さくなるように調整し、最終的な第ｗサブバンド割当ビット情報を決定する。 The process may be ended by separately defining a convergence condition for ending the iterative process. For example, a method of ending the process when the total number of allocated bits is equal to the total number of bits that can be allocated (total_bit_budget) can be considered. If the final total number of bits exceeds the allocatable total number of bits, for example, by assigning bits that are smaller by one table than the number of bits selected in the above equation, in order from the subband with the smallest ip [w]. The final w-th subband allocation bit information is determined by adjusting so that the total number of allocated bits is smaller than the total number of total bits.

サブフレーム単位の入力信号において、動的ビット割当を行う場合には、サブバンド単位で構成される周波数領域信号の代わりに、サブフレーム単位で構成される時間領域信号を用いればよい。 When dynamic bit allocation is performed in an input signal in subframe units, a time domain signal configured in subframe units may be used instead of a frequency domain signal configured in subband units.

信号識別情報が時間領域符号化方法を示す場合には、ベクトル量子化部４１は、割り当てられたビット数Ｂ［ｗ］に応じて定まる探索範囲で、コードブック記憶部６に記憶された第二コードブックを探索して、各サブフレームｗ（ｗ＝１，…，Ｗ）をベクトル量子化する（ステップＣ５、図１５、図１６参照）。例えば、Ｂ［ｗ］個のビット数で表すことができる数は２＾Ｂ［ｗ］であるため、第二コードブックの中の０番目から２＾Ｂ［ｗ］−１番目までの２＾Ｂ［ｗ］個のコードベクトルを探索して、サブフレーム単位で構成される音響信号からなるベクトルに最も近いベクトルを選択し、Ｂ［ｗ］個のビットで、選択したコードベクトルを表現して、信号量子化符号インデックスとする。 When the signal identification information indicates a time domain encoding method, the vector quantization unit 41 stores the second data stored in the codebook storage unit 6 within a search range determined according to the allocated number of bits B [w]. The code book is searched, and each subframe w (w = 1,..., W) is vector quantized (see step C5, FIG. 15 and FIG. 16). For example, since the number that can be represented by the number of B [w] bits is 2 ^ B [w], 2 ^ 0th to 2 ^ B [w] -1th in the second codebook B [w] code vectors are searched to select a vector closest to a vector composed of acoustic signals in subframe units, and the selected code vector is expressed by B [w] bits. , The signal quantization code index.

信号識別情報が周波数領域符号化方法を示す場合には、ベクトル量子化部５２は、ベクトル量子化部４１と同様に、割り当てられたビット数Ｂ［ｗ］に応じて定まる探索範囲で、コードブック記憶部６に記憶された第二コードブックを探索して、各サブバンドｗ（ｗ＝１，…，Ｗ）をベクトル量子化する（ステップＣ８、図１５、図１６参照）。 When the signal identification information indicates a frequency domain encoding method, the vector quantization unit 52, like the vector quantization unit 41, uses a codebook within a search range determined according to the allocated number of bits B [w]. The second codebook stored in the storage unit 6 is searched, and each subband w (w = 1,..., W) is vector quantized (see step C8, FIG. 15 and FIG. 16).

割当ビット情報は符号多重化部７に送られ、符号多重化部７は、信号識別情報と、信号量子化符号インデックスと、割当ビット情報とを所定の順序で並べてデータセットとする（ステップＣ９）。ＩＰ網上を伝送する場合は、必要なヘッダ情報を付加してパケットとし、出力する。 The assigned bit information is sent to the code multiplexing unit 7. The code multiplexing unit 7 arranges the signal identification information, the signal quantization code index, and the assigned bit information in a predetermined order to form a data set (step C9). . When transmitting on the IP network, necessary header information is added to form a packet and output.

≪復号≫
図１１に第三実施形態の復号装置の機能ブロックを例示する。第三実施形態の復号方法の流れ図を図１２に例示する。 << Decryption >>
FIG. 11 illustrates functional blocks of the decoding device according to the third embodiment. FIG. 12 illustrates a flowchart of the decoding method according to the third embodiment.

符号分離部８は、入力されたデータセットの所定の位置から所定のビット数を読出し、信号識別情報、信号量子化符号インデックス、割当ビット情報を生成する（ステップＤ１）。 The code separation unit 8 reads a predetermined number of bits from a predetermined position of the input data set, and generates signal identification information, a signal quantization code index, and assigned bit information (step D1).

信号識別部１４は、信号識別情報に応じてスイッチ３２，３３、３４の切り替えを行う。信号識別情報が時間領域符号化方法を示す場合には、信号識別部１４は符号分離部８と時間領域復号部９とを接続するようにスイッチ３４を切り替える。一方、信号識別情報が周波数領域符号化方法を示す場合には、信号識別部１４は符号分離部８と周波数領域復号部１０とを接続するようにスイッチ３４を切り替える。 The signal identification unit 14 switches the switches 32, 33, and 34 according to the signal identification information. When the signal identification information indicates a time domain encoding method, the signal identification unit 14 switches the switch 34 so as to connect the code separation unit 8 and the time domain decoding unit 9. On the other hand, when the signal identification information indicates a frequency domain encoding method, the signal identification unit 14 switches the switch 34 so as to connect the code separation unit 8 and the frequency domain decoding unit 10.

信号識別情報が時間領域符号化方法を示す場合には、時間領域復号部９の逆ベクトル量子化部９１は、割当ビット情報を参照して、各サブフレームの信号量子化インデックスを求める（ステップＤ９）。そして、コードブック記憶部１６に記憶された第二コードブックを用いて逆ベクトル量子化により信号量子化符号インデックスの復号を行い、サブフレームごとの時間領域信号を生成する（ステップＤ３）。 When the signal identification information indicates a time domain encoding method, the inverse vector quantization unit 91 of the time domain decoding unit 9 obtains a signal quantization index for each subframe with reference to the assigned bit information (step D9). ). Then, the signal quantization code index is decoded by inverse vector quantization using the second codebook stored in the codebook storage unit 16 to generate a time domain signal for each subframe (step D3).

信号識別情報が周波数領域符号化方法を示す場合には、周波数領域復号部１０の逆ベクトル量子化部１０１は、割当ビット情報を参照して、各サブフレームの信号量子化インデックスを求める（ステップＤ９）。そして、コードブック記憶部１６に記憶された第二コードブックを用いて逆ベクトル量子化により信号量子化符号インデックスの復号を行い、サブバンドごとの周波数領域信号を生成する（ステップＤ４）。 When the signal identification information indicates a frequency domain encoding method, the inverse vector quantization unit 101 of the frequency domain decoding unit 10 refers to the assigned bit information and obtains a signal quantization index for each subframe (step D9). ). Then, the signal quantization code index is decoded by inverse vector quantization using the second codebook stored in the codebook storage unit 16 to generate a frequency domain signal for each subband (step D4).

時間領域復号部９の逆ベクトル量子化部９１及び周波数領域復号部１０の逆ベクトル量子化部１０１は共に、コードブック記憶部１６に記憶された同一の第二コードブックを用いて逆ベクトル量子化を行う。このように、復号方法によらず同一のコードブック（この実施形態では第二コードブック）を用いることにより、必要な記憶領域を従来よりも小さくすることができる。 Both the inverse vector quantization unit 91 of the time domain decoding unit 9 and the inverse vector quantization unit 101 of the frequency domain decoding unit 10 use the same second codebook stored in the codebook storage unit 16 to perform inverse vector quantization. I do. In this way, by using the same code book (in this embodiment, the second code book) regardless of the decoding method, the necessary storage area can be made smaller than before.

サブフレーム合成部９２およびサブバンド合成部１０３、および周波数時間変換部１０２、重畳加算部１２の動作は第二実施形態と同じである。 The operations of the subframe synthesis unit 92, the subband synthesis unit 103, the frequency time conversion unit 102, and the superposition addition unit 12 are the same as those in the second embodiment.

このように、平均振幅が大きなサブフレームあるいはサブバンド、すなわち聴覚的に重要なサブフレームあるいはサブバンドほど多くのビットを割り当てることにより、限られた符号長の中でできる限り精度良く符号化をすることができる。 In this way, by assigning more bits to subframes or subbands with a large average amplitude, that is, auditory important subframes or subbands, encoding is performed as accurately as possible within a limited code length. be able to.

［変形例等］
符号化方法及び復号方法は、コンピュータによって実現することができる。各装置が有すべき機能の処理内容はプログラムによって記述される。そして、このプログラムをコンピュータで実行することにより、各装置における各処理機能が、コンピュータ上で実現される。 [Modifications, etc.]
The encoding method and the decoding method can be realized by a computer. Processing contents of functions that each device should have are described by a program. Then, by executing this program on a computer, each processing function in each apparatus is realized on the computer.

この処理内容を記述したプログラムは、コンピュータで読み取り可能な記録媒体に記録しておくことができる。また、この形態では、コンピュータ上で所定のプログラムを実行させることにより、これらの装置を構成することとしたが、これらの処理内容の少なくとも一部をハードウェア的に実現することとしてもよい。 The program describing the processing contents can be recorded on a computer-readable recording medium. In this embodiment, these apparatuses are configured by executing a predetermined program on a computer. However, at least a part of these processing contents may be realized by hardware.

この発明は、上述の実施形態に限定されるものではなく、この発明の趣旨を逸脱しない範囲で適宜変更が可能である。 The present invention is not limited to the above-described embodiment, and can be modified as appropriate without departing from the spirit of the present invention.

１フレーム構成部
２信号識別部
４時間領域符号化部
４１ベクトル量子化部
４２サブフレーム分割部
５周波数領域符号化部
５１時間周波数変換部
５２ベクトル量子化部
５３サブフレーム分割部
６第一コードブック記憶部
９時間領域復号部
９１逆ベクトル量子化部
９２サブフレーム合成部
１０周波数領域復号部
１０１逆ベクトル量子化部
１０２時間周波数変換部
１０３サブフレーム合成部
１１第二コードブック記憶部
１３動的ビット割当部
１４信号識別部
１５第一コードブック記憶部
１６第二コードブック記憶部 DESCRIPTION OF SYMBOLS 1 Frame structure part 2 Signal identification part 4 Time domain encoding part 41 Vector quantization part 42 Subframe division | segmentation part 5 Frequency domain encoding part 51 Time frequency conversion part 52 Vector quantization part 53 Subframe division part 6 1st codebook Storage unit 9 Time domain decoding unit 91 Inverse vector quantization unit 92 Subframe synthesis unit 10 Frequency domain decoding unit 101 Inverse vector quantization unit 102 Time frequency conversion unit 103 Subframe synthesis unit 11 Second codebook storage unit 13 Dynamic bit Allocation unit 14 Signal identification unit 15 First codebook storage unit 16 Second codebook storage unit

Claims

A frame configuration step for configuring a sound signal for each frame by collecting a predetermined number of samples constituting the input sound signal;
A signal identification method that selects a coding method capable of obtaining an acoustic signal with higher quality when decoded from among a time domain coding method and a frequency domain coding method, and outputs signal identification information as a result of the selection. Steps,
When the time domain encoding method is selected, the time for encoding the acoustic signal for each frame by vector quantization using the first codebook and outputting the signal quantization code index that is the encoding result A region encoding step;
When a frequency domain encoding method is selected, the acoustic signal for each frame is converted into a frequency domain signal, encoded by vector quantization using the first codebook, and a signal that is the encoding result A frequency domain encoding step for outputting a quantization code index;
An encoding method including:

The encoding method according to claim 1,
The time domain encoding step includes a subframe division step for dividing the acoustic signal for each frame into subframes shorter than the predetermined time length, and vector quantization of the acoustic signal for each subframe using a second codebook. And a vector quantization step for outputting a signal quantization code index as a result of the encoding, and
The frequency domain encoding step includes a time frequency conversion step for converting the acoustic signal for each frame into a frequency domain signal, and a subband division step for dividing the converted frequency domain signal into subbands shorter than the predetermined time length. And a vector quantization step of encoding a frequency domain signal for each subband by vector quantization using the second codebook, and outputting a signal quantization code index that is a result of the encoding.
An encoding method characterized by the above.

The encoding method according to claim 2, wherein
The time domain encoding step further includes a dynamic bit allocation step of allocating a different number of bits for each subframe to each subframe and outputting allocation bit information which is a result of the bit allocation, and the allocated bits are Searching for the code vector included in the second codebook, the more the number is, and representing the signal quantization code index with the assigned bits.
The frequency domain encoding step further includes a dynamic bit allocation step of allocating a different number of bits for each subband to each subband and outputting allocation bit information as a result of the bit allocation, and the allocated bits are Searching for the code vector included in the second codebook, the more the number is, and representing the signal quantization code index with the assigned bits.
An encoding method characterized by the above.

An input step in which the signal quantization code index output by the encoding method according to claim 1 and the signal identification information are input;
A signal identification step for identifying the encoding method indicated by the signal identification information;
When the signal identification information indicates a time domain encoding method, a time domain decoding step of decoding the signal quantization code index by inverse vector quantization using the first codebook;
If the signal identification information indicates a frequency domain encoding method, a frequency domain decoding step of decoding the signal quantization code index by inverse vector quantization using the first codebook and converting it to a time domain signal; ,
A decoding method including:

An input step in which the signal quantization code index output by the encoding method according to claim 2 or 3 and the signal identification information are input;
The time-domain decoding step includes: an inverse vector quantization step for generating a time-domain signal for each subframe by decoding the signal quantization code index by inverse vector quantization using the second codebook; Including a subframe synthesis step of generating time domain signals for each frame by arranging time domain signals for each subframe in chronological order;
The frequency domain decoding step includes: an inverse vector quantization step for generating a frequency domain signal for each subband by decoding the signal quantization code index by inverse vector quantization using the second codebook; A subband combining step for generating a frequency domain signal for each frame by combining frequency domain signals for each subband; and a time domain signal conversion step for converting the generated frequency domain signal for each frame into a time domain signal. ,
A decoding method characterized by the above.

A frame configuration unit that configures an acoustic signal for each frame by collecting a predetermined number of samples constituting the input acoustic signal,
A signal identification method that selects a coding method capable of obtaining an acoustic signal with higher quality when decoded from among a time domain coding method and a frequency domain coding method, and outputs signal identification information as a result of the selection. And
When the time domain encoding method is selected, the time for encoding the acoustic signal for each frame by vector quantization using the first codebook and outputting the signal quantization code index that is the encoding result A region encoding unit;
When a frequency domain encoding method is selected, the acoustic signal for each frame is converted into a frequency domain signal, encoded by vector quantization using the first codebook, and a signal that is the encoding result A frequency domain encoding unit for outputting a quantization code index;
An encoding device including:

The encoding device according to claim 6, wherein
The time domain encoding unit includes a subframe dividing unit that divides the acoustic signal for each frame into subframes shorter than the predetermined time length, and vector quantization of the acoustic signal for each subframe using a second codebook. And a vector quantization unit that outputs a signal quantization code index that is a result of the encoding, and
The frequency domain encoding unit includes a time frequency conversion unit that converts the acoustic signal for each frame into a frequency domain signal, and a subband division unit that divides the converted frequency domain signal into subbands shorter than the predetermined time length. And a vector quantization unit that encodes a frequency domain signal for each subband by vector quantization using the second codebook, and outputs a signal quantization code index that is a result of the encoding.
An encoding apparatus characterized by that.

The encoding device according to claim 7,
The time domain encoding unit further includes a dynamic bit allocation unit that allocates a different number of bits for each subframe to each subframe, and outputs allocation bit information that is a result of the bit allocation. The greater the number, the more code vectors included in the second codebook are searched, and the assigned bit represents the signal quantization code index.
The frequency domain encoding unit further includes a dynamic bit allocation unit that allocates a different number of bits for each subframe to each subframe and outputs allocation bit information that is a result of the bit allocation. The greater the number, the more code vectors included in the second codebook are searched, and the assigned bit represents the signal quantization code index.
An encoding apparatus characterized by that.

An input unit to which the signal quantization code index and the signal identification information output by the encoding device according to claim 6 are input;
A signal identification unit for identifying an encoding method indicated by the signal identification information;
When the signal identification information indicates a time domain encoding method, a time domain decoding unit that decodes the signal quantization code index by inverse vector quantization using the first codebook;
When the signal identification information indicates a frequency domain encoding method, a frequency domain decoding unit that decodes the signal quantization code index by inverse vector quantization using the first codebook and converts it into a time domain signal; ,
A decoding device.

An input unit to which the signal quantization code index and the signal identification information output by the encoding device according to claim 7 or 8 are input;
The time domain decoding unit is generated with an inverse vector quantization unit that generates a time domain signal for each subframe by decoding the signal quantization code index using the second codebook by inverse vector quantization. A time domain signal for each subframe is arranged in chronological order, and a subframe synthesis unit that generates a time domain signal for each frame, and
The frequency domain decoding unit is generated with an inverse vector quantization unit that generates a frequency domain signal for each subband by decoding the signal quantization code index by inverse vector quantization using the second codebook. A subband synthesis unit that synthesizes frequency domain signals for each subband to generate a frequency domain signal for each frame, and a time domain signal conversion unit that converts the generated frequency domain signal for each frame into a time domain signal ,
A decoding device characterized by the above.

A program for causing a computer to execute each step of the encoding method according to claim 1 or each step of the decoding method according to claim 4 or 5.