JP6793221B2

JP6793221B2 - Audio decoding device

Info

Publication number: JP6793221B2
Application number: JP2019087812A
Authority: JP
Inventors: 菊入　圭; 圭菊入; 山口　貴史; 貴史山口
Original assignee: NTT Docomo Inc
Current assignee: NTT Docomo Inc
Priority date: 2012-04-27
Filing date: 2019-05-07
Publication date: 2020-12-02
Anticipated expiration: 2032-11-20
Also published as: JP6526126B2; JP2019144591A; JP2016173595A; JP6200034B2; JP2017204010A

Description

本発明は、音声復号装置に関する。 The present invention relates to a voice decoding device.

音声信号、音響信号のデータ量を数十分の一に圧縮する音声符号化技術は、信号の伝送・蓄積において極めて重要な技術である。広く利用されている音声符号化技術の例として、時間領域にて信号を符号化する符号励振線形予測符号化（CELP）、周波数領域にて信号を符号化する変換符号励振符号化（TCX）、“ISO/IEC MPEG”で標準化された“MPEG4 AAC”などを挙げることができる。 A voice coding technology that compresses the amount of data of a voice signal and an acoustic signal to one tenth is an extremely important technology in signal transmission / storage. Examples of widely used speech coding techniques include code-excited linear predictive coding (CELP), which encodes a signal in the time domain, conversion code excitation coding (TCX), which encodes a signal in the frequency domain. "MPEG4 AAC" standardized by "ISO / IEC MPEG" can be mentioned.

音声符号化の性能をさらに向上させ、低いビットレートで高い音声品質を得る方法として、音声の低周波成分を用いて高周波成分を生成する帯域拡張技術が近年広く用いられるようになった。帯域拡張技術の代表的な例は“MPEG4 AAC”で利用されるSBR（Spectral Band Replication）技術が挙げられる。 In recent years, band expansion technology that generates high-frequency components using low-frequency components of speech has become widely used as a method for further improving the performance of speech coding and obtaining high speech quality at a low bit rate. A typical example of the band expansion technology is the SBR (Spectral Band Replication) technology used in "MPEG4 AAC".

音声符号化においては、入力信号を符号化して得られた符号化系列を復号して得られる復号信号の時間包絡形状が入力信号の時間包絡形状と大きく異なり、歪みとして知覚される場合がある。また、帯域拡張技術を用いる場合には、音声信号の低周波数成分を上記のような音声符号化技術で符号化・復号して得られた信号を用いて高周波数成分を生成するため、同様に高周波数成分の時間包絡形状も異なり、歪みとして知覚される場合がある。 In voice coding, the time-envelope shape of the decoded signal obtained by decoding the coding sequence obtained by encoding the input signal is significantly different from the time-envelope shape of the input signal, and may be perceived as distortion. Further, when the band expansion technology is used, the low frequency component of the voice signal is encoded / decoded by the voice coding technology as described above, and the high frequency component is generated using the signal obtained. The time-wrapping shape of high-frequency components is also different and may be perceived as distortion.

この課題に対する解決手法として、以下の手法が知られている（下記特許文献１参照）。すなわち、高周波数成分を生成するために、任意の時間セグメント内において高周波数成分を周波数帯域に分割し、当該周波数帯域ごとのエネルギーの情報を算出し符号化する際に、当該周波数帯域ごとのエネルギーの情報を上記時間セグメントよりも短い時間セグメント毎に算出・符号化する。この際、上記分割する周波数帯域、及び短い時間セグメントについて、各周波数帯域の帯域幅、及び短い時間セグメントの長さを柔軟に設定できる。これにより、復号装置においては、時間方向については、短い時間セグメント毎に高周波数成分のエネルギーを制御する、すなわち短い時間セグメント毎に高周波数成分の時間包絡を制御することができる。 The following methods are known as solutions to this problem (see Patent Document 1 below). That is, in order to generate a high frequency component, the high frequency component is divided into frequency bands within an arbitrary time segment, and when the energy information for each frequency band is calculated and encoded, the energy for each frequency band is calculated. Is calculated and encoded for each time segment shorter than the above time segment. At this time, the bandwidth of each frequency band and the length of the short time segment can be flexibly set for the divided frequency band and the short time segment. Thereby, in the decoding device, in the time direction, the energy of the high frequency component can be controlled for each short time segment, that is, the time envelope of the high frequency component can be controlled for each short time segment.

米国特許第7,191,121号U.S. Pat. No. 7,191,121

しかし、上記特許文献１の方法によると、高周波数成分の時間包絡を詳細に制御するためには、非常に短い時間セグメントに区切り、当該短い時間セグメント毎に周波数帯域毎のエネルギー情報を算出・符号化する必要があるため、当該情報の情報量が非常に大きくなり低ビットレートでの符号化が困難になるという問題がある。 However, according to the method of Patent Document 1, in order to control the time wrapping of high frequency components in detail, it is divided into very short time segments, and energy information for each frequency band is calculated and coded for each short time segment. Therefore, there is a problem that the amount of information of the information becomes very large and it becomes difficult to encode at a low bit rate.

上記の問題に鑑み、本発明は、少ない情報量で復号信号の時間包絡形状を修正し知覚される歪みを軽減することを目的とする。 In view of the above problems, it is an object of the present invention to modify the time-envelope shape of the decoded signal with a small amount of information and reduce the perceived distortion.

本発明の一実施形態に係る音声符号化装置は、入力される音声信号を符号化して符号化系列を出力する音声符号化装置であって、前記音声信号を符号化する音声符号化部と、前記音声信号の時間包絡情報を算出し符号化する時間包絡情報符号化部と、前記音声符号化部で得られる前記音声信号を含む符号化系列と、前記時間包絡情報符号化部で得られる時間包絡情報の符号化系列とを多重化する符号化系列多重化部と、を備え、前記時間包絡情報は、前記音声信号の高周波数信号の時間包絡の相加平均と相乗平均との比に基づいて生成される。 The voice coding device according to the embodiment of the present invention is a voice coding device that encodes an input voice signal and outputs a coded sequence, and includes a voice coding unit that encodes the voice signal. A time-wrapping information coding unit that calculates and encodes the time-wrapping information of the voice signal, a coding series including the voice signal obtained by the voice coding unit, and a time obtained by the time-wrapping information coding unit. A coding sequence multiplexing unit for multiplexing a coding sequence of the inclusion information is provided, and the time inclusion information is based on the ratio of the additive average and the synergistic average of the time inclusion of the high frequency signal of the voice signal. Is generated.

本発明の一実施形態に係る音声符号化方法は、入力される音声信号を符号化して符号化系列を出力する音声符号化装置、により実行される音声符号化方法であって、前記音声信号を符号化する音声符号化ステップと、前記音声信号の時間包絡情報を算出し符号化する時間包絡情報符号化ステップと、前記音声符号化ステップで得られる前記音声信号を含む符号化系列と、前記時間包絡情報符号化ステップで得られる時間包絡情報の符号化系列とを多重化する符号化系列多重化ステップと、を備え、前記時間包絡情報は、前記音声信号の高周波数信号の時間包絡の相加平均と相乗平均との比に基づいて生成される。 The voice coding method according to the embodiment of the present invention is a voice coding method executed by a voice coding device that encodes an input voice signal and outputs a coded sequence, and obtains the voice signal. A voice coding step to be encoded, a time-wrapping information coding step for calculating and coding the time-wrapping information of the voice signal, a coding series including the voice signal obtained in the voice coding step, and the time. The time-encapsulation information includes a coding sequence multiplexing step for multiplexing the coding sequence of the time-encapsulation information obtained in the encapsulation information coding step, and the time-encapsulation information includes time-encapsulation of the high-frequency signal of the voice signal. Generated based on the ratio of the average to the synergistic average.

本発明によれば、少ない情報量で復号信号の時間包絡形状を修正し知覚される歪みを軽減することができる。 According to the present invention, it is possible to correct the time envelope shape of the decoded signal with a small amount of information and reduce the perceived distortion.

第1の実施形態に係る音声復号装置10の構成を示す図である。It is a figure which shows the structure of the voice decoding apparatus 10 which concerns on 1st Embodiment. 第1の実施形態に係る音声復号装置10の動作を示すフローチャートである。It is a flowchart which shows the operation of the voice decoding apparatus 10 which concerns on 1st Embodiment. 第1の実施形態に係る音声符号化装置20の構成を示す図である。It is a figure which shows the structure of the voice coding apparatus 20 which concerns on 1st Embodiment. 第1の実施形態に係る音声符号化装置20の動作を示すフローチャートである。It is a flowchart which shows the operation of the voice coding apparatus 20 which concerns on 1st Embodiment. 第1の実施形態に係る音声復号装置の第1の変形例10Aの構成を示す図である。It is a figure which shows the structure of the 1st modification 10A of the voice decoding apparatus which concerns on 1st Embodiment. 第1の実施形態に係る音声復号装置の第1の変形例10Aの動作を示すフローチャートである。It is a flowchart which shows the operation of the 1st modification 10A of the voice decoding apparatus which concerns on 1st Embodiment. 第1の実施形態に係る音声復号装置の第2の変形例10Bの構成を示す図である。It is a figure which shows the structure of the 2nd modification 10B of the voice decoding apparatus which concerns on 1st Embodiment. 第1の実施形態に係る音声復号装置の第3の変形例10Cの構成を示す図である。It is a figure which shows the structure of the 3rd modification 10C of the voice decoding apparatus which concerns on 1st Embodiment. 第1の実施形態に係る音声符号化装置の第1の変形例20Aの構成を示す図である。It is a figure which shows the structure of the 1st modification 20A of the voice coding apparatus which concerns on 1st Embodiment. 第1の実施形態に係る音声符号化装置の第1の変形例20Aの動作を示すフローチャートである。It is a flowchart which shows the operation of the 1st modification 20A of the voice coding apparatus which concerns on 1st Embodiment. 第2の実施形態に係る音声復号装置11の構成を示す図である。It is a figure which shows the structure of the voice decoding apparatus 11 which concerns on 2nd Embodiment. 第2の実施形態に係る音声復号装置11の動作を示すフローチャートである。It is a flowchart which shows the operation of the voice decoding apparatus 11 which concerns on 2nd Embodiment. 第2の実施形態に係る音声符号化装置21の構成を示す図である。It is a figure which shows the structure of the voice coding apparatus 21 which concerns on 2nd Embodiment. 第2の実施形態に係る音声符号化装置21の動作を示すフローチャートである。It is a flowchart which shows the operation of the voice coding apparatus 21 which concerns on 2nd Embodiment. 第2の実施形態に係る音声符号化装置の第1の変形例21Aの構成を示す図である。It is a figure which shows the structure of the 1st modification 21A of the voice coding apparatus which concerns on 2nd Embodiment. 第2の実施形態に係る音声符号化装置の第1の変形例21Aの動作を示すフローチャートである。It is a flowchart which shows the operation of the 1st modification 21A of the voice coding apparatus which concerns on 2nd Embodiment. 第3の実施形態に係る音声復号装置12の構成を示す図である。It is a figure which shows the structure of the voice decoding apparatus 12 which concerns on 3rd Embodiment. 第3の実施形態に係る音声復号装置12の動作を示すフローチャートである。It is a flowchart which shows the operation of the voice decoding apparatus 12 which concerns on 3rd Embodiment. 第3の実施形態に係る音声符号化装置22の構成を示す図である。It is a figure which shows the structure of the voice coding apparatus 22 which concerns on 3rd Embodiment. 第3の実施形態に係る音声符号化装置22の動作を示すフローチャートである。It is a flowchart which shows the operation of the voice coding apparatus 22 which concerns on 3rd Embodiment. 第3の実施形態に係る音声符号化装置の第1の変形例22Aの構成を示す図である。It is a figure which shows the structure of the 1st modification 22A of the voice coding apparatus which concerns on 3rd Embodiment. 第3の実施形態に係る音声符号化装置の第1の変形例22Aの動作を示すフローチャートである。It is a flowchart which shows the operation of the 1st modification 22A of the voice coding apparatus which concerns on 3rd Embodiment. 第3の実施形態に係る音声符号化装置の第2の変形例22Bの構成を示す図である。It is a figure which shows the structure of the 2nd modification 22B of the voice coding apparatus which concerns on 3rd Embodiment. 第3の実施形態に係る音声符号化装置の第1の変形例22Bの動作を示すフローチャートである。It is a flowchart which shows the operation of the 1st modification 22B of the voice coding apparatus which concerns on 3rd Embodiment. 第4の実施形態に係る音声復号装置13の構成を示す図である。It is a figure which shows the structure of the voice decoding apparatus 13 which concerns on 4th Embodiment. 第4の実施形態に係る音声復号装置13の動作を示すフローチャートである。It is a flowchart which shows the operation of the voice decoding apparatus 13 which concerns on 4th Embodiment. 第4の実施形態に係る音声符号化装置23の構成を示す図である。It is a figure which shows the structure of the voice coding apparatus 23 which concerns on 4th Embodiment. 第4の実施形態に係る音声符号化装置23の動作を示すフローチャートである。It is a flowchart which shows the operation of the voice coding apparatus 23 which concerns on 4th Embodiment. 第4の実施形態に係る音声復号装置の第1の変形例13Aの構成を示す図である。It is a figure which shows the structure of the 1st modification 13A of the voice decoding apparatus which concerns on 4th Embodiment. 第4の実施形態に係る音声復号装置の第1の変形例13Aの動作を示すフローチャートである。It is a flowchart which shows the operation of the 1st modification 13A of the voice decoding apparatus which concerns on 4th Embodiment. 第4の実施形態に係る音声復号装置の第2の変形例13Bの構成を示す図である。It is a figure which shows the structure of the 2nd modification 13B of the voice decoding apparatus which concerns on 4th Embodiment. 第4の実施形態に係る音声復号装置の第3の変形例13Cの構成を示す図である。It is a figure which shows the structure of the 3rd modification 13C of the voice decoding apparatus which concerns on 4th Embodiment. 第4の実施形態に係る音声符号化装置の第1の変形例23Aの構成を示す図である。It is a figure which shows the structure of the 1st modification 23A of the voice coding apparatus which concerns on 4th Embodiment. 第4の実施形態に係る音声符号化装置の第1の変形例23Aの動作を示すフローチャートである。It is a flowchart which shows the operation of the 1st modification 23A of the voice coding apparatus which concerns on 4th Embodiment. 第5の実施形態に係る音声復号装置14の構成を示す図である。It is a figure which shows the structure of the voice decoding apparatus 14 which concerns on 5th Embodiment. 第5の実施形態に係る音声復号装置14の動作を示すフローチャートである。It is a flowchart which shows the operation of the voice decoding apparatus 14 which concerns on 5th Embodiment. 第5の実施形態に係る音声符号化装置24の構成を示す図である。It is a figure which shows the structure of the voice coding apparatus 24 which concerns on 5th Embodiment. 第5の実施形態に係る音声符号化装置24の動作を示すフローチャートである。It is a flowchart which shows the operation of the voice coding apparatus 24 which concerns on 5th Embodiment. 第5の実施形態に係る音声復号装置の第1の変形例14Aの構成を示す図である。It is a figure which shows the structure of the 1st modification 14A of the voice decoding apparatus which concerns on 5th Embodiment. 第5の実施形態に係る音声復号装置の第1の変形例14Aの動作を示すフローチャートである。It is a flowchart which shows the operation of the 1st modification 14A of the voice decoding apparatus which concerns on 5th Embodiment. 第6の実施形態に係る音声復号装置15の構成を示す図である。It is a figure which shows the structure of the voice decoding apparatus 15 which concerns on 6th Embodiment. 第6の実施形態に係る音声復号装置15の動作を示すフローチャートである。It is a flowchart which shows the operation of the voice decoding apparatus 15 which concerns on 6th Embodiment. 第6の実施形態に係る音声符号化装置25の構成を示す図である。It is a figure which shows the structure of the voice coding apparatus 25 which concerns on 6th Embodiment. 第6の実施形態に係る音声符号化装置25の動作を示すフローチャートである。It is a flowchart which shows the operation of the voice coding apparatus 25 which concerns on 6th Embodiment. 第6の実施形態に係る音声復号装置の第1の変形例15Aの構成を示す図である。It is a figure which shows the structure of the 1st modification 15A of the voice decoding apparatus which concerns on 6th Embodiment. 第6の実施形態に係る音声復号装置の第1の変形例15Aの動作を示すフローチャートである。It is a flowchart which shows the operation of the 1st modification 15A of the voice decoding apparatus which concerns on 6th Embodiment. 第7の実施形態に係る音声復号装置16の構成を示す図である。It is a figure which shows the structure of the voice decoding apparatus 16 which concerns on 7th Embodiment. 第7の実施形態に係る音声復号装置の動作を示すフローチャートである。It is a flowchart which shows the operation of the voice decoding apparatus which concerns on 7th Embodiment. 第7の実施形態に係る音声符号化装置26の構成を示す図である。It is a figure which shows the structure of the voice coding apparatus 26 which concerns on 7th Embodiment. 第7の実施形態に係る音声符号化装置26の動作を示すフローチャートである。It is a flowchart which shows the operation of the voice coding apparatus 26 which concerns on 7th Embodiment. 第7の実施形態に係る音声復号装置の第1の変形例16Aの構成を示す図である。It is a figure which shows the structure of the 1st modification 16A of the voice decoding apparatus which concerns on 7th Embodiment. 第7の実施形態に係る音声復号装置の第1の変形例16Aの動作を示すフローチャートである。It is a flowchart which shows the operation of the 1st modification 16A of the voice decoding apparatus which concerns on 7th Embodiment. 第7の実施形態に係る音声符号化装置の第1の変形例26Aの構成を示す図である。It is a figure which shows the structure of the 1st modification 26A of the voice coding apparatus which concerns on 7th Embodiment. 第7の実施形態に係る音声符号化装置の第1の変形例26Aの動作を示すフローチャートである。It is a flowchart which shows the operation of the 1st modification 26A of the voice coding apparatus which concerns on 7th Embodiment. 第8の実施形態に係る音声復号装置17の構成を示す図である。It is a figure which shows the structure of the voice decoding apparatus 17 which concerns on 8th Embodiment. 第8の実施形態に係る音声復号装置の動作を示すフローチャートである。It is a flowchart which shows the operation of the voice decoding apparatus which concerns on 8th Embodiment. 第8の実施形態に係る音声符号化装置27の構成を示す図である。It is a figure which shows the structure of the voice coding apparatus 27 which concerns on 8th Embodiment. 第8の実施形態に係る音声符号化装置27の動作を示すフローチャートである。It is a flowchart which shows the operation of the voice coding apparatus 27 which concerns on 8th Embodiment. 第9の実施形態に係る音声復号装置18の構成を示す図である。It is a figure which shows the structure of the voice decoding apparatus 18 which concerns on 9th Embodiment. 第9の実施形態に係る音声復号装置の動作を示すフローチャートである。It is a flowchart which shows the operation of the voice decoding apparatus which concerns on 9th Embodiment. 第9の実施形態に係る音声符号化装置28の構成を示す図である。It is a figure which shows the structure of the voice coding apparatus 28 which concerns on 9th Embodiment. 第9の実施形態に係る音声符号化装置28の動作を示すフローチャートである。It is a flowchart which shows the operation of the voice coding apparatus 28 which concerns on 9th Embodiment. 第9の実施形態に係る音声復号装置の第1の変形例18Aの構成を示す図である。It is a figure which shows the structure of the 1st modification 18A of the voice decoding apparatus which concerns on 9th Embodiment. 第9の実施形態に係る音声復号装置の第1の変形例18Aの動作を示すフローチャートである。It is a flowchart which shows the operation of the 1st modification 18A of the voice decoding apparatus which concerns on 9th Embodiment. 第10の実施形態に係る音声復号装置1の構成を示す図である。It is a figure which shows the structure of the voice decoding apparatus 1 which concerns on 10th Embodiment. 第10の実施形態に係る音声復号装置の動作を示すフローチャートである。It is a flowchart which shows the operation of the voice decoding apparatus which concerns on tenth embodiment. 第10の実施形態に係る音声符号化装置2の構成を示す図である。It is a figure which shows the structure of the voice coding apparatus 2 which concerns on 10th Embodiment. 第10の実施形態に係る音声符号化装置2の動作を示すフローチャートである。It is a flowchart which shows the operation of the voice coding apparatus 2 which concerns on 10th Embodiment. 第11の実施形態に係る音声復号装置100の構成を示す図である。It is a figure which shows the structure of the voice decoding apparatus 100 which concerns on eleventh embodiment. 第11の実施形態に係る音声復号装置の動作を示すフローチャートである。It is a flowchart which shows the operation of the voice decoding apparatus which concerns on eleventh embodiment. 第11の実施形態に係る音声符号化装置200の構成を示す図である。It is a figure which shows the structure of the voice coding apparatus 200 which concerns on eleventh embodiment. 第11の実施形態に係る音声符号化装置200の動作を示すフローチャートである。It is a flowchart which shows the operation of the voice coding apparatus 200 which concerns on eleventh embodiment. 第11の実施形態に係る音声復号装置の第1の変形例100Aの構成を示す図である。It is a figure which shows the structure of the 1st modification 100A of the voice decoding apparatus which concerns on 11th Embodiment. 第11の実施形態に係る音声復号装置の第1の変形例100Aの動作を示すフローチャートである。It is a flowchart which shows the operation of the 1st modification 100A of the voice decoding apparatus which concerns on 11th Embodiment. 第11の実施形態に係る音声符号化装置の第1の変形例100Aの構成を示す図である。It is a figure which shows the structure of the 1st modification 100A of the voice coding apparatus which concerns on 11th Embodiment. 第12の実施形態に係る音声復号装置110の構成を示す図である。It is a figure which shows the structure of the voice decoding apparatus 110 which concerns on 12th Embodiment. 第12の実施形態に係る音声復号装置の動作を示すフローチャートである。It is a flowchart which shows the operation of the voice decoding apparatus which concerns on 12th Embodiment. 第12の実施形態に係る音声符号化装置210の構成を示す図である。It is a figure which shows the structure of the voice coding apparatus 210 which concerns on 12th Embodiment. 第12の実施形態に係る音声符号化装置210の動作を示すフローチャートである。It is a flowchart which shows the operation of the voice coding apparatus 210 which concerns on 12th Embodiment. 第13の実施形態に係る音声復号装置120の構成を示す図である。It is a figure which shows the structure of the voice decoding apparatus 120 which concerns on 13th Embodiment. 第13の実施形態に係る音声復号装置120の動作を示すフローチャートである。It is a flowchart which shows the operation of the voice decoding apparatus 120 which concerns on 13th Embodiment. 第13の実施形態に係る音声符号化装置220の構成を示す図である。It is a figure which shows the structure of the voice coding apparatus 220 which concerns on 13th Embodiment. 第13の実施形態に係る音声符号化装置220の動作を示すフローチャートである。It is a flowchart which shows the operation of the voice coding apparatus 220 which concerns on 13th Embodiment. 第13の実施形態に係る音声復号装置の第1の変形例120Aの構成を示す図である。It is a figure which shows the structure of the 1st modification 120A of the voice decoding apparatus which concerns on 13th Embodiment. 第13の実施形態に係る音声復号装置の第1の変形例120Aの動作を示すフローチャートである。It is a flowchart which shows the operation of the 1st modification 120A of the voice decoding apparatus which concerns on 13th Embodiment. 第13の実施形態に係る音声復号装置の第2の変形例120Bの構成を示す図である。It is a figure which shows the structure of the 2nd modification 120B of the voice decoding apparatus which concerns on 13th Embodiment. 第13の実施形態に係る音声復号装置の第2の変形例120Bの動作を示すフローチャートである。It is a flowchart which shows the operation of the 2nd modification 120B of the voice decoding apparatus which concerns on 13th Embodiment. 第14の実施形態に係る音声復号装置130の構成を示す図である。It is a figure which shows the structure of the voice decoding apparatus 130 which concerns on 14th Embodiment. 第14の実施形態に係る音声復号装置の動作を示すフローチャートである。It is a flowchart which shows the operation of the voice decoding apparatus which concerns on 14th Embodiment. 第14の実施形態に係る音声符号化装置230の構成を示す図である。It is a figure which shows the structure of the voice coding apparatus 230 which concerns on 14th Embodiment. 第14の実施形態に係る音声符号化装置230の動作を示すフローチャートである。It is a flowchart which shows the operation of the voice coding apparatus 230 which concerns on 14th Embodiment. 第15の実施形態に係る音声復号装置140の構成を示す図である。It is a figure which shows the structure of the voice decoding apparatus 140 which concerns on 15th Embodiment. 第15の実施形態に係る音声復号装置の動作を示すフローチャートである。It is a flowchart which shows the operation of the voice decoding apparatus which concerns on 15th Embodiment. 第15の実施形態に係る音声符号化装置240の構成を示す図である。It is a figure which shows the structure of the voice coding apparatus 240 which concerns on 15th Embodiment. 第15の実施形態に係る音声符号化装置240の動作を示すフローチャートである。It is a flowchart which shows the operation of the voice coding apparatus 240 which concerns on 15th Embodiment. 第15の実施形態に係る音声復号装置の第1の変形例140Aの構成を示す図である。It is a figure which shows the structure of the 1st modification 140A of the voice decoding apparatus which concerns on 15th Embodiment. 第15の実施形態に係る音声復号装置の第1の変形例140Aの動作を示すフローチャートである。It is a flowchart which shows the operation of the 1st modification 140A of the voice decoding apparatus which concerns on 15th Embodiment. 第15の実施形態に係る音声復号装置の第2の変形例140Bの構成を示す図である。It is a figure which shows the structure of the 2nd modification 140B of the voice decoding apparatus which concerns on 15th Embodiment. 第16の実施形態に係る音声復号装置150の構成を示す図である。It is a figure which shows the structure of the voice decoding apparatus 150 which concerns on 16th Embodiment. 第16の実施形態に係る音声復号装置の動作を示すフローチャートである。It is a flowchart which shows the operation of the voice decoding apparatus which concerns on 16th Embodiment. 第16の実施形態に係る音声符号化装置250の構成を示す図である。It is a figure which shows the structure of the voice coding apparatus 250 which concerns on 16th Embodiment. 第16の実施形態に係る音声符号化装置250の動作を示すフローチャートである。It is a flowchart which shows the operation of the voice coding apparatus 250 which concerns on 16th Embodiment. 第16の実施形態に係る音声復号装置の第1の変形例150Aの構成を示す図である。It is a figure which shows the structure of the 1st modification 150A of the voice decoding apparatus which concerns on 16th Embodiment. 第16の実施形態に係る音声復号装置の第1の変形例150Aの動作を示すフローチャートである。It is a flowchart which shows the operation of the 1st modification 150A of the voice decoding apparatus which concerns on 16th Embodiment. 第16の実施形態に係る音声復号装置の第2の変形例150Bの構成を示す図である。It is a figure which shows the structure of the 2nd modification 150B of the voice decoding apparatus which concerns on 16th Embodiment. 第17の実施形態に係る音声復号装置160の構成を示す図である。It is a figure which shows the structure of the voice decoding apparatus 160 which concerns on 17th Embodiment. 第17の実施形態に係る音声復号装置の動作を示すフローチャートである。It is a flowchart which shows the operation of the voice decoding apparatus which concerns on 17th Embodiment. 第17の実施形態に係る音声符号化装置260の構成を示す図である。It is a figure which shows the structure of the voice coding apparatus 260 which concerns on 17th Embodiment. 第17の実施形態に係る音声符号化装置260の動作を示すフローチャートである。It is a flowchart which shows the operation of the voice coding apparatus 260 which concerns on 17th Embodiment. 第17の実施形態に係る音声復号装置の第1の変形例160Aの構成を示す図である。It is a figure which shows the structure of the 1st modification 160A of the voice decoding apparatus which concerns on 17th Embodiment. 第17の実施形態に係る音声復号装置の第1の変形例160Aの動作を示すフローチャートである。It is a flowchart which shows the operation of the 1st modification 160A of the voice decoding apparatus which concerns on 17th Embodiment. 第17の実施形態に係る音声復号装置の第2の変形例160Bの構成を示す図である。It is a figure which shows the structure of the 2nd modification 160B of the voice decoding apparatus which concerns on 17th Embodiment. 第18の実施形態に係る音声復号装置170の構成を示す図である。It is a figure which shows the structure of the voice decoding apparatus 170 which concerns on 18th Embodiment. 第18の実施形態に係る音声復号装置の動作を示すフローチャートである。It is a flowchart which shows the operation of the voice decoding apparatus which concerns on 18th Embodiment. 第18の実施形態に係る音声符号化装置270の構成を示す図である。It is a figure which shows the structure of the voice coding apparatus 270 which concerns on 18th Embodiment. 第18の実施形態に係る音声符号化装置270の動作を示すフローチャートである。It is a flowchart which shows the operation of the voice coding apparatus 270 which concerns on 18th Embodiment. 第19の実施形態に係る音声復号装置180の構成を示す図である。It is a figure which shows the structure of the voice decoding apparatus 180 which concerns on 19th Embodiment. 第19の実施形態に係る音声復号装置の動作を示すフローチャートである。It is a flowchart which shows the operation of the voice decoding apparatus which concerns on 19th Embodiment. 第19の実施形態に係る音声符号化装置280の構成を示す図である。It is a figure which shows the structure of the voice coding apparatus 280 which concerns on 19th Embodiment. 第19の実施形態に係る音声符号化装置280の動作を示すフローチャートである。It is a flowchart which shows the operation of the voice coding apparatus 280 which concerns on 19th Embodiment. 第20の実施形態に係る音声復号装置190の構成を示す図である。It is a figure which shows the structure of the voice decoding apparatus 190 which concerns on 20th Embodiment. 第20の実施形態に係る音声復号装置の動作を示すフローチャートである。It is a flowchart which shows the operation of the voice decoding apparatus which concerns on 20th Embodiment. 第20の実施形態に係る音声符号化装置290の構成を示す図である。It is a figure which shows the structure of the voice coding apparatus 290 which concerns on 20th Embodiment. 第20の実施形態に係る音声符号化装置290の動作を示すフローチャートである。It is a flowchart which shows the operation of the voice coding apparatus 290 which concerns on 20th Embodiment. 第21の実施形態に係る音声復号装置300の構成を示す図である。It is a figure which shows the structure of the voice decoding apparatus 300 which concerns on 21st Embodiment. 第21の実施形態に係る音声復号装置の動作を示すフローチャートである。It is a flowchart which shows the operation of the voice decoding apparatus which concerns on 21st Embodiment. 第21の実施形態に係る音声符号化装置400の構成を示す図である。It is a figure which shows the structure of the voice coding apparatus 400 which concerns on 21st Embodiment. 第21の実施形態に係る音声符号化装置400の動作を示すフローチャートである。It is a flowchart which shows the operation of the voice coding apparatus 400 which concerns on 21st Embodiment. 第22の実施形態に係る音声復号装置310の構成を示す図である。It is a figure which shows the structure of the voice decoding apparatus 310 which concerns on 22nd Embodiment. 第22の実施形態に係る音声復号装置の動作を示すフローチャートである。It is a flowchart which shows the operation of the voice decoding apparatus which concerns on 22nd Embodiment. 第22の実施形態に係る音声符号化装置410の構成を示す図である。It is a figure which shows the structure of the voice coding apparatus 410 which concerns on 22nd Embodiment. 第22の実施形態に係る音声符号化装置410の動作を示すフローチャートである。It is a flowchart which shows the operation of the voice coding apparatus 410 which concerns on 22nd Embodiment. 第23の実施形態に係る音声復号装置320の構成を示す図である。It is a figure which shows the structure of the voice decoding apparatus 320 which concerns on 23rd Embodiment. 第23の実施形態に係る音声復号装置の動作を示すフローチャートである。It is a flowchart which shows the operation of the voice decoding apparatus which concerns on 23rd Embodiment. 第23の実施形態に係る音声符号化装置420の構成を示す図である。It is a figure which shows the structure of the voice coding apparatus 420 which concerns on 23rd Embodiment. 第23の実施形態に係る音声符号化装置420の動作を示すフローチャートである。It is a flowchart which shows the operation of the voice coding apparatus 420 which concerns on 23rd Embodiment. 第23の実施形態の第1の変形例に係る音声復号装置320Aの構成を示す図である。It is a figure which shows the structure of the audio decoding apparatus 320A which concerns on 1st modification of 23rd Embodiment. 第23の実施形態の第1の変形例に係る音声復号装置320Aの動作を示すフローチャートである。It is a flowchart which shows the operation of the voice decoding apparatus 320A which concerns on 1st modification of 23rd Embodiment. 第24の実施形態に係る音声復号装置330の構成を示す図である。It is a figure which shows the structure of the audio decoding apparatus 330 which concerns on 24th Embodiment. 第24の実施形態に係る音声復号装置の動作を示すフローチャートである。It is a flowchart which shows the operation of the voice decoding apparatus which concerns on 24th Embodiment. 第24の実施形態に係る音声符号化装置430の構成を示す図である。It is a figure which shows the structure of the voice coding apparatus 430 which concerns on 24th Embodiment. 第24の実施形態に係る音声符号化装置430の動作を示すフローチャートである。It is a flowchart which shows the operation of the voice coding apparatus 430 which concerns on 24th Embodiment. 第25の実施形態に係る音声復号装置340の構成を示す図である。It is a figure which shows the structure of the audio decoding apparatus 340 which concerns on the 25th Embodiment. 第25の実施形態に係る音声復号装置の動作を示すフローチャートである。It is a flowchart which shows the operation of the voice decoding apparatus which concerns on 25th Embodiment. 第25の実施形態に係る音声符号化装置440の構成を示す図である。It is a figure which shows the structure of the voice coding apparatus 440 which concerns on 25th Embodiment. 第25の実施形態に係る音声符号化装置440の動作を示すフローチャートである。It is a flowchart which shows the operation of the voice coding apparatus 440 which concerns on 25th Embodiment. 第26の実施形態に係る音声復号装置350の構成を示す図である。It is a figure which shows the structure of the voice decoding apparatus 350 which concerns on the 26th Embodiment. 第26の実施形態に係る音声復号装置の動作を示すフローチャートである。It is a flowchart which shows the operation of the voice decoding apparatus which concerns on 26th Embodiment. 第26の実施形態に係る音声符号化装置450の構成を示す図である。It is a figure which shows the structure of the voice coding apparatus 450 which concerns on the 26th Embodiment. 第26の実施形態に係る音声符号化装置450の動作を示すフローチャートである。It is a flowchart which shows the operation of the voice coding apparatus 450 which concerns on 26th Embodiment. 第26の実施形態の第1の変形例に係る音声復号装置350Aの構成を示す図である。It is a figure which shows the structure of the voice decoding apparatus 350A which concerns on the 1st modification of the 26th Embodiment. 第26の実施形態の第1の変形例に係る音声復号装置350Aの動作を示すフローチャートである。It is a flowchart which shows the operation of the voice decoding apparatus 350A which concerns on 1st modification of 26th Embodiment. 第7の実施形態に係る音声復号装置の第2の変形例16Bの構成を示す図である。It is a figure which shows the structure of the 2nd modification 16B of the voice decoding apparatus which concerns on 7th Embodiment. 第7の実施形態に係る音声復号装置の第2の変形例16Bの動作を示すフローチャートである。It is a flowchart which shows the operation of the 2nd modification 16B of the voice decoding apparatus which concerns on 7th Embodiment. 第7の実施形態に係る音声復号装置の第3の変形例16Cの構成を示す図である。It is a figure which shows the structure of the 3rd modification 16C of the voice decoding apparatus which concerns on 7th Embodiment. 第7の実施形態に係る音声復号装置の第3の変形例16Cの動作を示すフローチャートである。It is a flowchart which shows the operation of the 3rd modification 16C of the voice decoding apparatus which concerns on 7th Embodiment. 第7の実施形態に係る音声復号装置の第4の変形例16Dの構成を示す図である。It is a figure which shows the structure of the 4th modification 16D of the voice decoding apparatus which concerns on 7th Embodiment. 第7の実施形態に係る音声復号装置の第4の変形例16Dの動作を示すフローチャートである。It is a flowchart which shows the operation of the 4th modification 16D of the voice decoding apparatus which concerns on 7th Embodiment. 第7の実施形態に係る音声復号装置の第5の変形例16Eの構成を示す図である。It is a figure which shows the structure of the 5th modification 16E of the voice decoding apparatus which concerns on 7th Embodiment. 第7の実施形態に係る音声復号装置の第5の変形例16Eの動作を示すフローチャートである。It is a flowchart which shows the operation of the 5th modification 16E of the voice decoding apparatus which concerns on 7th Embodiment. 第8の実施形態に係る音声復号装置の第1の変形例17Aの構成を示す図である。It is a figure which shows the structure of the 1st modification 17A of the voice decoding apparatus which concerns on 8th Embodiment. 第8の実施形態に係る音声復号装置の第1の変形例17Aの動作を示すフローチャートである。It is a flowchart which shows the operation of the 1st modification 17A of the voice decoding apparatus which concerns on 8th Embodiment. 第8の実施形態に係る音声復号装置の第2の変形例17Bの構成を示す図である。It is a figure which shows the structure of the 2nd modification 17B of the voice decoding apparatus which concerns on 8th Embodiment. 第8の実施形態に係る音声復号装置の第2の変形例17Bの動作を示すフローチャートである。It is a flowchart which shows the operation of the 2nd modification 17B of the voice decoding apparatus which concerns on 8th Embodiment. 第8の実施形態に係る音声復号装置の第3の変形例17Cの構成を示す図である。It is a figure which shows the structure of the 3rd modification 17C of the voice decoding apparatus which concerns on 8th Embodiment. 第8の実施形態に係る音声復号装置の第3の変形例17Cの動作を示すフローチャートである。It is a flowchart which shows the operation of the 3rd modification 17C of the voice decoding apparatus which concerns on 8th Embodiment. 第8の実施形態に係る音声復号装置の第4の変形例17Dの構成を示す図である。It is a figure which shows the structure of the 4th modification 17D of the voice decoding apparatus which concerns on 8th Embodiment. 第8の実施形態に係る音声復号装置の第4の変形例17Dの動作を示すフローチャートである。It is a flowchart which shows the operation of the 4th modification 17D of the voice decoding apparatus which concerns on 8th Embodiment. 第9の実施形態に係る音声復号装置の第2の変形例18Bの構成を示す図である。It is a figure which shows the structure of the 2nd modification 18B of the voice decoding apparatus which concerns on 9th Embodiment. 第9の実施形態に係る音声復号装置の第2の変形例18Bの動作を示すフローチャートである。It is a flowchart which shows the operation of the 2nd modification 18B of the voice decoding apparatus which concerns on 9th Embodiment. 第9の実施形態に係る音声復号装置の第3の変形例18Cの構成を示す図である。It is a figure which shows the structure of the 3rd modification 18C of the voice decoding apparatus which concerns on 9th Embodiment. 第9の実施形態に係る音声復号装置の第3の変形例18Cの動作を示すフローチャートである。It is a flowchart which shows the operation of the 3rd modification 18C of the voice decoding apparatus which concerns on 9th Embodiment. 第9の実施形態に係る音声復号装置の第4の変形例18Dの構成を示す図である。It is a figure which shows the structure of the 4th modification 18D of the voice decoding apparatus which concerns on 9th Embodiment. 第9の実施形態に係る音声復号装置の第4の変形例18Dの動作を示すフローチャートである。It is a flowchart which shows the operation of the 4th modification 18D of the voice decoding apparatus which concerns on 9th Embodiment. 第9の実施形態に係る音声復号装置の第5の変形例18Eの構成を示す図である。It is a figure which shows the structure of the 5th modification 18E of the voice decoding apparatus which concerns on 9th Embodiment. 第9の実施形態に係る音声復号装置の第5の変形例18Eの動作を示すフローチャートである。It is a flowchart which shows the operation of the 5th modification 18E of the voice decoding apparatus which concerns on 9th Embodiment. 第9の実施形態に係る音声復号装置の第6の変形例18Fの構成を示す図である。It is a figure which shows the structure of the 6th modification 18F of the voice decoding apparatus which concerns on 9th Embodiment. 第9の実施形態に係る音声復号装置の第6の変形例18Fの動作を示すフローチャートである。It is a flowchart which shows the operation of the 6th modification 18F of the voice decoding apparatus which concerns on 9th Embodiment. 第9の実施形態に係る音声復号装置の第7の変形例18Gの構成を示す図である。It is a figure which shows the structure of the 7th modification 18G of the voice decoding apparatus which concerns on 9th Embodiment. 第9の実施形態に係る音声復号装置の第7の変形例18Gの動作を示すフローチャートである。It is a flowchart which shows the operation of the 7th modification 18G of the voice decoding apparatus which concerns on 9th Embodiment. 第9の実施形態に係る音声復号装置の第8の変形例18Hの構成を示す図である。It is a figure which shows the structure of the 8th modification 18H of the voice decoding apparatus which concerns on 9th Embodiment. 第9の実施形態に係る音声復号装置の第8の変形例18Hの動作を示すフローチャートである。It is a flowchart which shows the operation of the 8th modification 18H of the voice decoding apparatus which concerns on 9th Embodiment. 第9の実施形態に係る音声復号装置の第9の変形例18Iの構成を示す図である。It is a figure which shows the structure of the 9th modification 18I of the voice decoding apparatus which concerns on 9th Embodiment. 第9の実施形態に係る音声復号装置の第9の変形例18Iの動作を示すフローチャートである。It is a flowchart which shows the operation of the 9th modification 18I of the voice decoding apparatus which concerns on 9th Embodiment. 第13の実施形態に係る音声復号装置の第3の変形例120Cの構成を示す図である。It is a figure which shows the structure of the 3rd modification 120C of the voice decoding apparatus which concerns on 13th Embodiment. 第13の実施形態に係る音声復号装置の第3の変形例120Cの動作を示すフローチャートである。It is a flowchart which shows the operation of the 3rd modification 120C of the voice decoding apparatus which concerns on 13th Embodiment. 第13の実施形態に係る音声復号装置の第4の変形例120Dの構成を示す図である。It is a figure which shows the structure of the 4th modification 120D of the voice decoding apparatus which concerns on 13th Embodiment. 第13の実施形態に係る音声復号装置の第4の変形例120Dの動作を示すフローチャートである。It is a flowchart which shows the operation of the 4th modification 120D of the voice decoding apparatus which concerns on 13th Embodiment. 第13の実施形態に係る音声復号装置の第5の変形例120Eの構成を示す図である。It is a figure which shows the structure of the 5th modification 120E of the voice decoding apparatus which concerns on 13th Embodiment. 第13の実施形態に係る音声復号装置の第5の変形例120Eの動作を示すフローチャートである。It is a flowchart which shows the operation of the 5th modification 120E of the voice decoding apparatus which concerns on 13th Embodiment. 第13の実施形態に係る音声復号装置の第6の変形例120Fの構成を示す図である。It is a figure which shows the structure of the 6th modification 120F of the voice decoding apparatus which concerns on 13th Embodiment. 第13の実施形態に係る音声復号装置の第6の変形例120Fの動作を示すフローチャートである。It is a flowchart which shows the operation of the 6th modification 120F of the voice decoding apparatus which concerns on 13th Embodiment. 第13の実施形態に係る音声復号装置の第7の変形例120Gの構成を示す図である。It is a figure which shows the structure of the 7th modification 120G of the voice decoding apparatus which concerns on 13th Embodiment. 第13の実施形態に係る音声復号装置の第7の変形例120Gの動作を示すフローチャートである。It is a flowchart which shows the operation of the 7th modification 120G of the voice decoding apparatus which concerns on 13th Embodiment. 第13の実施形態に係る音声復号装置の第8の変形例120Hの構成を示す図である。It is a figure which shows the structure of the 8th modification 120H of the voice decoding apparatus which concerns on 13th Embodiment. 第13の実施形態に係る音声復号装置の第8の変形例120Hの動作を示すフローチャートである。It is a flowchart which shows the operation of the 8th modification 120H of the voice decoding apparatus which concerns on 13th Embodiment. 第13の実施形態に係る音声復号装置の第9の変形例120Iの構成を示す図である。It is a figure which shows the structure of the 9th modification 120I of the voice decoding apparatus which concerns on 13th Embodiment. 第13の実施形態に係る音声復号装置の第9の変形例120Iの動作を示すフローチャートである。It is a flowchart which shows the operation of the 9th modification 120I of the voice decoding apparatus which concerns on 13th Embodiment. 第13の実施形態に係る音声復号装置の第10の変形例120Jの構成を示す図である。It is a figure which shows the structure of the tenth modification 120J of the voice decoding apparatus which concerns on thirteenth embodiment. 第13の実施形態に係る音声復号装置の第10の変形例120Jの動作を示すフローチャートである。It is a flowchart which shows the operation of the tenth modification 120J of the voice decoding apparatus which concerns on thirteenth embodiment. 第13の実施形態に係る音声復号装置の第11の変形例120Kの構成を示す図である。It is a figure which shows the structure of the eleventh modification 120K of the voice decoding apparatus which concerns on 13th Embodiment. 第13の実施形態に係る音声復号装置の第11の変形例120Kの動作を示すフローチャートである。It is a flowchart which shows the operation of the eleventh modification 120K of the voice decoding apparatus which concerns on 13th Embodiment. 第13の実施形態に係る音声復号装置の第12の変形例120Lの構成を示す図である。It is a figure which shows the structure of the twelfth modification 120L of the voice decoding apparatus which concerns on thirteenth embodiment. 第13の実施形態に係る音声復号装置の第12の変形例120Lの動作を示すフローチャートである。It is a flowchart which shows the operation of the twelfth modification 120L of the voice decoding apparatus which concerns on thirteenth embodiment. 第13の実施形態に係る音声復号装置の第13の変形例120Mの構成を示す図である。It is a figure which shows the structure of the 13th modification 120M of the audio decoding apparatus which concerns on 13th Embodiment. 第13の実施形態に係る音声復号装置の第13の変形例120Mの動作を示すフローチャートである。It is a flowchart which shows the operation of the 13th modification 120M of the voice decoding apparatus which concerns on 13th Embodiment. 第13の実施形態に係る音声復号装置の第14の変形例120Nの構成を示す図である。It is a figure which shows the structure of the 14th modification 120N of the voice decoding apparatus which concerns on 13th Embodiment. 第13の実施形態に係る音声復号装置の第14の変形例120Nの動作を示すフローチャートである。It is a flowchart which shows the operation of the 14th modification 120N of the voice decoding apparatus which concerns on 13th Embodiment. 第15の実施形態に係る音声復号装置の第3の変形例140Cの構成を示す図である。It is a figure which shows the structure of the 3rd modification 140C of the voice decoding apparatus which concerns on 15th Embodiment. 第15の実施形態に係る音声復号装置の第3の変形例140Cの動作を示すフローチャートである。It is a flowchart which shows the operation of the 3rd modification 140C of the voice decoding apparatus which concerns on 15th Embodiment. 第15の実施形態に係る音声復号装置の第4の変形例140Dの構成を示す図である。It is a figure which shows the structure of the 4th modification 140D of the voice decoding apparatus which concerns on 15th Embodiment. 第15の実施形態に係る音声復号装置の第4の変形例140Dの動作を示すフローチャートである。It is a flowchart which shows the operation of the 4th modification 140D of the voice decoding apparatus which concerns on 15th Embodiment. 第15の実施形態に係る音声復号装置の第5の変形例140Eの構成を示す図である。It is a figure which shows the structure of the 5th modification 140E of the voice decoding apparatus which concerns on 15th Embodiment. 第15の実施形態に係る音声復号装置の第5の変形例140Eの動作を示すフローチャートである。It is a flowchart which shows the operation of the 5th modification 140E of the voice decoding apparatus which concerns on 15th Embodiment. 第15の実施形態に係る音声復号装置の第6の変形例140Fの構成を示す図である。It is a figure which shows the structure of the 6th modification 140F of the voice decoding apparatus which concerns on 15th Embodiment. 第15の実施形態に係る音声復号装置の第6の変形例140Fの動作を示すフローチャートである。It is a flowchart which shows the operation of the 6th modification 140F of the voice decoding apparatus which concerns on 15th Embodiment. 第15の実施形態に係る音声復号装置の第7の変形例140Gの構成を示す図である。It is a figure which shows the structure of the 7th modification 140G of the voice decoding apparatus which concerns on 15th Embodiment. 第15の実施形態に係る音声復号装置の第7の変形例140Gの動作を示すフローチャートである。It is a flowchart which shows the operation of the 7th modification 140G of the voice decoding apparatus which concerns on 15th Embodiment. 第15の実施形態に係る音声復号装置の第8の変形例140Hの構成を示す図である。It is a figure which shows the structure of the 8th modification 140H of the voice decoding apparatus which concerns on 15th Embodiment. 第15の実施形態に係る音声復号装置の第8の変形例140Hの動作を示すフローチャートである。It is a flowchart which shows the operation of the 8th modification 140H of the voice decoding apparatus which concerns on 15th Embodiment. 第15の実施形態に係る音声復号装置の第9の変形例140Iの構成を示す図である。It is a figure which shows the structure of the 9th modification 140I of the voice decoding apparatus which concerns on 15th Embodiment. 第15の実施形態に係る音声復号装置の第9の変形例140Iの動作を示すフローチャートである。It is a flowchart which shows the operation of the 9th modification 140I of the voice decoding apparatus which concerns on 15th Embodiment. 第15の実施形態に係る音声復号装置の第10の変形例140Jの構成を示す図である。It is a figure which shows the structure of the tenth modification 140J of the voice decoding apparatus which concerns on the fifteenth embodiment. 第15の実施形態に係る音声復号装置の第10の変形例140Jの動作を示すフローチャートである。It is a flowchart which shows the operation of the tenth modification 140J of the voice decoding apparatus which concerns on the fifteenth embodiment. 第15の実施形態に係る音声復号装置の第11の変形例140Kの構成を示す図である。It is a figure which shows the structure of the eleventh modification 140K of the voice decoding apparatus which concerns on the fifteenth embodiment. 第15の実施形態に係る音声復号装置の第11の変形例140Kの動作を示すフローチャートである。It is a flowchart which shows the operation of the eleventh modification 140K of the voice decoding apparatus which concerns on the fifteenth embodiment. 第15の実施形態に係る音声復号装置の第12の変形例140Lの構成を示す図である。It is a figure which shows the structure of the twelfth modification 140L of the voice decoding apparatus which concerns on the fifteenth embodiment. 第15の実施形態に係る音声復号装置の第12の変形例140Lの動作を示すフローチャートである。It is a flowchart which shows the operation of the twelfth modification 140L of the voice decoding apparatus which concerns on the fifteenth embodiment. 第15の実施形態に係る音声復号装置の第13の変形例140Mの構成を示す図である。It is a figure which shows the structure of the 13th modification 140M of the voice decoding apparatus which concerns on 15th Embodiment. 第15の実施形態に係る音声復号装置の第13の変形例140Mの動作を示すフローチャートである。It is a flowchart which shows the operation of the 13th modification 140M of the audio decoding apparatus which concerns on 15th Embodiment. 第15の実施形態に係る音声復号装置の第14の変形例140Nの構成を示す図である。It is a figure which shows the structure of the 14th modification 140N of the voice decoding apparatus which concerns on 15th Embodiment. 第15の実施形態に係る音声復号装置の第14の変形例140Nの動作を示すフローチャートである。It is a flowchart which shows the operation of the 14th modification 140N of the voice decoding apparatus which concerns on 15th Embodiment. 第16の実施形態に係る音声復号装置の第3の変形例150Cの構成を示す図である。It is a figure which shows the structure of the 3rd modification 150C of the voice decoding apparatus which concerns on 16th Embodiment. 第16の実施形態に係る音声復号装置の第3の変形例150Cの動作を示すフローチャートである。It is a flowchart which shows the operation of the 3rd modification 150C of the voice decoding apparatus which concerns on 16th Embodiment. 第16の実施形態に係る音声復号装置の第4の変形例150Dの構成を示す図である。It is a figure which shows the structure of the 4th modification 150D of the voice decoding apparatus which concerns on 16th Embodiment. 第16の実施形態に係る音声復号装置の第4の変形例150Dの動作を示すフローチャートである。It is a flowchart which shows the operation of the 4th modification 150D of the voice decoding apparatus which concerns on 16th Embodiment. 第16の実施形態に係る音声復号装置の第5の変形例150Eの構成を示す図である。It is a figure which shows the structure of the 5th modification 150E of the voice decoding apparatus which concerns on 16th Embodiment. 第16の実施形態に係る音声復号装置の第5の変形例150Eの動作を示すフローチャートである。It is a flowchart which shows the operation of the 5th modification 150E of the voice decoding apparatus which concerns on 16th Embodiment. 第16の実施形態に係る音声復号装置の第6の変形例150Fの構成を示す図である。It is a figure which shows the structure of the 6th modification 150F of the voice decoding apparatus which concerns on 16th Embodiment. 第16の実施形態に係る音声復号装置の第6の変形例150Fの動作を示すフローチャートである。It is a flowchart which shows the operation of the 6th modification 150F of the voice decoding apparatus which concerns on 16th Embodiment. 第16の実施形態に係る音声復号装置の第7の変形例150Gの構成を示す図である。It is a figure which shows the structure of the 7th modification 150G of the voice decoding apparatus which concerns on 16th Embodiment. 第16の実施形態に係る音声復号装置の第7の変形例150Gの動作を示すフローチャートである。It is a flowchart which shows the operation of the 7th modification 150G of the voice decoding apparatus which concerns on 16th Embodiment. 第16の実施形態に係る音声復号装置の第8の変形例150Hの構成を示す図である。It is a figure which shows the structure of the 8th modification 150H of the voice decoding apparatus which concerns on 16th Embodiment. 第16の実施形態に係る音声復号装置の第8の変形例150Hの動作を示すフローチャートである。It is a flowchart which shows the operation of the 8th modification 150H of the voice decoding apparatus which concerns on 16th Embodiment. 第16の実施形態に係る音声復号装置の第9の変形例150Iの構成を示す図である。It is a figure which shows the structure of the 9th modification 150I of the voice decoding apparatus which concerns on 16th Embodiment. 第16の実施形態に係る音声復号装置の第9の変形例150Iの動作を示すフローチャートである。It is a flowchart which shows the operation of the 9th modification 150I of the voice decoding apparatus which concerns on 16th Embodiment. 第16の実施形態に係る音声復号装置の第10の変形例150Jの構成を示す図である。It is a figure which shows the structure of the tenth modification 150J of the voice decoding apparatus which concerns on 16th Embodiment. 第16の実施形態に係る音声復号装置の第10の変形例150Jの動作を示すフローチャートである。It is a flowchart which shows the operation of the tenth modification 150J of the voice decoding apparatus which concerns on 16th Embodiment. 第16の実施形態に係る音声復号装置の第11の変形例150Kの構成を示す図である。It is a figure which shows the structure of the eleventh modification 150K of the voice decoding apparatus which concerns on 16th Embodiment. 第16の実施形態に係る音声復号装置の第11の変形例150Kの動作を示すフローチャートである。It is a flowchart which shows the operation of the eleventh modification 150K of the voice decoding apparatus which concerns on 16th Embodiment. 第16の実施形態に係る音声復号装置の第12の変形例150Lの構成を示す図である。It is a figure which shows the structure of the twelfth modification 150L of the voice decoding apparatus which concerns on the 16th Embodiment. 第16の実施形態に係る音声復号装置の第12の変形例150Lの動作を示すフローチャートである。It is a flowchart which shows the operation of the twelfth modification 150L of the voice decoding apparatus which concerns on 16th Embodiment. 第16の実施形態に係る音声復号装置の第13の変形例150Mの構成を示す図である。It is a figure which shows the structure of the 13th modification 150M of the voice decoding apparatus which concerns on 16th Embodiment. 第16の実施形態に係る音声復号装置の第13の変形例150Mの動作を示すフローチャートである。It is a flowchart which shows the operation of the 13th modification 150M of the voice decoding apparatus which concerns on 16th Embodiment. 第16の実施形態に係る音声復号装置の第14の変形例150Nの構成を示す図である。It is a figure which shows the structure of the 14th modification 150N of the voice decoding apparatus which concerns on 16th Embodiment. 第16の実施形態に係る音声復号装置の第14の変形例150Nの動作を示すフローチャートである。It is a flowchart which shows the operation of the 14th modification 150N of the voice decoding apparatus which concerns on 16th Embodiment. 第17の実施形態に係る音声復号装置の第3の変形例160Cの構成を示す図である。It is a figure which shows the structure of the 3rd modification 160C of the voice decoding apparatus which concerns on 17th Embodiment. 第17の実施形態に係る音声復号装置の第3の変形例160Cの動作を示すフローチャートである。It is a flowchart which shows the operation of the 3rd modification 160C of the voice decoding apparatus which concerns on 17th Embodiment. 第17の実施形態に係る音声復号装置の第4の変形例160Dの構成を示す図である。It is a figure which shows the structure of the 4th modification 160D of the voice decoding apparatus which concerns on 17th Embodiment. 第17の実施形態に係る音声復号装置の第4の変形例160Dの動作を示すフローチャートである。It is a flowchart which shows the operation of the 4th modification 160D of the voice decoding apparatus which concerns on 17th Embodiment. 第17の実施形態に係る音声復号装置の第5の変形例160Eの構成を示す図である。It is a figure which shows the structure of the 5th modification 160E of the voice decoding apparatus which concerns on 17th Embodiment. 第17の実施形態に係る音声復号装置の第5の変形例160Eの動作を示すフローチャートである。It is a flowchart which shows the operation of the 5th modification 160E of the voice decoding apparatus which concerns on 17th Embodiment. 第17の実施形態に係る音声復号装置の第6の変形例160Fの構成を示す図である。It is a figure which shows the structure of the 6th modification 160F of the voice decoding apparatus which concerns on 17th Embodiment. 第17の実施形態に係る音声復号装置の第6の変形例160Fの動作を示すフローチャートである。It is a flowchart which shows the operation of the 6th modification 160F of the voice decoding apparatus which concerns on 17th Embodiment. 第17の実施形態に係る音声復号装置の第7の変形例160Gの構成を示す図である。It is a figure which shows the structure of the 7th modification 160G of the voice decoding apparatus which concerns on 17th Embodiment. 第17の実施形態に係る音声復号装置の第7の変形例160Gの動作を示すフローチャートである。It is a flowchart which shows the operation of the 7th modification 160G of the voice decoding apparatus which concerns on 17th Embodiment. 第17の実施形態に係る音声復号装置の第8の変形例160Hの構成を示す図である。It is a figure which shows the structure of the 8th modification 160H of the voice decoding apparatus which concerns on 17th Embodiment. 第17の実施形態に係る音声復号装置の第8の変形例160Hの動作を示すフローチャートである。It is a flowchart which shows the operation of the 8th modification 160H of the voice decoding apparatus which concerns on 17th Embodiment. 第17の実施形態に係る音声復号装置の第9の変形例160Iの構成を示す図である。It is a figure which shows the structure of the 9th modification 160I of the voice decoding apparatus which concerns on 17th Embodiment. 第17の実施形態に係る音声復号装置の第9の変形例160Iの動作を示すフローチャートである。It is a flowchart which shows the operation of the 9th modification 160I of the voice decoding apparatus which concerns on 17th Embodiment. 第17の実施形態に係る音声復号装置の第10の変形例160Jの構成を示す図である。It is a figure which shows the structure of the tenth modification 160J of the voice decoding apparatus which concerns on 17th Embodiment. 第17の実施形態に係る音声復号装置の第10の変形例160Jの動作を示すフローチャートである。It is a flowchart which shows the operation of the tenth modification 160J of the voice decoding apparatus which concerns on 17th Embodiment. 第17の実施形態に係る音声復号装置の第11の変形例160Kの構成を示す図である。It is a figure which shows the structure of the eleventh modification 160K of the voice decoding apparatus which concerns on 17th Embodiment. 第17の実施形態に係る音声復号装置の第11の変形例160Kの動作を示すフローチャートである。It is a flowchart which shows the operation of the eleventh modification 160K of the voice decoding apparatus which concerns on 17th Embodiment. 第17の実施形態に係る音声復号装置の第12の変形例160Lの構成を示す図である。It is a figure which shows the structure of the twelfth modification 160L of the audio decoding apparatus which concerns on 17th Embodiment. 第17の実施形態に係る音声復号装置の第12の変形例160Lの動作を示すフローチャートである。It is a flowchart which shows the operation of the twelfth modification 160L of the audio decoding apparatus which concerns on 17th Embodiment. 第17の実施形態に係る音声復号装置の第13の変形例160Mの構成を示す図である。It is a figure which shows the structure of the 13th modification 160M of the audio decoding apparatus which concerns on 17th Embodiment. 第17の実施形態に係る音声復号装置の第13の変形例160Mの動作を示すフローチャートである。It is a flowchart which shows the operation of the 13th modification 160M of the audio decoding apparatus which concerns on 17th Embodiment. 第17の実施形態に係る音声復号装置の第14の変形例160Nの構成を示す図である。It is a figure which shows the structure of the 14th modification 160N of the voice decoding apparatus which concerns on 17th Embodiment. 第17の実施形態に係る音声復号装置の第14の変形例160Nの動作を示すフローチャートである。It is a flowchart which shows the operation of the 14th modification 160N of the voice decoding apparatus which concerns on 17th Embodiment. 第18の実施形態に係る音声復号装置の第1の変形例170Aの構成を示す図である。It is a figure which shows the structure of the 1st modification 170A of the voice decoding apparatus which concerns on 18th Embodiment. 第18の実施形態に係る音声復号装置の第1の変形例170Aの動作を示すフローチャートである。It is a flowchart which shows the operation of the 1st modification 170A of the voice decoding apparatus which concerns on 18th Embodiment. 第18の実施形態に係る音声復号装置の第2の変形例170Bの構成を示す図である。It is a figure which shows the structure of the 2nd modification 170B of the voice decoding apparatus which concerns on 18th Embodiment. 第18の実施形態に係る音声復号装置の第2の変形例170Bの動作を示すフローチャートである。It is a flowchart which shows the operation of the 2nd modification 170B of the voice decoding apparatus which concerns on 18th Embodiment. 第18の実施形態に係る音声復号装置の第3の変形例170Cの構成を示す図である。It is a figure which shows the structure of the 3rd modification 170C of the voice decoding apparatus which concerns on 18th Embodiment. 第18の実施形態に係る音声復号装置の第3の変形例170Cの動作を示すフローチャートである。It is a flowchart which shows the operation of the 3rd modification 170C of the voice decoding apparatus which concerns on 18th Embodiment. 第18の実施形態に係る音声復号装置の第4の変形例170Dの構成を示す図である。It is a figure which shows the structure of the 4th modification 170D of the voice decoding apparatus which concerns on 18th Embodiment. 第18の実施形態に係る音声復号装置の第4の変形例170Dの動作を示すフローチャートである。It is a flowchart which shows the operation of the 4th modification 170D of the voice decoding apparatus which concerns on 18th Embodiment. 第19の実施形態に係る音声復号装置の第1の変形例180Aの構成を示す図である。It is a figure which shows the structure of the 1st modification 180A of the voice decoding apparatus which concerns on 19th Embodiment. 第19の実施形態に係る音声復号装置の第1の変形例180Aの動作を示すフローチャートである。It is a flowchart which shows the operation of the 1st modification 180A of the voice decoding apparatus which concerns on 19th Embodiment. 第19の実施形態に係る音声復号装置の第2の変形例180Bの構成を示す図である。It is a figure which shows the structure of the 2nd modification 180B of the voice decoding apparatus which concerns on 19th Embodiment. 第19の実施形態に係る音声復号装置の第2の変形例180Bの動作を示すフローチャートである。It is a flowchart which shows the operation of the 2nd modification 180B of the voice decoding apparatus which concerns on 19th Embodiment. 第19の実施形態に係る音声復号装置の第3の変形例180Cの構成を示す図である。It is a figure which shows the structure of the 3rd modification 180C of the voice decoding apparatus which concerns on 19th Embodiment. 第19の実施形態に係る音声復号装置の第3の変形例180Cの動作を示すフローチャートである。It is a flowchart which shows the operation of the 3rd modification 180C of the voice decoding apparatus which concerns on 19th Embodiment. 第19の実施形態に係る音声復号装置の第4の変形例180Dの構成を示す図である。It is a figure which shows the structure of the 4th modification 180D of the voice decoding apparatus which concerns on 19th Embodiment. 第19の実施形態に係る音声復号装置の第4の変形例180Dの動作を示すフローチャートである。It is a flowchart which shows the operation of the 4th modification 180D of the voice decoding apparatus which concerns on 19th Embodiment. 第20の実施形態に係る音声復号装置の第1の変形例190Aの構成を示す図である。It is a figure which shows the structure of the 1st modification 190A of the voice decoding apparatus which concerns on 20th Embodiment. 第20の実施形態に係る音声復号装置の第1の変形例190Aの動作を示すフローチャートである。It is a flowchart which shows the operation of the 1st modification 190A of the voice decoding apparatus which concerns on 20th Embodiment. 第20の実施形態に係る音声復号装置の第2の変形例190Bの構成を示す図である。It is a figure which shows the structure of the 2nd modification 190B of the voice decoding apparatus which concerns on 20th Embodiment. 第20の実施形態に係る音声復号装置の第2の変形例190Bの動作を示すフローチャートである。It is a flowchart which shows the operation of the 2nd modification 190B of the voice decoding apparatus which concerns on 20th Embodiment. 第20の実施形態に係る音声復号装置の第3の変形例190Cの構成を示す図である。It is a figure which shows the structure of the 3rd modification 190C of the voice decoding apparatus which concerns on 20th Embodiment. 第20の実施形態に係る音声復号装置の第3の変形例190Cの動作を示すフローチャートである。It is a flowchart which shows the operation of the 3rd modification 190C of the voice decoding apparatus which concerns on 20th Embodiment. 第20の実施形態に係る音声復号装置の第4の変形例190Dの構成を示す図である。It is a figure which shows the structure of the 4th modification 190D of the voice decoding apparatus which concerns on 20th Embodiment. 第20の実施形態に係る音声復号装置の第4の変形例190Dの動作を示すフローチャートである。It is a flowchart which shows the operation of the 4th modification 190D of the voice decoding apparatus which concerns on 20th Embodiment. 第20の実施形態に係る音声復号装置の第5の変形例190Eの構成を示す図である。It is a figure which shows the structure of the 5th modification 190E of the voice decoding apparatus which concerns on 20th Embodiment. 第20の実施形態に係る音声復号装置の第5の変形例190Eの動作を示すフローチャートである。It is a flowchart which shows the operation of the 5th modification 190E of the voice decoding apparatus which concerns on 20th Embodiment. 第20の実施形態に係る音声復号装置の第6の変形例190Fの構成を示す図である。It is a figure which shows the structure of the 6th modification 190F of the voice decoding apparatus which concerns on 20th Embodiment. 第20の実施形態に係る音声復号装置の第6の変形例190Fの動作を示すフローチャートである。It is a flowchart which shows the operation of the 6th modification 190F of the voice decoding apparatus which concerns on 20th Embodiment. 第20の実施形態に係る音声復号装置の第7の変形例190Gの構成を示す図である。It is a figure which shows the structure of the 7th modification 190G of the voice decoding apparatus which concerns on 20th Embodiment. 第20の実施形態に係る音声復号装置の第7の変形例190Gの動作を示すフローチャートである。It is a flowchart which shows the operation of the 7th modification 190G of the voice decoding apparatus which concerns on 20th Embodiment. 第20の実施形態に係る音声復号装置の第8の変形例190Hの構成を示す図である。It is a figure which shows the structure of the 8th modification 190H of the voice decoding apparatus which concerns on 20th Embodiment. 第20の実施形態に係る音声復号装置の第8の変形例190Hの動作を示すフローチャートである。It is a flowchart which shows the operation of the 8th modification 190H of the voice decoding apparatus which concerns on 20th Embodiment. 第20の実施形態に係る音声復号装置の第9の変形例190Iの構成を示す図である。It is a figure which shows the structure of the 9th modification 190I of the voice decoding apparatus which concerns on 20th Embodiment. 第20の実施形態に係る音声復号装置の第9の変形例190Iの動作を示すフローチャートである。It is a flowchart which shows the operation of the 9th modification 190I of the voice decoding apparatus which concerns on 20th Embodiment. 第21の実施形態に係る音声復号装置の第1の変形例300Aの構成を示す図である。It is a figure which shows the structure of the 1st modification 300A of the voice decoding apparatus which concerns on 21st Embodiment. 第21の実施形態に係る音声復号装置の第1の変形例300Aの動作を示すフローチャートである。It is a flowchart which shows the operation of the 1st modification 300A of the voice decoding apparatus which concerns on 21st Embodiment. 第21の実施形態に係る音声復号装置の第2の変形例300Bの構成を示す図である。It is a figure which shows the structure of the 2nd modification 300B of the voice decoding apparatus which concerns on 21st Embodiment. 第21の実施形態に係る音声復号装置の第2の変形例300Bの動作を示すフローチャートである。It is a flowchart which shows the operation of the 2nd modification 300B of the voice decoding apparatus which concerns on 21st Embodiment. 第21の実施形態に係る音声復号装置の第3の変形例300Cの構成を示す図である。It is a figure which shows the structure of the 3rd modification 300C of the voice decoding apparatus which concerns on 21st Embodiment. 第21の実施形態に係る音声復号装置の第3の変形例300Cの動作を示すフローチャートである。It is a flowchart which shows the operation of the 3rd modification 300C of the voice decoding apparatus which concerns on 21st Embodiment. 第21の実施形態に係る音声復号装置の第4の変形例300Dの構成を示す図である。It is a figure which shows the structure of the 4th modification 300D of the voice decoding apparatus which concerns on 21st Embodiment. 第21の実施形態に係る音声復号装置の第4の変形例300Dの動作を示すフローチャートである。It is a flowchart which shows the operation of the 4th modification 300D of the voice decoding apparatus which concerns on 21st Embodiment. 第22の実施形態に係る音声復号装置の第1の変形例310Aの構成を示す図である。It is a figure which shows the structure of the 1st modification 310A of the voice decoding apparatus which concerns on 22nd Embodiment. 第22の実施形態に係る音声復号装置の第1の変形例310Aの動作を示すフローチャートである。It is a flowchart which shows the operation of the 1st modification 310A of the voice decoding apparatus which concerns on 22nd Embodiment. 第22の実施形態に係る音声復号装置の第2の変形例310Bの構成を示す図である。It is a figure which shows the structure of the 2nd modification 310B of the voice decoding apparatus which concerns on 22nd Embodiment. 第22の実施形態に係る音声復号装置の第2の変形例310Bの動作を示すフローチャートである。It is a flowchart which shows the operation of the 2nd modification 310B of the voice decoding apparatus which concerns on 22nd Embodiment. 第22の実施形態に係る音声復号装置の第3の変形例310Cの構成を示す図である。It is a figure which shows the structure of the 3rd modification 310C of the voice decoding apparatus which concerns on 22nd Embodiment. 第22の実施形態に係る音声復号装置の第3の変形例310Cの動作を示すフローチャートである。It is a flowchart which shows the operation of the 3rd modification 310C of the voice decoding apparatus which concerns on 22nd Embodiment. 第22の実施形態に係る音声復号装置の第4の変形例310Dの構成を示す図である。It is a figure which shows the structure of the 4th modification 310D of the voice decoding apparatus which concerns on 22nd Embodiment. 第22の実施形態に係る音声復号装置の第4の変形例310Dの動作を示すフローチャートである。It is a flowchart which shows the operation of the 4th modification 310D of the voice decoding apparatus which concerns on 22nd Embodiment. 第23の実施形態に係る音声復号装置の第2の変形例320Bの構成を示す図である。It is a figure which shows the structure of the 2nd modification 320B of the voice decoding apparatus which concerns on 23rd Embodiment. 第23の実施形態に係る音声復号装置の第2の変形例320Bの動作を示すフローチャートである。It is a flowchart which shows the operation of the 2nd modification 320B of the voice decoding apparatus which concerns on 23rd Embodiment. 第23の実施形態に係る音声復号装置の第3の変形例320Cの構成を示す図である。It is a figure which shows the structure of the 3rd modification 320C of the voice decoding apparatus which concerns on 23rd Embodiment. 第23の実施形態に係る音声復号装置の第3の変形例320Cの動作を示すフローチャートである。It is a flowchart which shows the operation of the 3rd modification 320C of the voice decoding apparatus which concerns on 23rd Embodiment. 第23の実施形態に係る音声復号装置の第4の変形例320Dの構成を示す図である。It is a figure which shows the structure of the 4th modification 320D of the voice decoding apparatus which concerns on 23rd Embodiment. 第23の実施形態に係る音声復号装置の第4の変形例320Dの動作を示すフローチャートである。It is a flowchart which shows the operation of the 4th modification 320D of the voice decoding apparatus which concerns on 23rd Embodiment. 第23の実施形態に係る音声復号装置の第5の変形例320Eの構成を示す図である。It is a figure which shows the structure of the 5th modification 320E of the voice decoding apparatus which concerns on 23rd Embodiment. 第23の実施形態に係る音声復号装置の第5の変形例320Eの動作を示すフローチャートである。It is a flowchart which shows the operation of the 5th modification 320E of the voice decoding apparatus which concerns on 23rd Embodiment. 第23の実施形態に係る音声復号装置の第6の変形例320Fの構成を示す図である。It is a figure which shows the structure of the 6th modification 320F of the voice decoding apparatus which concerns on 23rd Embodiment. 第23の実施形態に係る音声復号装置の第6の変形例320Fの動作を示すフローチャートである。It is a flowchart which shows the operation of the 6th modification 320F of the voice decoding apparatus which concerns on 23rd Embodiment. 第23の実施形態に係る音声復号装置の第7の変形例320Gの構成を示す図である。It is a figure which shows the structure of the 7th modification 320G of the voice decoding apparatus which concerns on 23rd Embodiment. 第23の実施形態に係る音声復号装置の第7の変形例320Gの動作を示すフローチャートである。It is a flowchart which shows the operation of the 7th modification 320G of the voice decoding apparatus which concerns on 23rd Embodiment. 第23の実施形態に係る音声復号装置の第8の変形例320Hの構成を示す図である。It is a figure which shows the structure of the 8th modification 320H of the voice decoding apparatus which concerns on 23rd Embodiment. 第23の実施形態に係る音声復号装置の第8の変形例320Hの動作を示すフローチャートである。It is a flowchart which shows the operation of the 8th modification 320H of the voice decoding apparatus which concerns on 23rd Embodiment. 第23の実施形態に係る音声復号装置の第9の変形例320Iの構成を示す図である。It is a figure which shows the structure of the 9th modification 320I of the voice decoding apparatus which concerns on 23rd Embodiment. 第23の実施形態に係る音声復号装置の第9の変形例320Iの動作を示すフローチャートである。It is a flowchart which shows the operation of the 9th modification 320I of the voice decoding apparatus which concerns on 23rd Embodiment. 第24の実施形態に係る音声復号装置の第1の変形例330Aの構成を示す図である。It is a figure which shows the structure of the 1st modification 330A of the audio decoding apparatus which concerns on 24th Embodiment. 第24の実施形態に係る音声復号装置の第1の変形例330Aの動作を示すフローチャートである。It is a flowchart which shows the operation of the 1st modification 330A of the voice decoding apparatus which concerns on 24th Embodiment. 第24の実施形態に係る音声復号装置の第2の変形例330Bの構成を示す図である。It is a figure which shows the structure of the 2nd modification 330B of the voice decoding apparatus which concerns on 24th Embodiment. 第24の実施形態に係る音声復号装置の第2の変形例330Bの動作を示すフローチャートである。It is a flowchart which shows the operation of the 2nd modification 330B of the voice decoding apparatus which concerns on 24th Embodiment. 第24の実施形態に係る音声復号装置の第3の変形例330Cの構成を示す図である。It is a figure which shows the structure of the 3rd modification 330C of the voice decoding apparatus which concerns on 24th Embodiment. 第24の実施形態に係る音声復号装置の第3の変形例330Cの動作を示すフローチャートである。It is a flowchart which shows the operation of the 3rd modification 330C of the voice decoding apparatus which concerns on 24th Embodiment. 第24の実施形態に係る音声復号装置の第4の変形例330Dの構成を示す図である。It is a figure which shows the structure of the 4th modification 330D of the voice decoding apparatus which concerns on 24th Embodiment. 第24の実施形態に係る音声復号装置の第4の変形例330Dの動作を示すフローチャートである。It is a flowchart which shows the operation of the 4th modification 330D of the voice decoding apparatus which concerns on 24th Embodiment. 第25の実施形態に係る音声復号装置の第1の変形例340Aの構成を示す図である。It is a figure which shows the structure of the 1st modification 340A of the voice decoding apparatus which concerns on 25th Embodiment. 第25の実施形態に係る音声復号装置の第1の変形例340Aの動作を示すフローチャートである。It is a flowchart which shows the operation of the 1st modification 340A of the voice decoding apparatus which concerns on 25th Embodiment. 第25の実施形態に係る音声復号装置の第2の変形例340Bの構成を示す図である。It is a figure which shows the structure of the 2nd modification 340B of the voice decoding apparatus which concerns on 25th Embodiment. 第25の実施形態に係る音声復号装置の第2の変形例340Bの動作を示すフローチャートである。It is a flowchart which shows the operation of the 2nd modification 340B of the voice decoding apparatus which concerns on 25th Embodiment. 第25の実施形態に係る音声復号装置の第3の変形例340Cの構成を示す図である。It is a figure which shows the structure of the 3rd modification 340C of the voice decoding apparatus which concerns on 25th Embodiment. 第25の実施形態に係る音声復号装置の第3の変形例340Cの動作を示すフローチャートである。It is a flowchart which shows the operation of the 3rd modification 340C of the voice decoding apparatus which concerns on 25th Embodiment. 第25の実施形態に係る音声復号装置の第4の変形例340Dの構成を示す図である。It is a figure which shows the structure of the 4th modification 340D of the voice decoding apparatus which concerns on 25th Embodiment. 第25の実施形態に係る音声復号装置の第4の変形例340Dの動作を示すフローチャートである。It is a flowchart which shows the operation of the 4th modification 340D of the voice decoding apparatus which concerns on 25th Embodiment. 第26の実施形態に係る音声復号装置の第2の変形例350Bの構成を示す図である。It is a figure which shows the structure of the 2nd modification 350B of the voice decoding apparatus which concerns on 26th Embodiment. 第26の実施形態に係る音声復号装置の第2の変形例350Bの動作を示すフローチャートである。It is a flowchart which shows the operation of the 2nd modification 350B of the voice decoding apparatus which concerns on 26th Embodiment. 第26の実施形態に係る音声復号装置の第3の変形例350Cの構成を示す図である。It is a figure which shows the structure of the 3rd modification 350C of the voice decoding apparatus which concerns on 26th Embodiment. 第26の実施形態に係る音声復号装置の第3の変形例350Cの動作を示すフローチャートである。It is a flowchart which shows the operation of the 3rd modification 350C of the voice decoding apparatus which concerns on 26th Embodiment. 第26の実施形態に係る音声復号装置の第4の変形例350Dの構成を示す図である。It is a figure which shows the structure of the 4th modification 350D of the voice decoding apparatus which concerns on 26th Embodiment. 第26の実施形態に係る音声復号装置の第4の変形例350Dの動作を示すフローチャートである。It is a flowchart which shows the operation of the 4th modification 350D of the voice decoding apparatus which concerns on 26th Embodiment. 第26の実施形態に係る音声復号装置の第5の変形例350Eの構成を示す図である。It is a figure which shows the structure of the 5th modification 350E of the voice decoding apparatus which concerns on 26th Embodiment. 第26の実施形態に係る音声復号装置の第5の変形例350Eの動作を示すフローチャートである。It is a flowchart which shows the operation of the 5th modification 350E of the voice decoding apparatus which concerns on 26th Embodiment. 第26の実施形態に係る音声復号装置の第6の変形例350Fの構成を示す図である。It is a figure which shows the structure of the 6th modification 350F of the voice decoding apparatus which concerns on 26th Embodiment. 第26の実施形態に係る音声復号装置の第6の変形例350Fの動作を示すフローチャートである。It is a flowchart which shows the operation of the 6th modification 350F of the voice decoding apparatus which concerns on 26th Embodiment. 第26の実施形態に係る音声復号装置の第7の変形例350Gの構成を示す図である。It is a figure which shows the structure of the 7th modification 350G of the voice decoding apparatus which concerns on 26th Embodiment. 第26の実施形態に係る音声復号装置の第7の変形例350Gの動作を示すフローチャートである。It is a flowchart which shows the operation of the 7th modification 350G of the voice decoding apparatus which concerns on 26th Embodiment. 第26の実施形態に係る音声復号装置の第8の変形例350Hの構成を示す図である。It is a figure which shows the structure of the 8th modification 350H of the voice decoding apparatus which concerns on 26th Embodiment. 第26の実施形態に係る音声復号装置の第8の変形例350Hの動作を示すフローチャートである。It is a flowchart which shows the operation of the 8th modification 350H of the voice decoding apparatus which concerns on 26th Embodiment. 第26の実施形態に係る音声復号装置の第9の変形例350Iの構成を示す図である。It is a figure which shows the structure of the 9th modification 350I of the voice decoding apparatus which concerns on 26th Embodiment. 第26の実施形態に係る音声復号装置の第9の変形例350Iの動作を示すフローチャートである。It is a flowchart which shows the operation of the 9th modification 350I of the voice decoding apparatus which concerns on 26th Embodiment. 第27の実施形態に係る音声復号装置360の構成を示す図である。It is a figure which shows the structure of the voice decoding apparatus 360 which concerns on 27th Embodiment. 第27の実施形態に係る音声復号装置360の動作を示すフローチャートである。It is a flowchart which shows the operation of the voice decoding apparatus 360 which concerns on 27th Embodiment. 第27の実施形態に係る音声復号装置の第1の変形例360Aの構成を示す図である。It is a figure which shows the structure of the 1st modification 360A of the voice decoding apparatus which concerns on 27th Embodiment. 第27の実施形態に係る音声復号装置の第1の変形例360Aの動作を示すフローチャートである。It is a flowchart which shows the operation of the 1st modification 360A of the voice decoding apparatus which concerns on 27th Embodiment. 第28の実施形態に係る音声復号装置370の構成を示す図である。It is a figure which shows the structure of the audio decoding apparatus 370 which concerns on 28th Embodiment. 第28の実施形態に係る音声復号装置370の動作を示すフローチャートである。It is a flowchart which shows the operation of the voice decoding apparatus 370 which concerns on 28th Embodiment. 第28の実施形態に係る音声復号装置の第1の変形例370Aの構成を示す図である。It is a figure which shows the structure of the 1st modification 370A of the voice decoding apparatus which concerns on 28th Embodiment. 第28の実施形態に係る音声復号装置の第1の変形例370Aの動作を示すフローチャートである。It is a flowchart which shows the operation of the 1st modification 370A of the voice decoding apparatus which concerns on 28th Embodiment. 第29の実施形態に係る音声復号装置380の構成を示す図である。It is a figure which shows the structure of the audio decoding apparatus 380 which concerns on the 29th Embodiment. 第29の実施形態に係る音声復号装置380の動作を示すフローチャートである。It is a flowchart which shows the operation of the voice decoding apparatus 380 which concerns on the 29th Embodiment. 第29の実施形態に係る音声復号装置の第1の変形例380Aの構成を示す図である。It is a figure which shows the structure of the 1st modification 380A of the voice decoding apparatus which concerns on 29th Embodiment. 第29の実施形態に係る音声復号装置の第1の変形例380Aの動作を示すフローチャートである。It is a flowchart which shows the operation of the 1st modification 380A of the voice decoding apparatus which concerns on 29th Embodiment. 第30の実施形態に係る音声復号装置390の構成を示す図である。It is a figure which shows the structure of the voice decoding apparatus 390 which concerns on the 30th Embodiment. 第30の実施形態に係る音声復号装置390の動作を示すフローチャートである。It is a flowchart which shows the operation of the voice decoding apparatus 390 which concerns on 30th Embodiment.

添付図面を参照しながら本発明の各種の実施形態を説明する。可能な場合には、同一の部分には同一の符号を付して、重複する説明を省略する。 Various embodiments of the present invention will be described with reference to the accompanying drawings. When possible, the same parts are designated by the same reference numerals and duplicate description is omitted.

［第1の実施形態］
図１は、第1の実施形態に係る音声復号装置10の構成を示す図である。音声復号装置10の通信装置は、下記音声符号化装置20から出力される多重化された符号化系列を受信し、更に、復号した音声信号を外部に出力する。音声復号装置10は、図１に示すように、機能的には、符号化系列逆多重化部10a、コア復号部10b、分析フィルタバンク部10c、符号化系列解析部10d、低周波数時間包絡形状決定部10e、低周波数時間包絡修正部10f、高周波数信号生成部10g、復号/逆量子化部10h、周波数包絡調整部10i、及び合成フィルタバンク部10jを備える。各部の機能・動作は、以下、説明する。 [First Embodiment]
FIG. 1 is a diagram showing a configuration of an audio decoding device 10 according to the first embodiment. The communication device of the voice decoding device 10 receives the multiplexed coding sequence output from the following voice coding device 20, and further outputs the decoded voice signal to the outside. As shown in FIG. 1, the voice decoding device 10 functionally has a coded sequence demultiplexing section 10a, a core decoding section 10b, an analysis filter bank section 10c, a coded sequence analysis section 10d, and a low frequency time wrapping shape. It includes a determination unit 10e, a low frequency time wrapping correction unit 10f, a high frequency signal generation unit 10g, a decoding / dequantization unit 10h, a frequency wrapping adjustment unit 10i, and a synthesis filter bank unit 10j. The functions and operations of each part will be described below.

図2は、第1の実施形態に係る音声復号装置10の動作を示すフローチャートである。 FIG. 2 is a flowchart showing the operation of the voice decoding device 10 according to the first embodiment.

符号化系列逆多重化部10aは、符号化系列を、低周波数信号を符号化したコア符号化部分、低周波数信号から高周波数信号を生成するための帯域拡張部分、及び低周波数時間包絡形状決定部10eで必要な情報（低周波時間包絡形状に関する情報）に分割する（ステップS10-1）。 The coded sequence demultiplexing unit 10a determines the coded sequence as a core coding part that encodes a low frequency signal, a band extension part for generating a high frequency signal from a low frequency signal, and a low frequency time wrapping shape. Part 10e divides it into the necessary information (information about the low frequency time entrainment shape) (step S10-1).

符号化系列解析部10dは、符号化系列逆多重化部10aで分割された符号化系列の帯域拡張部分を解析し、高周波数信号生成部10g、及び復号/逆量子化部10hで必要な情報に分割する（ステップS10-2）。 The coded sequence analysis unit 10d analyzes the band expansion part of the coded sequence divided by the coded sequence demultiplexing unit 10a, and the information required by the high frequency signal generation unit 10g and the decoding / dequantization unit 10h. Divide into (step S10-2).

コア復号部10bは、符号化系列逆多重化部10aから符号化系列のコア符号化部分を受け取り復号し、低周波数信号を生成する（ステップS10-3）。 The core decoding unit 10b receives the core coded portion of the coded sequence from the coded sequence demultiplexing unit 10a, decodes it, and generates a low frequency signal (step S10-3).

分析フィルタバンク部10cは、前記低周波数信号を複数のサブバンド信号に分割する（ステップS10-4）。 The analysis filter bank unit 10c divides the low frequency signal into a plurality of subband signals (step S10-4).

低周波数時間包絡形状決定部10eは、符号化系列解析部10dから低周波時間包絡形状に関する情報を受け取り、当該情報に基づき低周波数信号の時間包絡形状を決定する（ステップS10-5）。例えば、低周波数信号の時間包絡形状を平坦と決定するケース、低周波数信号の時間包絡形状を立ち上がりと決定するケース、低周波数信号の時間包絡形状を立ち下がりと決定するケースが挙げられる。 The low frequency time envelope shape determination unit 10e receives information on the low frequency time envelope shape from the coded sequence analysis unit 10d, and determines the time envelope shape of the low frequency signal based on the information (step S10-5). For example, there are cases where the time envelope shape of a low frequency signal is determined to be flat, a case where the time envelope shape of a low frequency signal is determined to be rising, and a case where the time envelope shape of a low frequency signal is determined to be falling.

低周波数時間包絡修正部10fは、低周波数時間包絡形状決定部10eで決定した時間包絡形状に基づいて、分析フィルタバンク部10cから出力される低周波数信号の複数のサブバンド信号の時間包絡の形状を修正する（ステップS10-6）。 The low frequency time envelope correction unit 10f has the shape of the time envelope of a plurality of subband signals of the low frequency signal output from the analysis filter bank unit 10c based on the time envelope shape determined by the low frequency time envelope shape determination unit 10e. (Step S10-6).

例えば、低周波数時間包絡修正部10fは、任意の時間セグメント内の前記低周波数信号の複数のサブバンド信号X_dec,LO(k,i) (0≦k<k_x, t_E(l)≦i<t_E(l+1))に対して、所定の関数F(X_dec,LO(k,i))を用いて以下の式（１）

により得られるX’_dec,LO(k,i)を時間包絡形状が修正された低周波数信号のサブバンド信号として出力する。 For example, the low frequency time envelope correction unit 10f may use a plurality of subband signals X _{dec, LO} (k, i) (0 ≤ k <k _x , t _E (l) ≤) of the low frequency signal in an arbitrary time segment. For i <t _E (l + 1)), the following equation (1) is used using the predetermined function F (X _{dec, LO} (k, i)).

_{X'dec, LO} (k, i) obtained by is output as a subband signal of a low frequency signal with a corrected time envelope shape.

例えば、前記低周波数信号の時間包絡形状が平坦と決定された場合、以下の処理により、低周波数信号の時間包絡形状を修正できる。
例えば、当該サブバンド信号X_dec,LO(k,i)をB_dec,LO(m) (m=0,…,M_LO, M_LO≧1) (B_dec,LO(0)≧0, B_dec,LO(M_LO)<k_x)で境界を表されるM_LO個の周波数帯域に分割し、m番目の周波数帯域に含まれるサブバンド信号X_dec,LO(k,i) (B_LO(m)≦k<B_LO(m+1), t_E(l)≦i<t_E(l+1))に対して、所定の関数F(X_dec,LO(k,i))を、

として、X’_dec,LO(k,i)を時間包絡形状が修正された低周波数信号のサブバンド信号として出力する。
また別の例によれば、所定の関数F(X_dec,LO(k,i))を、サブバンド信号X_dec,LO(k,i)に対して平滑化フィルタ処理を施す

(N_filt≧1)で定義して、X’_dec,LO(k,i)を時間包絡形状が修正された低周波数信号のサブバンド信号として出力する。さらに、前記B_dec,LO(m)を用いて境界が表される各周波数帯域内で、フィルタ処理前後のサブバンド信号のパワーをあわせるように処理できる。
また別の例によれば、サブバンド信号X_dec,LO(k,i)を前記B_dec,LO(m)を用いて境界が表される各周波数帯域内で周波数方向に線形予測して線形予測係数α_p(m) (m=0,…,M_LO-1)を得て、所定の関数F(X_dec,LO(k,i))を、サブバンド信号X_dec,LO(k,i)に対して線形予測逆フィルタ処理を施す

(N_pred≧1)で定義して、X’_dec,LO(k,i)を時間包絡形状が修正された低周波数信号のサブバンド信号として出力する。 For example, when the time envelope shape of the low frequency signal is determined to be flat, the time envelope shape of the low frequency signal can be corrected by the following processing.
For example, the subband signal X _{dec, LO} (k, i) is B _{dec, LO} (m) (m = 0,…, M _LO , M _LO ≧ 1) (B _{dec, LO} (0) ≧ 0, B Subband signal X _{dec, LO} (k, i) (B _LO ) divided into M _LO frequency bands whose boundaries are represented by _{dec, LO} (M _LO ) <k _x ) and included in the mth frequency band. For (m) ≤ k <B _LO (m + 1), t _E (l) ≤ i <t _E (l + 1)), the predetermined function F (X _{dec, LO} (k, i)) ,

As, _{X'dec, LO} (k, i) is output as a subband signal of a low frequency signal with a corrected time envelope shape.
According to another example, the predetermined function F (X _{dec, LO} (k, i)) is subjected to smoothing filtering on the subband signal X _{dec, LO} (k, i).

Defined by (N _filt ≧ 1), _{X'dec, LO} (k, i) is output as a subband signal of a low frequency signal with a corrected time envelope shape. Further, the B _{dec and LO} (m) can be used to match the powers of the subband signals before and after the filtering within each frequency band whose boundary is represented.
According to another example, the subband signals X _{dec, LO} (k, i) are linearly predicted in the frequency direction within each frequency band whose boundary is represented by using the B _{dec, LO} (m). Obtain the prediction coefficient α _p (m) (m = 0,…, M _LO -1) and _apply the predetermined function F (X _{dec, LO} (k, i)) to the subband signal X _{dec, LO} (k, Perform linear prediction inverse filtering on i)

Defined by (N _pred ≥ 1), _{X'dec, LO} (k, i) is output as a subband signal of a low frequency signal with a corrected time envelope shape.

上記の時間包絡形状を平坦に修正する処理の例は、それぞれを組み合わせて実施できる。低周波数時間包絡修正部10fは、低周波数信号の複数のサブバンド信号の時間包絡の形状を平坦に修正する処理を実施し、上記の例に限定されない。 The above-mentioned example of the process of correcting the time envelope shape flatly can be carried out in combination with each other. The low frequency time envelope correction unit 10f performs a process of flattening the shape of the time envelope of a plurality of subband signals of the low frequency signal, and is not limited to the above example.

さらには、例えば、前記低周波数信号の時間包絡形状が立ち上がりと決定された場合、以下の処理により、低周波数信号の時間包絡形状を修正できる。
例えば、所定の関数F(X_dec,LO(k,i))をiに対して単調増加する関数incr(i)を用いて

で定義して、X’_dec,LO(k,i)を時間包絡形状が修正された低周波数信号のサブバンド信号として出力する。さらに、前記B_dec,LO(m)を用いて境界が表される各周波数帯域内で、時間包絡形状の修正前後のサブバンド信号のパワーをあわせるように処理できる。 Further, for example, when the time envelope shape of the low frequency signal is determined to be rising, the time envelope shape of the low frequency signal can be corrected by the following processing.
For example, using the function incr (i) that monotonically increases a given function F (X _{dec, LO} (k, i)) with respect to i.

Defined in, _{X'dec, LO} (k, i) is output as a subband signal of a low frequency signal with a corrected time envelope shape. Further, the B _{dec and LO} (m) can be used to match the powers of the subband signals before and after the correction of the time envelope shape within each frequency band whose boundary is represented.

低周波数時間包絡修正部10fは、低周波数信号の複数のサブバンド信号の時間包絡の形状を立ち上がりに修正する処理を実施し、上記の例に限定されない。 The low frequency time envelope correction unit 10f performs a process of correcting the shape of the time envelope of a plurality of subband signals of the low frequency signal at the rising edge, and is not limited to the above example.

さらには、例えば、前記低周波数信号の時間包絡形状が立ち下がりと決定された場合、以下の処理により、低周波数信号の時間包絡形状を修正できる。
例えば、所定の関数F(X_dec,LO(k,i))を、iに対して単調減少する関数decr(i)を用いて

で定義して、X’_dec,LO(k,i)を時間包絡形状が修正された低周波数信号のサブバンド信号として出力する。さらに、前記B_dec,LO(m)を用いて境界が表される各周波数帯域内で、時間包絡形状の修正前後のサブバンド信号のパワーをあわせるように処理できる。 Further, for example, when the time envelope shape of the low frequency signal is determined to be falling, the time envelope shape of the low frequency signal can be corrected by the following processing.
For example, using a function decr (i) that monotonically decreases a given function F (X _{dec, LO} (k, i)) with respect to i.

低周波数時間包絡修正部10fは、低周波数信号の複数のサブバンド信号の時間包絡の形状を立ち下がりに修正する処理を実施し、上記の例に限定されない。 The low frequency time envelope correction unit 10f performs a process of correcting the shape of the time envelope of a plurality of subband signals of the low frequency signal to a falling edge, and is not limited to the above example.

復号/逆量子化部10hは、符号化系列解析部10dから出力された時間/周波数分解能の情報より、高周波数信号の生成/調整処理におけるスケールファクタバンドのデザイン、時間セグメントの長さを決定し、さらに、高周波数信号生成部10gにて生成される高周波数信号に対する、ゲインの情報および当該高周波数信号に付加するノイズ信号の情報を符号化系列解析部10dより受け取り，復号/逆量子化して高周波数信号に対するゲインおよびノイズ信号の大きさを取得する（ステップS10-7）。なお、上記スケールファクタバンドのデザイン、時間セグメントの長さについてあらかじめ決められている場合は決定する必要は無い。 The decoding / inverse quantization unit 10h determines the scale factor band design and time segment length in the high-frequency signal generation / adjustment processing from the time / frequency resolution information output from the coded sequence analysis unit 10d. Furthermore, for the high frequency signal generated by the high frequency signal generation unit 10g, the gain information and the noise signal information added to the high frequency signal are received from the coding sequence analysis unit 10d and decoded / dequantized. Obtain the gain and magnitude of the noise signal for the high frequency signal (step S10-7). If the design of the scale factor band and the length of the time segment are predetermined, it is not necessary to determine them.

高周波数信号生成部10gは、入力される低周波数信号のサブバンド信号から、符号化系列解析部10dから出力された情報、復号/逆量子化部10hから出力されたスケールファクタバンドのデザイン、時間セグメントの長さのうち少なくとも一つに基づいて、高周波数信号を生成する（ステップS10-8）。本実施形態においては、分析フィルタバンク部10cで分割された低周波数信号のサブバンド信号が入力される。 The high frequency signal generator 10g is the information output from the coded sequence analysis unit 10d from the input low frequency signal subband signal, the scale factor band design output from the decoding / inverse quantization unit 10h, and the time. Generate a high frequency signal based on at least one of the segment lengths (step S10-8). In the present embodiment, a subband signal of a low frequency signal divided by the analysis filter bank unit 10c is input.

周波数包絡調整部10iは、復号/逆量子化部10hで取得したゲインおよびノイズ信号の大きさに基づいて、高周波数信号生成部10gにて生成された高周波数信号に対してゲイン調整およびノイズ信号の付加をして高周波数信号の周波数包絡を調整する（ステップS10-9）。さらに、正弦波信号を付加することもでき、当該正弦波信号の付加は符号化系列の帯域拡張部分に含まれる情報に基づいても良い。 The frequency wrapping adjustment unit 10i adjusts the gain and noise signal for the high frequency signal generated by the high frequency signal generation unit 10g based on the gain and noise signal magnitude acquired by the decoding / inverse quantization unit 10h. Is added to adjust the frequency wrapping of the high frequency signal (step S10-9). Further, a sine wave signal can be added, and the addition of the sine wave signal may be based on the information included in the band extension portion of the coding series.

合成フィルタバンク部10jは、低周波数時間包絡修正部10fから出力された低周波数信号のサブバンド信号と、周波数包絡調整部10iから出力された高周波数信号のサブバンド信号から時間信号を合成し、出力音声信号として出力する（ステップS10-10）。 The synthesis filter bank section 10j synthesizes a time signal from a subband signal of a low frequency signal output from the low frequency time wrapping correction section 10f and a subband signal of a high frequency signal output from the frequency wrapping adjustment section 10i. Output Output as an audio signal (step S10-10).

ステップS10-1〜S10-4、S10-7〜S10-10の処理は、“ISO/IEC 14496-3”に規定される“SBR”および“Low Delay SBR”の各処理にて対応できる。 The processing of steps S10-1 to S10-4 and S10-7 to S10-10 can be handled by each processing of "SBR" and "Low Delay SBR" specified in "ISO / IEC 14496-3".

図3は、第1の実施形態に係る音声符号化装置20の構成を示す図である。音声符号化装置20の通信装置は、符号化の対象となる音声信号を外部から受信し、更に、符号化された符号化系列を外部に出力する。音声符号化装置20は、図3に示すように、機能的には、ダウンサンプリング部20a、コア符号化部20b、分析フィルタバンク部20c及び20c1、制御パラメータ符号化部20d、包絡算出部20e、量子化/符号化部20f、時間包絡情報符号化部20g、符号化系列多重化部20h、サブバンド信号パワー算出部20j、及びコア復号信号生成部20iを備える。各部の機能・動作は、以下、説明する。 FIG. 3 is a diagram showing a configuration of the voice coding device 20 according to the first embodiment. The communication device of the voice coding device 20 receives the voice signal to be coded from the outside, and further outputs the coded coding sequence to the outside. As shown in FIG. 3, the voice coding device 20 functionally includes a downsampling unit 20a, a core coding unit 20b, an analysis filter bank unit 20c and 20c1, a control parameter coding unit 20d, and an entrapment calculation unit 20e. It includes a quantization / coding unit 20f, a time-wrapping information coding unit 20g, a coding sequence multiplexing unit 20h, a subband signal power calculation unit 20j, and a core decoding signal generation unit 20i. The functions and operations of each part will be described below.

図4は、第1の実施形態に係る音声符号化装置20の動作を示すフローチャートである。 FIG. 4 is a flowchart showing the operation of the voice coding device 20 according to the first embodiment.

ダウンサンプリング部20aは、入力音声信号をダウンサンプルし、入力音声信号の低周波数信号に相当するダウンサンプル入力音声信号を得る（ステップS20-1）。 The downsampling unit 20a downsamples the input audio signal and obtains a downsampled input audio signal corresponding to the low frequency signal of the input audio signal (step S20-1).

コア符号化部20bは、ダウンサンプリング部20aで得られたダウンサンプル信号を符号化し、低周波数信号の符号化系列を生成する（ステップS20-2）。 The core coding unit 20b encodes the downsampling signal obtained by the downsampling unit 20a to generate a coded sequence of the low frequency signal (step S20-2).

分析フィルタバンク部20cは、入力音声信号を複数のサブバンド信号に分割する（ステップS20-3）。 The analysis filter bank unit 20c divides the input audio signal into a plurality of subband signals (step S20-3).

制御パラメータ符号化部20dは、音声復号装置10において高周波数信号を生成するために必要な制御パラメータを符号化する（ステップS20-4）。当該パラメータは、例えば時間/周波数分解能の情報を含む。例えば、音声復号装置10の復号/逆量子化部10hでスケールファクタバンドのデザイン、時間セグメントの長さを決定する際に用いる情報を含む。 The control parameter coding unit 20d encodes the control parameters required for generating the high frequency signal in the voice decoding device 10 (step S20-4). The parameters include, for example, time / frequency resolution information. For example, the decoding / dequantization unit 10h of the voice decoding device 10 includes information used when designing a scale factor band and determining the length of a time segment.

包絡算出部20eは、分析フィルタバンク部20cで得られたサブバンド信号から、音声復号装置10の復号/逆量子化部10hで復号/逆量子化される高周波数信号に対するゲインおよびノイズ信号の大きさを算出する（ステップS20-5）。 The encapsulation calculation unit 20e is the magnitude of the gain and noise signal for the high frequency signal decoded / dequantized by the decoding / dequantization unit 10h of the voice decoding device 10 from the subband signal obtained by the analysis filter bank unit 20c. Calculate the value (step S20-5).

量子化/符号化部20fは、包絡算出部20eにて算出された高周波数信号に対するゲインおよびノイズ信号の大きさを量子化および符号化する（ステップS20-6）。 The quantization / coding unit 20f quantizes and encodes the gain and noise signal magnitudes for the high frequency signal calculated by the envelope calculation unit 20e (step S20-6).

コア復号信号生成部20iは、コア符号化部20bで符号化された情報を用いて、コア復号信号を生成する(ステップS20-7)。当該処理は、音声復号装置10のコア復号部10bと同様に実施されてもよい。また、コア符号化部20bにおける符号化される前の量子化された情報を用いて、コア復号信号を生成してもよい。また、一部の情報は音声復号装置10のコア復号部10bと異なってもよく、例えばCELP符号化の場合、復号装置における適応符号帳に保持される信号は、過去に復号された励振信号またはそれに所定の処理を施した信号であるが、当該コア復号信号生成部20iでは、入力音声信号を線形予測した後の残差信号であってもよい。 The core decoding signal generation unit 20i generates a core decoding signal using the information encoded by the core coding unit 20b (step S20-7). The process may be performed in the same manner as the core decoding unit 10b of the audio decoding device 10. Further, the core decoding signal may be generated by using the quantized information before being encoded in the core coding unit 20b. Further, some information may be different from the core decoding unit 10b of the audio decoding device 10. For example, in the case of CELP coding, the signal held in the adaptive codebook in the decoding device is an excitation signal decoded in the past or an excitation signal. Although it is a signal that has undergone a predetermined process, the core decoding signal generation unit 20i may be a residual signal after linearly predicting an input audio signal.

分析フィルタバンク部20c1は、コア復号信号生成部20iで生成されたコア復号信号を複数のサブバンド信号に分割する（ステップS20-8）。当該処理において、コア復号信号からサブバンド信号に分割する際の分解能は、分析フィルタバンク部20cと同じであってもよい。 The analysis filter bank unit 20c1 divides the core decoding signal generated by the core decoding signal generation unit 20i into a plurality of subband signals (step S20-8). In this process, the resolution at the time of dividing the core decoded signal into the subband signal may be the same as that of the analysis filter bank unit 20c.

サブバンド信号パワー算出部20jは、分析フィルタバンク部20c1で得られたコア復号信号のサブバンド信号のパワーを算出する（ステップS20-9）。当該処理は、包絡算出部20eにおける低周波数信号のサブバンド信号のパワーの算出と同様に実施される。 The subband signal power calculation unit 20j calculates the power of the subband signal of the core decoded signal obtained by the analysis filter bank unit 20c1 (step S20-9). This process is performed in the same manner as the calculation of the power of the subband signal of the low frequency signal in the envelope calculation unit 20e.

時間包絡情報符号化部20gは、包絡算出部20eにて算出した低周波数信号のサブバンド信号のパワーを用いて低周波数信号の時間包絡を算出し、同様にコア復号信号のサブバンド信号のパワーを用いてコア復号信号の時間包絡を算出し、当該低周波数信号及びコア復号信号の時間包絡より時間包絡情報を算出し符号化する（ステップS20-10）。当該処理において、低周波数信号のサブバンド信号のパワーが算出されていない場合は、時間包絡情報符号化部20gにて低周波数信号のサブバンド信号のパワーを算出してもよく、低周波数信号のサブバンド信号のパワーがどこで算出されるかは限定されない。 The time wrapping information coding unit 20g calculates the time wrapping of the low frequency signal using the power of the subband signal of the low frequency signal calculated by the wrapping calculation unit 20e, and similarly, the power of the subband signal of the core decoding signal. Is used to calculate the time wrapping of the core decoding signal, and the time wrapping information is calculated and encoded from the time wrapping of the low frequency signal and the core decoding signal (step S20-10). If the power of the subband signal of the low frequency signal is not calculated in the process, the power of the subband signal of the low frequency signal may be calculated by the time wrapping information coding unit 20g, and the power of the subband signal of the low frequency signal Where the power of the subband signal is calculated is not limited.

例えば、任意の時間セグメントt_E(l)≦i<t_E(l+1)内でB_LO(m) (m=0,…,M_LO, M_LO≧1) (B_LO(0)≧0, B_LO(M_LO)<k_x)で境界を表されるM_LO個の周波数帯域に分割し、m番目の周波数帯域に含まれる低周波数信号のサブバンド信号X_LO(k,i) (B_LO(m)≦k<B_LO(m+1), t_E(l)≦i<t_E(l+1))の時間包絡E_LO(k,i)は、前記時間セグメント及び周波数帯域内で正規化した当該低周波数信号のサブバンド信号X_LO(k,i)のパワーとして算出できる。

同様に、コア復号信号の時間包絡E_dec,LO(k,i)を前記時間セグメント及び周波数帯域内で正規化した当該コア復号信号のサブバンド信号X_dec,LO(k,i)のパワーとして算出できる。

低周波数信号及びコア復号信号のサブバンド信号の時間包絡は、低周波数信号及びコア復号信号のサブバンド信号の大きさの時間方向の変動がわかるパラメータであれば良く、前記の例に限定されない。 For example, within any time segment t _E (l) ≤ i <t _E (l + 1), B _LO (m) (m = 0,…, M _LO , M _LO ≥ 1) (B _LO (0) ≥ It is divided into M _LO frequency bands whose boundaries are represented by 0, B _LO (M _LO ) <k _x ), and the sub-band signal X _LO (k, i) of the low frequency signal included in the mth frequency band. The time wrapping E _LO (k, i) of (B _LO (m) ≤ k <B _LO (m + 1), t _E (l) ≤ i <t _E (l + 1)) is the time segment and frequency. It can be calculated as the power of the subband signal X _LO (k, i) of the low frequency signal normalized in the band.

Similarly, as the power of the subband signal X _{dec, LO} (k, i) of the core decoded signal obtained by normalizing the time envelope E _{dec, LO} (k, i) of the core decoded signal within the time segment and frequency band. Can be calculated.

The time envelope of the sub-band signal of the low-frequency signal and the core-decoded signal may be a parameter that shows the variation in the magnitude of the sub-band signal of the low-frequency signal and the core decoded signal in the time direction, and is not limited to the above example.

例えば、時間包絡情報符号化部20gは時間包絡情報として平坦の程度を表す情報を算出する。例えば、低周波数信号及びコア復号信号のサブバンド信号の時間包絡の分散またはそれに準ずるパラメータを算出する。さらに別の例では、低周波数信号及びコア復号信号のサブバンド信号の時間包絡の相加平均と相乗平均の比またはそれに準ずるパラメータを算出する。この場合、時間包絡情報符号化部20gは、時間包絡情報として当該低周波数信号のサブバンド信号の時間包絡の平坦さを表す情報を算出すればよく、前記の例に限定されない。そして、前記パラメータを符号化する。例えば、低周波数信号とコア復号信号の当該パラメータの差分値またはその絶対値を符号化する。さらに、例えば、低周波数信号の当該パラメータの値または絶対値を符号化する。例えば、時間包絡の平坦さを平坦か否かで表現すれば1ビットで符号化でき、例えば、前記任意の時間セグメント内において前記M_LO個の周波数帯域毎に当該情報をM_LOビットで符号化できる。時間包絡情報の符号化方法は前記の例に限定されない。 For example, the time-envelope information coding unit 20g calculates information indicating the degree of flatness as time-envelope information. For example, the variance of the time envelope of the subband signal of the low frequency signal and the core decoding signal or a parameter equivalent thereto is calculated. In yet another example, the ratio of the arithmetic mean to the geometric mean of the time-envelope of the subband signals of the low frequency signal and the core decoded signal or a parameter equivalent thereto is calculated. In this case, the time-envelope information coding unit 20g may calculate information indicating the flatness of the time-envelope of the subband signal of the low-frequency signal as the time-envelope information, and is not limited to the above example. Then, the parameter is encoded. For example, the difference value or the absolute value of the parameter of the low frequency signal and the core decoding signal is encoded. Further, for example, the value or absolute value of the parameter of the low frequency signal is encoded. For example, if the flatness of the time envelope is expressed by whether it is flat or not, it can be encoded with 1 bit. For example, the information is encoded with M _LO bits for each of the M _LO frequency bands in the arbitrary time segment. it can. The method for encoding the time envelope information is not limited to the above example.

さらに例えば、時間包絡情報符号化部20gは時間包絡情報として立ち上がりの程度を表す情報を算出する。例えば、任意の時間セグメントt_E(l)≦i<t_E(l+1)内において、低周波数信号のサブバンド信号の時間包絡の時間方向の差分値の最大値を算出する。

さらには、式（９）において、時間包絡に代えて当該時間包絡を時間方向に平滑化したパラメータの時間方向の差分値の最大値を算出できる。 Further, for example, the time-envelope information coding unit 20g calculates information indicating the degree of rise as time-envelope information. For example, within an arbitrary time segment t _E (l) ≤ i <t _E (l + 1), the maximum value of the difference value in the time direction of the time envelope of the subband signal of the low frequency signal is calculated.

Further, in the equation (9), instead of the time envelope, the maximum value of the difference value in the time direction of the parameter obtained by smoothing the time envelope in the time direction can be calculated.

この場合、時間包絡情報符号化部20gは、時間包絡情報として当該低周波数信号のサブバンド信号の時間包絡の立ち上がりの程度を表す情報を算出すればよく、前記の例に限定されない。そして、前記パラメータを符号化する。例えば、低周波数信号とコア復号信号の当該パラメータの差分値またはその絶対値を符号化する。例えば、時間包絡の立ち上がりの程度を立ち上がりか否かで表現すれば1ビットで符号化でき、例えば、前記任意の時間セグメント内において前記M_LO個の周波数帯域毎に当該情報をM_LOビットで符号化できる。時間包絡情報の符号化方法は前記の例に限定されない。 In this case, the time-envelope information coding unit 20g may calculate as the time-envelope information information indicating the degree of rise of the time-envelope of the subband signal of the low-frequency signal, and is not limited to the above example. Then, the parameter is encoded. For example, the difference value or the absolute value of the parameter of the low frequency signal and the core decoding signal is encoded. For example, if the degree of rise of the time envelope is expressed by whether it rises or not, it can be encoded with 1 bit. For example, the information is coded with M _LO bits for each of the M _LO frequency bands in the arbitrary time segment. Can be converted. The method for encoding the time envelope information is not limited to the above example.

さらに例えば、時間包絡情報符号化部20gは時間包絡情報として立ち下がりの程度を表す情報を算出する。例えば、任意の時間セグメントt_E(l)≦i<t_E(l+1)内において、低周波数信号のサブバンド信号の時間包絡の時間方向の差分値の最小値を算出する。

さらには、式（１０）において、時間包絡に代えて当該時間包絡を時間方向に平滑化したパラメータの時間方向の差分値の最小値を算出できる。 Further, for example, the time-envelope information coding unit 20g calculates information indicating the degree of fall as time-envelope information. For example, within an arbitrary time segment t _E (l) ≤ i <t _E (l + 1), the minimum value of the difference value in the time direction of the time envelope of the subband signal of the low frequency signal is calculated.

Further, in the equation (10), instead of the time envelope, the minimum value of the difference value in the time direction of the parameter obtained by smoothing the time envelope in the time direction can be calculated.

この場合、時間包絡情報符号化部20gは、時間包絡情報として当該低周波数信号のサブバンド信号の時間包絡の立ち下がりの程度を表す情報を算出すればよく、前記の例に限定されない。そして、前記パラメータを符号化する。例えば、低周波数信号とコア復号信号の当該パラメータの差分値またはその絶対値を符号化する。例えば、時間包絡の立ち下がりの程度を立ち下がりか否かで表現すれば1ビットで符号化でき、例えば、前記任意の時間セグメント内において前記M_LO個の周波数帯域毎に当該情報をM_LOビットで符号化できる。時間包絡情報の符号化方法は前記の例に限定されない。 In this case, the time-envelope information coding unit 20g may calculate as the time-envelope information information indicating the degree of the fall of the time-envelope of the subband signal of the low-frequency signal, and is not limited to the above example. Then, the parameter is encoded. For example, the difference value or the absolute value of the parameter of the low frequency signal and the core decoding signal is encoded. For example, if the degree of the fall of the time envelope is expressed by whether or not it falls, it can be encoded with 1 bit. For example, the information can be encoded in the M _LO bits for each of the M _LO frequency bands in the arbitrary time segment. Can be encoded with. The method for encoding the time envelope information is not limited to the above example.

前記時間包絡情報として平坦の程度、立ち上がりの程度、及び立下りの程度を表す情報を算出する例において、低周波数信号及びコア復号信号の時間包絡のうち一方のみを用いる場合においては、他方の時間包絡の算出のみに係る各部及び各処理を省略することができる。 In the example of calculating the information indicating the degree of flatness, the degree of rising edge, and the degree of falling edge as the time envelope information, when only one of the time envelopes of the low frequency signal and the core decoded signal is used, the other time is used. Each part and each process related only to the calculation of the envelope can be omitted.

符号化系列多重化部20hは、入力される一つ以上の符号化系列または符号化された情報または符号化されたパラメータを多重化して、符号化系列として出力する（ステップS20-11）。ここでは、コア符号化部20bより低周波数信号の符号化系列を受け取り、制御パラメータ符号化部20dより符号化された制御パラメータを受け取り、量子化/符号化部20fより符号化された高周波数信号に対するゲインおよびノイズ信号の大きさを受け取り、時間包絡情報符号化部20gより符号化された時間包絡情報を受け取り、これらを多重化して符号化系列として出力する。 The coded sequence multiplexing unit 20h multiplexes one or more input coded sequences or encoded information or encoded parameters and outputs them as a coded sequence (step S20-11). Here, the low frequency signal coding sequence is received from the core coding unit 20b, the control parameters encoded by the control parameter coding unit 20d are received, and the high frequency signal encoded by the quantization / coding unit 20f. Receives the magnitude of the gain and noise signal with respect to, receives the time-wrapping information encoded from the time-wrapping information coding unit 20g, multiplexes these, and outputs them as a coded sequence.

ステップS20-1〜S20-6およびS20-80の処理は、“ISO/IEC 14496-3”に規定される“SBR”および“Low Delay SBR”の符号化器の各処理にて対応できる。 The processing of steps S20-1 to S20-6 and S20-80 can be handled by each processing of the "SBR" and "Low Delay SBR" encoders specified in "ISO / IEC 14496-3".

［第1の実施形態の音声復号装置の第1の変形例］
図5は、第1の実施形態に係る音声復号装置の第1の変形例10Aの構成を示す図である。なお、これ以降は、該当の変形例及び実施形態における特徴的な機能・動作について説明し、重複した説明は可能な範囲で省略する。 [First modification of the audio decoding device of the first embodiment]
FIG. 5 is a diagram showing a configuration of a first modification 10A of the audio decoding device according to the first embodiment. In the following, the characteristic functions / operations in the corresponding modification and the embodiment will be described, and duplicate explanations will be omitted to the extent possible.

符号化系列逆多重化部10aAは、符号化系列を、低周波数信号を符号化したコア符号化部分、低周波数信号から高周波数信号を生成するための帯域拡張部分に分割する（ステップS10-1a）。 The coded sequence demultiplexing section 10aA divides the coded sequence into a core coding part that encodes a low frequency signal and a band extension part for generating a high frequency signal from the low frequency signal (step S10-1a). ).

図6は、第1の実施形態に係る音声復号装置の第1の変形例10Aの動作を示すフローチャートである。 FIG. 6 is a flowchart showing the operation of the first modification 10A of the audio decoding device according to the first embodiment.

低周波数時間包絡形状決定部10eAは、コア復号部10bから低周波数信号を受け取り、低周波数信号の時間包絡形状を決定する（ステップS10-5a）。 The low frequency time envelope shape determining unit 10eA receives the low frequency signal from the core decoding unit 10b and determines the time envelope shape of the low frequency signal (step S10-5a).

例えば、低周波数信号の時間包絡形状を平坦と決定する。例えば、低周波数信号x_dec(t)のパワーまたはそれに準ずるパラメータを算出し、当該パラメータの分散またはそれに準ずるパラメータを算出する。算出したパラメータと所定の閾値とを比較して時間包絡形状が平坦か否かまたは平坦さの程度を決定する。さらに別の例では、低周波数信号x_dec(t)のパワーまたはそれに準ずるパラメータの相加平均と相乗平均の比またはそれに準ずるパラメータを算出し、所定の閾値とを比較して時間包絡形状が平坦か否かまたは平坦さの程度を決定する。低周波数信号の時間包絡形状を平坦と決定する方法は上記の例に限定されない。 For example, the time envelope shape of a low frequency signal is determined to be flat. For example, the power of the low frequency signal x _dec (t) or a parameter equivalent thereto is calculated, and the variance of the parameter or a parameter equivalent thereto is calculated. The calculated parameters are compared with a predetermined threshold value to determine whether or not the time envelope shape is flat or the degree of flatness. In yet another example, the ratio of the arithmetic mean to the geometric mean of the power of the low frequency signal x _dec (t) or its equivalent parameters or the equivalent parameter is calculated and compared with a predetermined threshold to flatten the time envelope shape. Whether or not or the degree of flatness is determined. The method for determining the time envelope shape of a low frequency signal as flat is not limited to the above example.

さらに例えば、低周波数信号の時間包絡形状を立ち上がりと決定する。例えば、低周波数信号x_dec(t)のパワーまたはそれに準ずるパラメータを算出し、当該パラメータの時間方向の差分値を算出し、当該差分値の任意の時間セグメント内の最大値を算出する。当該最大値と所定の閾値とを比較して、時間包絡形状が立ち上がりか否かまたは立ち上がりの程度を決定する。低周波数信号の時間包絡形状を立ち上がりと決定する方法は上記の例に限定されない。 Further, for example, the time envelope shape of the low frequency signal is determined to be the rising edge. For example, the power of the low frequency signal x _dec (t) or a parameter equivalent thereto is calculated, the difference value of the parameter in the time direction is calculated, and the maximum value of the difference value in an arbitrary time segment is calculated. The maximum value is compared with a predetermined threshold value to determine whether or not the time envelope shape rises or the degree of rise. The method for determining the time envelope shape of a low frequency signal as a rising edge is not limited to the above example.

さらに例えば、低周波数信号の時間包絡形状を立ち下がりと決定する。例えば、低周波数信号x_dec(t)のパワーまたはそれに準ずるパラメータを算出し、当該パラメータの時間方向の差分値を算出し、当該差分値の任意の時間セグメント内の最小値を算出する。当該最小値と所定の閾値とを比較して、時間包絡形状が立ち下がりか否かまたは立ち下がりの程度を決定する。低周波数信号の時間包絡形状を立ち下がりと決定する方法は上記の例に限定されない。 Further, for example, the time envelope shape of the low frequency signal is determined to be falling. For example, the power of the low frequency signal x _dec (t) or a parameter equivalent thereto is calculated, the difference value of the parameter in the time direction is calculated, and the minimum value of the difference value in an arbitrary time segment is calculated. The minimum value is compared with a predetermined threshold value to determine whether or not the time envelope shape falls or the degree of the fall. The method for determining the time envelope shape of a low frequency signal as a falling edge is not limited to the above example.

［第1の実施形態の音声復号装置の第2の変形例］
図7は、第1の実施形態に係る音声復号装置の第2の変形例10Bの構成を示す図である。 [Second variant of the audio decoding device of the first embodiment]
FIG. 7 is a diagram showing a configuration of a second modification 10B of the audio decoding device according to the first embodiment.

第1の実施形態に係る音声復号装置の第1の変形例との相違点は、低周波数時間包絡形状決定部10eBが、分析フィルタバンク部10cから低周波数信号の複数のサブバンド信号を受け取り、低周波数信号の時間包絡形状を決定する点である（ステップS10-5a相当処理）。 The difference from the first modification of the voice decoding apparatus according to the first embodiment is that the low frequency time entrainment shape determining unit 10eB receives a plurality of subband signals of low frequency signals from the analysis filter bank unit 10c. This is the point of determining the time entrainment shape of the low frequency signal (process equivalent to step S10-5a).

例えば、低周波数信号の時間包絡形状を平坦と決定する。例えば、任意の時間セグメントt_E(l)≦i<t_E(l+1)内でB_LO(m) (m=0,…,M_LO, M_LO≧1) (B_LO(0)≧0, B_LO(M_LO)<k_x)で境界を表されるM_LO個の周波数帯域に分割し、m番目の周波数帯域に含まれる低周波数信号のサブバンド信号X_dec,LO(k,i) (B_LO(m)≦k<B_LO(m+1), t_E(l)≦i<t_E(l+1))の時間包絡E_dec,LO(k,i)またはそれに準ずるパラメータを求め、所定の閾値と比較して時間包絡形状が平坦か否かまたは平坦さの程度を決定する。時間包絡E_dec,LO(k,i)は、例えば式（8）により算出できるが、これに限定されない。さらに別の例では、低周波数信号のサブバンド信号X_dec,LO(k,i) (B_LO(m)≦k<B_LO(m+1), t_E(l)≦i<t_E(l+1))の時間包絡E_dec,LO(k,i)またはそれに準ずるパラメータの相加平均と相乗平均の比またはそれに準ずるパラメータを算出し、所定の閾値とを比較して時間包絡形状が平坦か否かまたは平坦さの程度を決定する。時間包絡E_dec,LO(k,i)は、例えば式（8）により算出できるが、これに限定されない。低周波数信号の時間包絡形状を平坦と決定する方法は上記の例に限定されない。 For example, the time envelope shape of a low frequency signal is determined to be flat. For example, within any time segment t _E (l) ≤ i <t _E (l + 1), B _LO (m) (m = 0,…, M _LO , M _LO ≥ 1) (B _LO (0) ≥ It is divided into M _LO frequency bands whose boundaries are represented by 0, B _LO (M _LO ) <k _x ), and the sub-band signal of the low frequency signal included in the mth frequency band X _{dec, LO} (k, i) Time envelope of (B _LO (m) ≤ k <B _LO (m + 1), t _E (l) ≤ i <t _E (l + 1)) E _{dec, LO} (k, i) or equivalent The parameters are determined and compared with a predetermined threshold to determine whether the time envelope shape is flat or not or the degree of flatness. The time envelope E _{dec, LO} (k, i) can be calculated by, for example, Eq. (8), but is not limited thereto. In yet another example, the low frequency signal subband signal X _{dec, LO} (k, i) (B _LO (m) ≤ k <B _LO (m + 1), t _E (l) ≤ i <t _E ( l + 1)) Time envelope E _{dec, LO} (k, i) or the ratio of the arithmetic mean and geometric mean of the parameters equivalent to it, or the parameter equivalent to it is calculated, and the time envelope shape is compared with the predetermined threshold. Determines whether it is flat or not or the degree of flatness. The time envelope E _{dec, LO} (k, i) can be calculated by, for example, Eq. (8), but is not limited thereto. The method for determining the time envelope shape of a low frequency signal as flat is not limited to the above example.

さらに例えば、低周波数信号の時間包絡形状を立ち上がりと決定する。例えば、任意の時間セグメントt_E(l)≦i<t_E(l+1)内において、低周波数信号のサブバンド信号X_dec,LO(k,i) (B_LO(m)≦k<B_LO(m+1), t_E(l)≦i<t_E(l+1))の時間包絡E_dec,LO(k,i)の差分値の最大値を算出する。例えば式（９）により算出できる。当該差分値の最大値を所定の閾値と比較して時間包絡形状が立ち上がりか否かまたは立ち上がりの程度を決定する。さらには、時間包絡に代えて当該時間包絡を時間方向に平滑化したパラメータを用いることができる。低周波数信号の時間包絡形状を立ち上がりと決定する方法は上記の例に限定されない。 Further, for example, the time envelope shape of the low frequency signal is determined to be the rising edge. For example, within an arbitrary time segment t _E (l) ≤ i <t _E (l + 1), the subband signal X _{dec, LO} (k, i) of the low frequency signal (B _LO (m) ≤ k <B Calculate the maximum difference value of the time envelope E _{dec, LO} (k, i) of _LO (m + 1), t _E (l) ≤ i <t _E (l + 1)). For example, it can be calculated by the formula (9). The maximum value of the difference value is compared with a predetermined threshold value to determine whether or not the time envelope shape rises or the degree of rise. Further, instead of the time envelope, a parameter obtained by smoothing the time envelope in the time direction can be used. The method for determining the time envelope shape of a low frequency signal as a rising edge is not limited to the above example.

さらに例えば、低周波数信号の時間包絡形状を立ち下がりと決定する。低周波数信号のサブバンド信号X_dec,LO(k,i) (B_LO(m)≦k<B_LO(m+1), t_E(l)≦i<t_E(l+1))の時間包絡E_dec,LO(k,i)の差分値の最小値を算出する。例えば式（１０）により算出できる。当該差分値の最小値を所定の閾値と比較して時間包絡形状が立ち下がりか否かまたは立ち下がりの程度を決定する。さらには、時間包絡に代えて当該時間包絡を時間方向に平滑化したパラメータを用いることができる。低周波数信号の時間包絡形状を立ち下がりと決定する方法は上記の例に限定されない。 Further, for example, the time envelope shape of the low frequency signal is determined to be falling. Low frequency signal subband signal X _{dec, LO} (k, i) (B _LO (m) ≤ k <B _LO (m + 1), t _E (l) ≤ i <t _E (l + 1)) Calculate the minimum value of the difference value of the time envelope E _{dec, LO} (k, i). For example, it can be calculated by the formula (10). The minimum value of the difference value is compared with a predetermined threshold value to determine whether or not the time envelope shape falls or the degree of the fall. Further, instead of the time envelope, a parameter obtained by smoothing the time envelope in the time direction can be used. The method for determining the time envelope shape of a low frequency signal as a falling edge is not limited to the above example.

［第1の実施形態の音声復号装置の第3の変形例］
図8は、第1の実施形態に係る音声復号装置の第3の変形例10Cの構成を示す図である。 [Third variant of the audio decoding device of the first embodiment]
FIG. 8 is a diagram showing a configuration of a third modification 10C of the audio decoding device according to the first embodiment.

低周波数時間包絡形状決定部10eCは、符号化系列解析部10dからの低周波時間包絡形状に関する情報、コア復号部10bからの低周波数信号、分析フィルタバンク部10cからの低周波数信号の複数のサブバンド信号のうち少なくとも一つを受け取り、低周波数信号の時間包絡形状を決定する（図2のステップS10-5に相当）。 The low frequency time entrainment shape determination unit 10eC is a plurality of subs of information on the low frequency time entrainment shape from the coding sequence analysis unit 10d, the low frequency signal from the core decoding unit 10b, and the low frequency signal from the analysis filter bank unit 10c. Receives at least one of the band signals and determines the time entrapment shape of the low frequency signal (corresponding to step S10-5 in Figure 2).

例えば、低周波数信号の時間包絡形状を平坦と決定する。この場合、上記第1の実施形態の音声復号装置、当該復号装置の第1及び第2の変形例にて記載の低周波数信号の時間包絡形状を平坦と決定する方法を少なくとも一つ以上組み合わせて時間包絡形状を平坦と決定する。低周波数信号の時間包絡形状を平坦と決定する方法は上記に限定されない。 For example, the time envelope shape of a low frequency signal is determined to be flat. In this case, at least one combination of the voice decoding device of the first embodiment and the method of determining the time envelope shape of the low frequency signal as flat according to the first and second modifications of the decoding device. Determine the time envelope shape to be flat. The method for determining the time envelope shape of a low frequency signal as flat is not limited to the above.

例えば、低周波数信号の時間包絡形状を立ち上がりと決定する。この場合、上記第1の実施形態の音声復号装置、当該復号装置の第1及び第2の変形例にて記載の低周波数信号の時間包絡形状を立ち上がりと決定する方法を少なくとも一つ以上組み合わせて時間包絡形状を立ち上がりと決定する。低周波数信号の時間包絡形状を立ち上がりと決定する方法は上記に限定されない。 For example, the time envelope shape of a low frequency signal is determined to be the rising edge. In this case, at least one combination of the voice decoding device of the first embodiment and the method of determining the time envelope shape of the low frequency signal as the rising edge described in the first and second modifications of the decoding device is combined. The time envelope shape is determined to be the rising edge. The method for determining the time envelope shape of the low frequency signal as the rising edge is not limited to the above.

例えば、低周波数信号の時間包絡形状を立ち下がりと決定する。この場合、上記第1の実施形態の音声復号装置、当該復号装置の第1及び第2の変形例にて記載の低周波数信号の時間包絡形状を立ち下がりと決定する方法を少なくとも一つ以上組み合わせて時間包絡形状を立ち下がりと決定する。低周波数信号の時間包絡形状を立ち下がりと決定する方法は上記に限定されない。 For example, the time envelope shape of a low frequency signal is determined to be falling. In this case, at least one combination of the voice decoding device of the first embodiment and the method of determining the time envelope shape of the low frequency signal as the falling edge described in the first and second modifications of the decoding device. The time-envelope shape is determined to be falling. The method for determining the time-envelope shape of a low-frequency signal as a falling edge is not limited to the above.

［第1の実施形態の音声符号化装置の第1の変形例］
図9は、第1の実施形態に係る音声符号化装置の第1の変形例20Aの構成を示す図である。 [First modification of the voice coding device of the first embodiment]
FIG. 9 is a diagram showing a configuration of a first modification 20A of the voice coding device according to the first embodiment.

図10は、第1の実施形態に係る音声符号化装置の第1の変形例20Aの動作を示すフローチャートである。 FIG. 10 is a flowchart showing the operation of the first modification 20A of the voice coding device according to the first embodiment.

時間包絡情報符号化部20gAは、包絡算出部20eにて算出した低周波数信号のサブバンド信号のパワーを用いて低周波数信号の時間包絡を算出し、当該時間包絡より時間包絡情報を符号化する（ステップS20-10a）。当該処理において、低周波数信号のサブバンド信号のパワーが算出されていない場合は、時間包絡情報符号化部20gAにて低周波数信号のサブバンド信号のパワーを算出してもよく、低周波数信号のサブバンド信号のパワーがどこで算出されるかは限定されない。 The time envelope information coding unit 20gA calculates the time envelope of the low frequency signal using the power of the subband signal of the low frequency signal calculated by the envelope calculation unit 20e, and encodes the time envelope information from the time envelope. (Step S20-10a). When the power of the subband signal of the low frequency signal is not calculated in the process, the power of the subband signal of the low frequency signal may be calculated by the time wrapping information coding unit 20 gA, and the power of the subband signal of the low frequency signal may be calculated. Where the power of the subband signal is calculated is not limited.

例えば、時間包絡情報として、時間包絡形状の平坦さの程度を表す情報を算出する。例えば、任意の時間セグメントt_E(l)≦i<t_E(l+1)内でB_LO(m) (m=0,…,M_LO, M_LO≧1) (B_LO(0)≧0, B_LO(M_LO)<k_x)で境界を表されるM_LO個の周波数帯域に分割し、m番目の周波数帯域に含まれる低周波数信号のサブバンド信号X_LO(k,i) (B_LO(m)≦k<B_LO(m+1), t_E(l)≦i<t_E(l+1))の時間包絡E_LO(k,i)を式（７）により算出する。また時間包絡E_LO(k,i)の算出方法は式（７）に限定されない。時間包絡E_LO(k,i)の分散またはそれに準ずるパラメータを算出し、当該パラメータを符号化する。さらに別の例では、時間包絡E_LO(k,i)の相加平均と相乗平均の比またはそれに準ずるパラメータを算出し、当該パラメータを符号化する。低周波数信号の時間包絡形状の平坦さの程度を表す情報の算出方法は上記の例に限定されない。 For example, as the time envelope information, information indicating the degree of flatness of the time envelope shape is calculated. For example, within any time segment t _E (l) ≤ i <t _E (l + 1), B _LO (m) (m = 0,…, M _LO , M _LO ≥ 1) (B _LO (0) ≥ It is divided into M _LO frequency bands whose boundaries are represented by 0, B _LO (M _LO ) <k _x ), and the subband signal X _LO (k, i) of the low frequency signal included in the mth frequency band. Calculate the time-wrapping E _LO (k, i) of (B _LO (m) ≤ k <B _LO (m + 1), t _E (l) ≤ i <t _E (l + 1)) by Eq. (7). To do. Moreover, the calculation method of the time envelope E _LO (k, i) is not limited to the equation (7). Calculate the variance of the time envelope E _LO (k, i) or a parameter equivalent thereto, and encode the parameter. In yet another example, the ratio of the arithmetic mean to the geometric mean of the time-envelope _ELO (k, i) or a parameter equivalent thereto is calculated and the parameter is encoded. The method of calculating the information indicating the degree of flatness of the time envelope shape of the low frequency signal is not limited to the above example.

さらに、例えば、時間包絡情報として、時間包絡形状の立ち上がりの程度を表す情報を算出する。例えば、時間包絡E_LO(k,i)のの時間方向の差分値を算出し、当該差分値の任意の時間セグメント内の最大値を算出し符号化する。低周波数信号の時間包絡形状を立ち上がりの程度を表す情報の算出方法は上記の例に限定されない。 Further, for example, as the time envelope information, information indicating the degree of rise of the time envelope shape is calculated. For example, the difference value in the time direction of the time envelope E _LO (k, i) is calculated, and the maximum value of the difference value in an arbitrary time segment is calculated and encoded. The method of calculating the information indicating the degree of rise of the time envelope shape of the low frequency signal is not limited to the above example.

さらに、例えば、時間包絡情報として、時間包絡形状の立ち下がりの程度を表す情報を算出する。例えば、時間包絡E_LO(k,i)のの時間方向の差分値を算出し、当該差分値の任意の時間セグメント内の最小値を算出し符号化する。低周波数信号の時間包絡形状を立ち下がりの程度を表す情報の算出方法は上記の例に限定されない。 Further, for example, as the time envelope information, information indicating the degree of falling of the time envelope shape is calculated. For example, the difference value in the time direction of the time envelope E _LO (k, i) is calculated, and the minimum value of the difference value in an arbitrary time segment is calculated and encoded. The method of calculating the information indicating the degree of falling of the time envelope shape of the low frequency signal is not limited to the above example.

［第2の実施形態］
図11は、第2の実施形態に係る音声復号装置11の構成を示す図である。音声復号装置11の通信装置は、下記音声符号化装置21から出力される多重化された符号化系列を受信し、更に、復号した音声信号を外部に出力する。音声復号装置11は、図11に示すように、機能的には、符号化系列逆多重化部10a、コア復号部10b、分析フィルタバンク部10c、符号化系列解析部10d、低周波数時間包絡形状決定部10e、低周波数時間包絡修正部10f、高周波数信号生成部10g、復号/逆量子化部10h、周波数包絡調整部10i、及び合成フィルタバンク部10jを備える。 [Second Embodiment]
FIG. 11 is a diagram showing a configuration of the audio decoding device 11 according to the second embodiment. The communication device of the voice decoding device 11 receives the multiplexed coding sequence output from the following voice coding device 21, and further outputs the decoded voice signal to the outside. As shown in FIG. 11, the voice decoding device 11 functionally has a coded sequence demultiplexing section 10a, a core decoding section 10b, an analysis filter bank section 10c, a coded sequence analysis section 10d, and a low frequency time wrapping shape. It includes a determination unit 10e, a low frequency time wrapping correction unit 10f, a high frequency signal generation unit 10g, a decoding / dequantization unit 10h, a frequency wrapping adjustment unit 10i, and a synthesis filter bank unit 10j.

図12は、第2の実施形態に係る音声復号装置11の動作を示すフローチャートである。 FIG. 12 is a flowchart showing the operation of the voice decoding device 11 according to the second embodiment.

高周波数信号生成部10gの動作における第1の実施形態に係る音声復号装置11の高周波数信号生成部10gとの相違点は、低周波数時間包絡修正部10fで時間包絡形状を修正された低周波数信号のサブバンド信号から高周波数信号を生成する点である。 The difference from the high frequency signal generation unit 10g of the voice decoding device 11 according to the first embodiment in the operation of the high frequency signal generation unit 10g is the low frequency whose time wrapping shape is corrected by the low frequency time wrapping correction unit 10f. The point is that a high frequency signal is generated from the subband signal of the signal.

図13は、第2の実施形態に係る音声符号化装置21の構成を示す図である。音声符号化装置21の通信装置は、符号化の対象となる音声信号を外部から受信し、更に、符号化された符号化系列を外部に出力する。音声符号化装置21は、図13に示すように、機能的には、ダウンサンプリング部20a、コア符号化部20b、分析フィルタバンク部20c及び20c1、制御パラメータ符号化部20d、包絡算出部20e、量子化/符号化部20f、時間包絡情報符号化部21a、符号化系列多重化部20h、サブバンド信号パワー算出部20j、及びコア復号信号生成部20iを備える。 FIG. 13 is a diagram showing a configuration of the voice coding device 21 according to the second embodiment. The communication device of the voice coding device 21 receives the voice signal to be coded from the outside, and further outputs the coded coding sequence to the outside. As shown in FIG. 13, the voice coding device 21 functionally includes a downsampling unit 20a, a core coding unit 20b, an analysis filter bank unit 20c and 20c1, a control parameter coding unit 20d, and an entrapment calculation unit 20e. It includes a quantization / coding unit 20f, a time-wrapping information coding unit 21a, a coding sequence multiplexing unit 20h, a subband signal power calculation unit 20j, and a core decoding signal generation unit 20i.

図14は、第2の実施形態に係る音声符号化装置21の動作を示すフローチャートである。 FIG. 14 is a flowchart showing the operation of the voice coding device 21 according to the second embodiment.

時間包絡情報符号化部21aは、包絡算出部20eにて算出した低周波数信号のサブバンド信号のパワー、高周波数信号のサブバンド信号のパワーを用いて低周波数信号の時間包絡及び高周波数信号の時間包絡を算出し、同様にサブバンド信号パワー算出部20jにて算出されたコア復号信号のサブバンド信号のパワーを用いてコア復号信号の時間包絡を算出し、当該低周波数信号の時間包絡、高周波数信号の時間包絡、及びコア復号信号の時間包絡より時間包絡情報を符号化する（ステップS21-1）。当該処理において、低周波数信号のサブバンド信号のパワーが算出されていない場合は、時間包絡情報符号化部21aにて低周波数信号のサブバンド信号のパワーを算出してもよく、低周波数信号のサブバンド信号のパワーがどこで算出されるかは限定されない。当該処理において、高周波数信号のサブバンド信号のパワーが算出されていない場合は、時間包絡情報符号化部21aにて高周波数信号のサブバンド信号のパワーを算出してもよく、高周波数信号のサブバンド信号のパワーがどこで算出されるかは限定されない。 The time wrapping information coding unit 21a uses the power of the subband signal of the low frequency signal calculated by the wrapping calculation unit 20e and the power of the subband signal of the high frequency signal to generate the time wrapping of the low frequency signal and the high frequency signal. The time wrapping is calculated, and the time wrapping of the core decoding signal is calculated using the power of the subband signal of the core decoding signal similarly calculated by the subband signal power calculation unit 20j. The time wrapping information is encoded from the time wrapping of the high frequency signal and the time wrapping of the core decoded signal (step S21-1). When the power of the subband signal of the low frequency signal is not calculated in the processing, the power of the subband signal of the low frequency signal may be calculated by the time wrapping information coding unit 21a, and the power of the subband signal of the low frequency signal may be calculated. Where the power of the subband signal is calculated is not limited. When the power of the subband signal of the high frequency signal is not calculated in the processing, the power of the subband signal of the high frequency signal may be calculated by the time wrapping information coding unit 21a, and the power of the subband signal of the high frequency signal may be calculated. Where the power of the subband signal is calculated is not limited.

具体的には、例えば、任意の時間セグメントt_E(l)≦i<t_E(l+1)内でB_LO(m) (m=0,…,M_LO, M_LO≧1) (B_LO(0)≧0, B_LO(M_LO)<k_x)で境界を表されるM_LO個の周波数帯域に分割し、m番目の周波数帯域に含まれる低周波数信号のサブバンド信号X_LO(k,i) (B_LO(m)≦k<B_LO(m+1), t_E(l)≦i<t_E(l+1))の時間包絡E_LO(k,i)、及びコア復号信号のサブバンド信号X_dec,LO(k,i) (B_LO(m)≦k<B_LO(m+1), t_E(l)≦i<t_E(l+1))の時間包絡E_dec,LO(k,i)を、それぞれ式（７）及び式（８）を用いて算出する。同様に、任意の時間セグメントt_E(l)≦i<t_E(l+1)内でB_HI(m) (m=0,…,M_HI, M_HI≧1) (B_HI(0)≧k_x, B_HI(M_HI)<k_h)で境界を表されるM_HI個の周波数帯域に分割し、m番目の周波数帯域に含まれる高周波数信号のサブバンド信号X_HI(k,i) (B_HI(m)≦k<B_HI(m+1), t_E(l)≦i<t_E(l+1))の時間包絡E_HI(k,i)を算出する。

高周波数信号のサブバンド信号の時間包絡は、高周波数信号のサブバンド信号の大きさの時間方向の変動がわかるパラメータであれば良く、前記の例に限定されない。 Specifically, for example, within an arbitrary time segment t _E (l) ≤ i <t _E (l + 1), B _LO (m) (m = 0,…, M _LO , M _LO ≥ 1) (B) The subband signal X _LO of the low frequency signal included in the mth frequency band is divided into M _LO frequency bands whose boundaries are represented by _LO (0) ≥ 0, B _LO (M _LO ) <k _x ). (k, i) (B _LO (m) ≤ k <B _LO (m + 1), t _E (l) ≤ i <t _E (l + 1)) time envelope E _LO (k, i), and Subband signal of core decoding signal X _{dec, LO} (k, i) (B _LO (m) ≤ k <B _LO (m + 1), t _E (l) ≤ i <t _E (l + 1)) The time envelope E _{dec and LO} (k, i) are calculated using Eqs. (7) and (8), respectively. Similarly, within any time segment t _E (l) ≤ i <t _E (l + 1), B _HI (m) (m = 0,…, M _HI , M _HI ≥ 1) (B _HI (0)) Divided into M _HI frequency bands whose boundaries are represented by ≧ k _x , B _HI (M _HI ) <k _h ), and the sub-band signal X _HI (k, k,) of the high frequency signal included in the mth frequency band. i) Calculate the time envelope E _HI (k, i) of (B _HI (m) ≤ k <B _HI (m + 1), t _E (l) ≤ i <t _E (l + 1)).

The time envelope of the subband signal of the high frequency signal may be any parameter as long as it is a parameter that shows the variation in the magnitude of the subband signal of the high frequency signal in the time direction, and is not limited to the above example.

例えば、時間包絡情報符号化部21aは時間包絡情報として平坦の程度を表す情報を算出する。例えば、低周波数信号、コア復号信号及び高周波数信号のサブバンド信号の時間包絡の分散またはそれに準ずるパラメータを算出する。さらに別の例では、低周波数信号、コア復号信号及び高周波数信号のサブバンド信号の時間包絡の相加平均と相乗平均の比またはそれに準ずるパラメータを算出する。この場合、時間包絡情報符号化部21aは、時間包絡情報として当該低周波数信号及び高周波数信号のうち少なくとも1つ以上のサブバンド信号の時間包絡の平坦さを表す情報を算出すればよく、前記の例に限定されない。そして、前記パラメータを符号化する。例えば、低周波数信号とコア復号信号の当該パラメータの差分値またはその絶対値を符号化する。さらに、例えば、低周波数信号と高周波数信号の当該パラメータの値または絶対値を符号化する。例えば、時間包絡の平坦さを平坦か否かで表現すれば1ビットで符号化でき、例えば、前記任意の時間セグメント内において前記M_LO個の周波数帯域毎に当該情報をM_LOビットで符号化できる。時間包絡情報の符号化方法は前記の例に限定されない。 For example, the time-envelope information coding unit 21a calculates information indicating the degree of flatness as time-envelope information. For example, the variance of the time envelope of the subband signals of the low frequency signal, the core decoding signal, and the high frequency signal, or a parameter equivalent thereto is calculated. In yet another example, the ratio of the arithmetic mean to the geometric mean of the time-envelopes of the subband signals of the low frequency signal, the core decoded signal and the high frequency signal or a parameter equivalent thereto is calculated. In this case, the time envelope information coding unit 21a may calculate as the time envelope information information indicating the flatness of the time envelope of at least one or more subband signals of the low frequency signal and the high frequency signal. It is not limited to the example of. Then, the parameter is encoded. For example, the difference value or the absolute value of the parameter of the low frequency signal and the core decoding signal is encoded. Further, for example, the values or absolute values of the parameters of the low frequency signal and the high frequency signal are encoded. For example, if the flatness of the time envelope is expressed by whether it is flat or not, it can be encoded with 1 bit. For example, the information is encoded with M _LO bits for each of the M _LO frequency bands in the arbitrary time segment. it can. The method for encoding the time envelope information is not limited to the above example.

さらに、例えば、時間包絡情報符号化部21aは時間包絡情報として立ち上がりの程度を表す情報を算出する。例えば、任意の時間セグメントt_E(l)≦i<t_E(l+1)内において、低周波数信号のサブバンド信号の時間包絡の時間方向の差分値の最大値を、式（９）を用いて算出する。同様に、例えば、任意の時間セグメントt_E(l)≦i<t_E(l+1)内において、高周波数信号のサブバンド信号の時間包絡の時間方向の差分値の最大値を算出する。

さらには、式（１２）において、時間包絡に代えて当該時間包絡を時間方向に平滑化したパラメータの時間方向の差分値の最大値を算出できる。この場合、時間包絡情報符号化部21aは、時間包絡情報として当該低周波数信号及び高周波数信号のうち少なくとも1つ以上のサブバンド信号の時間包絡の立ち上がりの程度を表す情報を算出すればよく、前記の例に限定されない。そして、前記パラメータを符号化する。例えば、低周波数信号とコア復号信号の当該パラメータの差分値またはその絶対値を符号化する。さらに、例えば、低周波数信号と高周波数信号の当該パラメータの値を符号化する。例えば、時間包絡の立ち上がりの程度を立ち上がりか否かで表現すれば1ビットで符号化でき、例えば、前記任意の時間セグメント内において前記M_LO個の周波数帯域毎に当該情報をM_LOビットで符号化できる。時間包絡情報の符号化方法は前記の例に限定されない。 Further, for example, the time-envelope information coding unit 21a calculates information indicating the degree of rise as time-envelope information. For example, in an arbitrary time segment t _E (l) ≤ i <t _E (l + 1), the maximum value of the difference value in the time direction of the time envelope of the subband signal of the low frequency signal is calculated by the equation (9). Calculate using. Similarly, for example, within an arbitrary time segment t _E (l) ≤ i <t _E (l + 1), the maximum value of the difference value in the time direction of the time envelope of the subband signal of the high frequency signal is calculated.

Further, in the equation (12), instead of the time envelope, the maximum value of the difference value in the time direction of the parameter obtained by smoothing the time envelope in the time direction can be calculated. In this case, the time-envelope information coding unit 21a may calculate as the time-envelope information information indicating the degree of rise of the time-envelope of at least one or more subband signals of the low-frequency signal and the high-frequency signal. It is not limited to the above example. Then, the parameter is encoded. For example, the difference value or the absolute value of the parameter of the low frequency signal and the core decoding signal is encoded. Further, for example, the values of the parameters of the low frequency signal and the high frequency signal are encoded. For example, if the degree of rise of the time envelope is expressed by whether it rises or not, it can be encoded with 1 bit. For example, the information is coded with M _LO bits for each of the M _LO frequency bands in the arbitrary time segment. Can be converted. The method for encoding the time envelope information is not limited to the above example.

さらに例えば、時間包絡情報符号化部21aは時間包絡情報として立ち下がりの程度を表す情報を算出する。例えば、任意の時間セグメントt_E(l)≦i<t_E(l+1)内において、低周波数信号のサブバンド信号の時間包絡の時間方向の差分値の最小値を、式（１０）を用いて算出する。同様に、例えば、任意の時間セグメントt_E(l)≦i<t_E(l+1)内において、高周波数信号のサブバンド信号の時間包絡の時間方向の差分値の最小値を算出する。

さらには、式（１３）において、時間包絡に代えて当該時間包絡を時間方向に平滑化したパラメータの時間方向の差分値の最小値を算出できる。この場合、時間包絡情報符号化部21aは、時間包絡情報として当該低周波数信号及び高周波数信号のうち少なくとも1つ以上のサブバンド信号の時間包絡の立ち下がりの程度を表す情報を算出すればよく、前記の例に限定されない。そして、前記パラメータを符号化する。例えば、低周波数信号とコア復号信号の当該パラメータの差分値またはその絶対値を符号化する。さらに、例えば、低周波数信号と高周波数信号の当該パラメータの値を符号化する。。例えば、時間包絡の立ち下がりの程度を立ち下がりか否かで表現すれば1ビットで符号化でき、例えば、前記任意の時間セグメント内において前記M_LO個の周波数帯域毎に当該情報をM_LOビットで符号化できる。時間包絡情報の符号化方法は前記の例に限定されない。 Further, for example, the time-envelope information coding unit 21a calculates information indicating the degree of fall as time-envelope information. For example, in an arbitrary time segment t _E (l) ≤ i <t _E (l + 1), the minimum value of the difference value in the time direction of the time envelope of the subband signal of the low frequency signal is calculated by the equation (10). Calculate using. Similarly, for example, within an arbitrary time segment t _E (l) ≤ i <t _E (l + 1), the minimum value of the difference value in the time direction of the time envelope of the subband signal of the high frequency signal is calculated.

Further, in the equation (13), instead of the time envelope, the minimum value of the difference value in the time direction of the parameter obtained by smoothing the time envelope in the time direction can be calculated. In this case, the time envelope information coding unit 21a may calculate as the time envelope information information indicating the degree of the fall of the time envelope of at least one or more subband signals of the low frequency signal and the high frequency signal. , Not limited to the above example. Then, the parameter is encoded. For example, the difference value or the absolute value of the parameter of the low frequency signal and the core decoding signal is encoded. Further, for example, the values of the parameters of the low frequency signal and the high frequency signal are encoded. .. For example, if the degree of the fall of the time envelope is expressed by whether or not it falls, it can be encoded with 1 bit. For example, the information can be encoded in the M _LO bits for each of the M _LO frequency bands in the arbitrary time segment. Can be encoded with. The method for encoding the time envelope information is not limited to the above example.

［第2の実施形態の音声符号化装置の第1の変形例］
図15は、第2の実施形態に係る音声符号化装置の第1の変形例21Aの構成を示す図である。 [First modification of the voice coding device of the second embodiment]
FIG. 15 is a diagram showing a configuration of a first modification 21A of the voice coding device according to the second embodiment.

図16は、第2の実施形態に係る音声符号化装置の第1の変形例21Aの動作を示すフローチャートである。 FIG. 16 is a flowchart showing the operation of the first modification 21A of the voice coding device according to the second embodiment.

時間包絡情報符号化部21aAは、包絡算出部20eにて算出した入力音声信号のサブバンド信号のパワーを用いて入力音声信号の時間包絡を算出し、当該時間包絡より時間包絡情報を符号化する（ステップS21-1a）。当該処理において、入力音声信号のサブバンド信号のパワーが算出されていない場合は、時間包絡情報符号化部21aAにて入力音声信号のサブバンド信号のパワーを算出してもよく、入力音声信号のサブバンド信号のパワーがどこで算出されるかは限定されない。 The time envelope information coding unit 21aA calculates the time envelope of the input voice signal using the power of the subband signal of the input voice signal calculated by the envelope calculation unit 20e, and encodes the time envelope information from the time envelope. (Step S21-1a). If the power of the sub-band signal of the input audio signal is not calculated in the process, the time-wrapping information coding unit 21aA may calculate the power of the sub-band signal of the input audio signal. Where the power of the subband signal is calculated is not limited.

例えば、時間包絡情報として、時間包絡形状の平坦さの程度を表す情報を算出する。例えば、任意の時間セグメントt_E(l)≦i<t_E(l+1)内でB_LO(m) (m=0,…,M_LO, M_LO≧1) (B_LO(0)≧0, B_LO(M_LO)<k_x)で境界を表されるM_LO個の周波数帯域に分割し、m番目の周波数帯域に含まれる低周波数信号のサブバンド信号X_LO(k,i) (B_LO(m)≦k<B_LO(m+1), t_E(l)≦i<t_E(l+1))の時間包絡E_LO(k,i)を式（７）により算出する。また時間包絡E_LO(k,i)の算出方法は式（７）に限定されない。同様に、任意の時間セグメントt_E(l)≦i<t_E(l+1)内でB_HI(m) (m=0,…,M_HI, M_HI≧1) (B_HI(0)≧k_x, B_HI(M_HI)<k_h)で境界を表されるM_HI個の周波数帯域に分割し、m番目の周波数帯域に含まれる低周波数信号のサブバンド信号X_HI(k,i) (B_HI(m)≦k<B_HI(m+1), t_E(l)≦i<t_E(l+1))の時間包絡E_HI(k,i)を式（１１）により算出する。また時間包絡E_HI(k,i)の算出方法は式（１１）に限定されない。時間包絡E_LO(k,i)の分散またはそれに準ずるパラメータ、及び時間包絡E_HI(k,i)の分散またはそれに準ずるパラメータのうち少なくとも1つ以上を算出し、当該パラメータをそれぞれ別々にまたは組み合わせて符号化する。さらに別の例では、時間包絡E_LO(k,i)の相加平均と相乗平均の比またはそれに準ずるパラメータ、及び時間包絡E_HI(k,i)の相加平均と相乗平均の比またはそれに準ずるパラメータを少なくとも1つ以上算出し、当該パラメータをそれぞれ別々にまたは組み合わせて符号化する。時間包絡形状の平坦さの程度を表す情報の算出方法は上記の例に限定されない。 For example, as the time envelope information, information indicating the degree of flatness of the time envelope shape is calculated. For example, within any time segment t _E (l) ≤ i <t _E (l + 1), B _LO (m) (m = 0,…, M _LO , M _LO ≥ 1) (B _LO (0) ≥ It is divided into M _LO frequency bands whose boundaries are represented by 0, B _LO (M _LO ) <k _x ), and the subband signal X _LO (k, i) of the low frequency signal included in the mth frequency band. Calculate the time-wrapping E _LO (k, i) of (B _LO (m) ≤ k <B _LO (m + 1), t _E (l) ≤ i <t _E (l + 1)) by Eq. (7). To do. Moreover, the calculation method of the time envelope E _LO (k, i) is not limited to the equation (7). Similarly, within any time segment t _E (l) ≤ i <t _E (l + 1), B _HI (m) (m = 0,…, M _HI , M _HI ≥ 1) (B _HI (0)) Divided into M _HI frequency bands whose boundaries are represented by ≧ k _x , B _HI (M _HI ) <k _h ), and the sub-band signal X _HI (k, k,) of the low frequency signal included in the mth frequency band. i) (B _HI (m) ≤ k <B _HI (m + 1), t _E (l) ≤ i <t _E (l + 1)) time-wrapping E _HI (k, i) Calculated by Moreover, the calculation method of the time envelope E _HI (k, i) is not limited to the equation (11). Calculate at least one or more of the variance of the time-envelope E _LO (k, i) or its equivalent, and the variance of the time-envelope E _HI (k, i) or its equivalent, and combine the parameters separately or in combination. To encode. In yet another example, the ratio of the arithmetic mean to the geometric mean of the time-wrapped E _LO (k, i) or a parameter equivalent thereto, and the ratio of the arithmetic mean to the geometric mean of the time-wrapped E _HI (k, i) or the geometric mean thereof. Calculate at least one equivalent parameter and encode the parameters separately or in combination. The method of calculating the information indicating the degree of flatness of the time envelope shape is not limited to the above example.

さらに、例えば、時間包絡情報として、時間包絡形状の立ち上がりの程度を表す情報を算出する。例えば、時間包絡E_LO(k,i)の時間方向の差分値を算出し、当該差分値の任意の時間セグメント内の最大値を算出する。同様に、時間包絡E_HI(k,i)の時間方向の差分値を算出し、当該差分値の任意の時間セグメント内の最大値を算出する。当該パラメータをそれぞれ別々にまたは組み合わせて符号化する。低周波数信号の時間包絡形状を立ち上がりの程度を表す情報の算出方法は上記の例に限定されない。 Further, for example, as the time envelope information, information indicating the degree of rise of the time envelope shape is calculated. For example, the difference value in the time direction of the time envelope E _LO (k, i) is calculated, and the maximum value of the difference value in an arbitrary time segment is calculated. Similarly, the difference value in the time direction of the time envelope E _HI (k, i) is calculated, and the maximum value of the difference value in an arbitrary time segment is calculated. The parameters are encoded separately or in combination. The method of calculating the information indicating the degree of rise of the time envelope shape of the low frequency signal is not limited to the above example.

さらに、例えば、時間包絡情報として、時間包絡形状の立ち下がりの程度を表す情報を算出する。例えば、時間包絡E_LO(k,i)の時間方向の差分値を算出し、当該差分値の任意の時間セグメント内の最小値を算出する。同様に、時間包絡E_HI(k,i)の時間方向の差分値を算出し、当該差分値の任意の時間セグメント内の最小値を算出する。当該パラメータをそれぞれ別々にまたは組み合わせて符号化する。低周波数信号の時間包絡形状を立ち下がりの程度を表す情報の算出方法は上記の例に限定されない。 Further, for example, as the time envelope information, information indicating the degree of falling of the time envelope shape is calculated. For example, the difference value in the time direction of the time envelope E _LO (k, i) is calculated, and the minimum value of the difference value in an arbitrary time segment is calculated. Similarly, the difference value of the time envelope E _HI (k, i) in the time direction is calculated, and the minimum value of the difference value in an arbitrary time segment is calculated. The parameters are encoded separately or in combination. The method of calculating the information indicating the degree of falling of the time envelope shape of the low frequency signal is not limited to the above example.

当該第2の実施形態の低周波数時間包絡形状決定部10eに対して、本発明の第1の実施形態の第1、第2および第3の変形例が適用できることは明白である。 It is clear that the first, second and third modifications of the first embodiment of the present invention can be applied to the low frequency time envelope shape determining unit 10e of the second embodiment.

当該第2の実施形態の音声復号装置11は、本発明の第1の実施形態の音声符号化装置20及びその第1の変形例の音声符号化装置20Aにより符号化された符号化系列を復号できる。 The voice decoding device 11 of the second embodiment decodes the coding sequence encoded by the voice coding device 20 of the first embodiment of the present invention and the voice coding device 20A of the first modification thereof. it can.

［第3の実施形態］
図17は、第3の実施形態に係る音声復号装置12の構成を示す図である。音声復号装置12の通信装置は、下記音声符号化装置22から出力される多重化された符号化系列を受信し、更に、復号した音声信号を外部に出力する。音声復号装置12は、図17に示すように、機能的には、符号化系列逆多重化部10a、コア復号部10b、分析フィルタバンク部10c、符号化系列解析部10d、低周波数時間包絡形状決定部10e、低周波数時間包絡修正部12a、高周波数信号生成部10g、復号/逆量子化部10h、周波数包絡調整部10i、及び合成フィルタバンク部10jを備える。 [Third Embodiment]
FIG. 17 is a diagram showing a configuration of the audio decoding device 12 according to the third embodiment. The communication device of the voice decoding device 12 receives the multiplexed coding sequence output from the following voice coding device 22, and further outputs the decoded voice signal to the outside. As shown in FIG. 17, the voice decoding device 12 functionally has a coded sequence demultiplexing section 10a, a core decoding section 10b, an analysis filter bank section 10c, a coded sequence analysis section 10d, and a low frequency time wrapping shape. It includes a determination unit 10e, a low frequency time wrapping correction unit 12a, a high frequency signal generation unit 10g, a decoding / dequantization unit 10h, a frequency wrapping adjustment unit 10i, and a synthesis filter bank unit 10j.

図18は、第3の実施形態に係る音声復号装置12の動作を示すフローチャートである。 FIG. 18 is a flowchart showing the operation of the voice decoding device 12 according to the third embodiment.

低周波数時間包絡修正部12aは、低周波数時間包絡形状決定部10eで決定した時間包絡形状に基づいて、コア復号部10bから出力される低周波数信号の時間包絡の形状を修正する（ステップS12-1）。 The low frequency time envelope correction unit 12a corrects the time envelope shape of the low frequency signal output from the core decoding unit 10b based on the time envelope shape determined by the low frequency time envelope shape determination unit 10e (step S12-). 1).

例えば、低周波数時間包絡修正部12aは、任意の時間セグメントt_t,E(l)≦i<t_t,E(l+1))内の前記低周波数信号x_dec,LO(i)に対して、所定の関数F_t(x_dec,LO(i))を用いて以下の式（１４）

により得られるx’_dec,LO(i)を時間包絡形状が修正された低周波数信号として出力する。 For example, the low frequency time envelope correction unit 12a is applied to the low frequency signal x _{dec, LO} (i) in an arbitrary time segment t _{t, E} (l) ≤ i <t _{t, E} (l + 1)). Then, using the predetermined function F _t (x _{dec, LO} (i)), the following equation (14)

The _{x'dec, LO} (i) obtained by is output as a low frequency signal with the corrected time envelope shape.

例えば、前記低周波数信号の時間包絡形状が平坦と決定された場合、以下の処理により、低周波数信号の時間包絡形状を修正できる。例えば、当該低周波数信号x_dec,LO(i)に対して、所定の関数F_t(x_dec,LO(i))を、

として、x’_dec,LO(i)を時間包絡形状が修正された低周波数信号として出力する。
また別の例によれば、所定の関数F_t(x_dec,LO(i))を、低周波数信号x_dec,LO(i)に対して平滑化フィルタ処理を施す

(N_filt≧1)で定義して、x’_dec,LO(i)を時間包絡形状が修正された低周波数信号として出力する。上記の時間包絡形状を平坦に修正する処理の例は、それぞれを組み合わせて実施できる。低周波数時間包絡修正部10fは、低周波数信号の複数のサブバンド信号の時間包絡の形状を平坦に修正する処理を実施し、上記の例に限定されない。 For example, when the time envelope shape of the low frequency signal is determined to be flat, the time envelope shape of the low frequency signal can be corrected by the following processing. For example, for the low frequency signal x _{dec, LO} (i), a predetermined function F _t (x _{dec, LO} (i)) is applied.

As, _{x'dec, LO} (i) is output as a low frequency signal with the corrected time envelope shape.
According to another example, the predetermined function F _t (x _{dec, LO} (i)) is subjected to smoothing filtering on the low frequency signal x _{dec, LO} (i).

Defined by (N _filt ≧ 1), _{x'dec, LO} (i) is output as a low frequency signal with the corrected time envelope shape. The above-mentioned example of the process of correcting the time envelope shape flatly can be carried out in combination with each other. The low frequency time envelope correction unit 10f performs a process of flattening the shape of the time envelope of a plurality of subband signals of the low frequency signal, and is not limited to the above example.

さらには、例えば、前記低周波数信号の時間包絡形状が立ち上がりと決定された場合、以下の処理により、低周波数信号の時間包絡形状を修正できる。例えば、所定の関数F_t(x_dec,LO(i))を、iに対して単調増加する関数incr(i)を用いて

で定義して、x’_dec,LO(i)を時間包絡形状が修正された低周波数信号として出力する。低周波数時間包絡修正部10fは、低周波数信号の複数のサブバンド信号の時間包絡の形状を立ち上がりに修正する処理を実施し、上記の例に限定されない。 Further, for example, when the time envelope shape of the low frequency signal is determined to be rising, the time envelope shape of the low frequency signal can be corrected by the following processing. For example, using a function incr (i) that monotonically increases a given function F _t (x _{dec, LO} (i)) with respect to i.

Defined in, _{x'dec, LO} (i) is output as a low frequency signal with the corrected time envelope shape. The low frequency time envelope correction unit 10f performs a process of correcting the shape of the time envelope of a plurality of subband signals of the low frequency signal at the rising edge, and is not limited to the above example.

さらには、例えば、前記低周波数信号の時間包絡形状が立ち下がりと決定された場合、以下の処理により、低周波数信号の時間包絡形状を修正できる。例えば、所定の関数F_t(x_dec,LO(i))を、iに対して単調減少する関数decr(i)を用いて

で定義して、x’_dec,LO(i)を時間包絡形状が修正された低周波数信号として出力する。低周波数時間包絡修正部10fは、低周波数信号の複数のサブバンド信号の時間包絡の形状を立ち下がりに修正する処理を実施し、上記の例に限定されない。 Further, for example, when the time envelope shape of the low frequency signal is determined to be falling, the time envelope shape of the low frequency signal can be corrected by the following processing. For example, using the function decr (i) that monotonically decreases a given function F _t (x _{dec, LO} (i)) with respect to i.

Defined in, _{x'dec, LO} (i) is output as a low frequency signal with the corrected time envelope shape. The low frequency time envelope correction unit 10f performs a process of correcting the shape of the time envelope of a plurality of subband signals of the low frequency signal to a falling edge, and is not limited to the above example.

また別の例によれば、低周波数信号が離散フーリエ変換，離散コサイン変換，修正離散コサイン変換に代表される時間周波数変換により周波数領域の変換係数X_dec,LO(k) (0≦k<k_x)で表されたときは、所定の関数F_f(X_dec,LO(k))を用いて

により得られるX’_dec,LO(k)を時間包絡形状が修正された低周波数信号の周波数領域の変換係数として出力する。 According to another example, the low frequency signal is converted in the frequency domain by the time-frequency transform represented by the discrete Fourier transform, the discrete cosine transform, and the modified discrete cosine transform. X _{dec, LO} (k) (0 ≤ k <k). When represented by _x ), use the given function F _f (X _{dec, LO} (k))

_{X'dec, LO} (k) obtained by is output as the conversion coefficient of the frequency domain of the low frequency signal with the corrected time envelope shape.

例えば、前記低周波数信号の時間包絡形状が平坦と決定された場合、以下の処理により、低周波数信号の時間包絡形状を修正できる。
B_LO(m) (m=0,…,M_LO, M_LO≧1) (B_LO(0)≧0, B_LO(M_LO)<k_x)で境界を表されるM_LO個の任意の周波数帯域B_dec,LO(m)をにおいて、周波数方向に線形予測して線形予測係数α_p(m) (m=0,…,M_LO-1)を得て、所定の関数F_t(X_dec,LO(k))を、変換係数X_dec,LO(k)に対して線形予測逆フィルタ処理を施す

(N_pred≧1)で定義して、X’_dec,LO(k,i)を時間包絡形状が修正された低周波数信号の変換係数として出力する。 For example, when the time envelope shape of the low frequency signal is determined to be flat, the time envelope shape of the low frequency signal can be corrected by the following processing.
Any number of M _LOs whose boundaries are represented by B _LO (m) (m = 0,…, M _LO , M _LO ≧ 1) (B _LO (0) ≧ 0, B _LO (M _LO ) <k _x ) In the frequency band B _{dec, LO} (m) of, linear prediction is performed in the frequency direction to obtain the linear prediction coefficient α _p (m) (m = 0,…, M _LO -1), and the predetermined function F _t ( X _{dec, LO} (k)) is subjected to linear prediction inverse filtering on the conversion coefficients X _{dec, LO} (k).

Defined by (N _pred ≥ 1), _{X'dec, LO} (k, i) is output as the conversion coefficient of the low frequency signal with the corrected time envelope shape.

図19は、第3の実施形態に係る音声符号化装置22の構成を示す図である。音声符号化装置22の通信装置は、符号化の対象となる音声信号を外部から受信し、更に、符号化された符号化系列を外部に出力する。音声符号化装置22は、図19に示すように、機能的には、ダウンサンプリング部20a、コア符号化部20b、分析フィルタバンク部20c、制御パラメータ符号化部20d、包絡算出部20e、量子化/符号化部20f、時間包絡算出部22a及び22a1、時間包絡情報符号化部22b、符号化系列多重化部20h、及びコア復号信号生成部20iを備える。 FIG. 19 is a diagram showing a configuration of the voice coding device 22 according to the third embodiment. The communication device of the voice coding device 22 receives the voice signal to be coded from the outside, and further outputs the coded coding sequence to the outside. As shown in FIG. 19, the voice coding device 22 functionally includes a downsampling unit 20a, a core coding unit 20b, an analysis filter bank unit 20c, a control parameter coding unit 20d, an envelope calculation unit 20e, and a quantization unit. / It includes a coding unit 20f, a time envelope calculation unit 22a and 22a1, a time envelope information coding unit 22b, a coding sequence multiplexing unit 20h, and a core decoding signal generation unit 20i.

図20は、第3の実施形態に係る音声符号化装置22の動作を示すフローチャートである。 FIG. 20 is a flowchart showing the operation of the voice coding device 22 according to the third embodiment.

時間包絡算出部22aは、ダウンサンプリング部20aから得られるダウンサンプル信号の時間包絡を算出する（ステップ22-1）。 The time envelope calculation unit 22a calculates the time envelope of the downsample signal obtained from the downsampling unit 20a (step 22-1).

例えば、任意の時間セグメントt_t,E(l)≦i<t_t,E(l+1))内のダウンサンプル信号x_LO(i)の時間包絡E_LO(i)を、当該時間セグメント内で正規化したダウンサンプル信号のパワーとして算出できる。

ダウンサンプル信号の時間包絡は、ダウンサンプル信号の大きさの時間方向の変動がわかるパラメータであれば良く、前記の例に限定されない。 For example, the time envelope E _LO (i) of the downsample signal x _LO (i) in an arbitrary time segment t _{t, E} (l) ≤ i <t _{t, E} (l + 1)) is set in the time segment. It can be calculated as the power of the downsample signal normalized by.

The time envelope of the downsample signal is not limited to the above example, as long as it is a parameter that shows the variation of the magnitude of the downsample signal in the time direction.

時間包絡算出部22a1は、コア復号信号生成部20iにて生成されたコア復号信号の時間包絡を算出する（ステップ22-2）。コア復号信号の時間包絡は、前記ダウンサンプル信号の時間包絡と同様に算出できる。 The time envelope calculation unit 22a1 calculates the time envelope of the core decoding signal generated by the core decoding signal generation unit 20i (step 22-2). The time envelope of the core decoded signal can be calculated in the same manner as the time envelope of the downsample signal.

例えば、任意の時間セグメントt_t,E(l)≦i<t_t,E(l+1))内の前記コア復号信号x_dec,LO(i)の時間包絡E_dec,LO(i)を、当該時間セグメント内で正規化したコア復号信号のパワーとして算出できる。

コア復号信号の時間包絡は、コア復号信号の大きさの時間方向の変動がわかるパラメータであれば良く、前記の例に限定されない。 For example, the time envelope E _{dec, LO} (i) of the core decoding signal x _{dec, LO} (i) in an arbitrary time segment t _{t, E} (l) ≤ i <t _{t, E} (l + 1)). , Can be calculated as the power of the core decoded signal normalized within the time segment.

The time envelope of the core decoding signal is not limited to the above example as long as it is a parameter that shows the fluctuation of the magnitude of the core decoding signal in the time direction.

時間包絡情報符号化部22bは、時間包絡算出部22aで算出されたダウンサンプル信号の時間包絡と、時間包絡算出部22a1で算出されたコア復号信号の時間包絡とを用いて、時間包絡情報を算出し、当該時間包絡より時間包絡情報を符号化する（ステップS22-3）。 The time envelope information coding unit 22b uses the time envelope of the downsample signal calculated by the time envelope calculation unit 22a and the time envelope of the core decoded signal calculated by the time envelope calculation unit 22a1 to obtain the time envelope information. It is calculated and the time envelope information is encoded from the time envelope (step S22-3).

例えば、時間包絡情報符号化部22bは時間包絡情報として平坦の程度を表す情報を算出する。例えば、ダウンサンプル信号及びコア復号信号の時間包絡の分散またはそれに準ずるパラメータを算出する。さらに別の例では、ダウンサンプル信号及びコア復号信号のサブバンド信号の時間包絡の相加平均と相乗平均の比またはそれに準ずるパラメータを算出する。この場合、時間包絡情報符号化部22bは、時間包絡情報として当該ダウンサンプル信号の時間包絡の平坦さを表す情報を算出すればよく、前記の例に限定されない。そして、前記パラメータを符号化する。例えば、ダウンサンプル信号とコア復号信号の当該パラメータの差分値またはその絶対値を符号化する。さらに、例えば、ダウンサンプル信号の当該パラメータの値または絶対値を符号化する。例えば、時間包絡の平坦さを平坦か否かで表現すれば1ビットで符号化でき、例えば、前記任意の時間セグメントについては1ビットで符号化できる。時間包絡情報の符号化方法は前記の例に限定されない。 For example, the time-envelope information coding unit 22b calculates information indicating the degree of flatness as time-envelope information. For example, the variance of the time envelope of the downsample signal and the core decoding signal or a parameter equivalent thereto is calculated. In yet another example, the ratio of the arithmetic mean to the geometric mean of the time-envelope of the subband signals of the downsample signal and the core decoding signal or a parameter equivalent thereto is calculated. In this case, the time-envelope information coding unit 22b may calculate information representing the flatness of the time-envelope of the downsample signal as the time-envelope information, and is not limited to the above example. Then, the parameter is encoded. For example, the difference value or the absolute value of the parameter of the downsample signal and the core decoding signal is encoded. Further, for example, the value or absolute value of the parameter of the downsample signal is encoded. For example, if the flatness of the time envelope is expressed by whether it is flat or not, it can be encoded with 1 bit. For example, the arbitrary time segment can be encoded with 1 bit. The method for encoding the time envelope information is not limited to the above example.

さらに例えば、時間包絡情報符号化部22bは時間包絡情報として立ち上がりの程度を表す情報を算出する。例えば、任意の時間セグメントt_t,E(l)≦i<t_t,E(l+1)内において、ダウンサンプル信号の時間包絡の時間方向の差分値の最大値を算出する。

さらには、式（２３）において、時間包絡に代えて当該時間包絡を時間方向に平滑化したパラメータの時間方向の差分値の最大値を算出できる。この場合、時間包絡情報符号化部22bは、時間包絡情報として当該ダウンサンプル信号の時間包絡の立ち上がりの程度を表す情報を算出すればよく、前記の例に限定されない。そして、前記パラメータを符号化する。例えば、ダウンサンプル信号とコア復号信号の当該パラメータの差分値またはその絶対値を符号化する。例えば、時間包絡の立ち上がりの程度を立ち上がりか否かで表現すれば1ビットで符号化でき、例えば、前記任意の時間セグメントについては1ビットで符号化できる。時間包絡情報の符号化方法は前記の例に限定されない。 Further, for example, the time-envelope information coding unit 22b calculates information indicating the degree of rise as time-envelope information. For example, within an arbitrary time segment t _{t, E} (l) ≤ i <t _{t, E} (l + 1), the maximum value of the difference value in the time direction of the time envelope of the downsample signal is calculated.

Further, in the equation (23), instead of the time envelope, the maximum value of the difference value in the time direction of the parameter obtained by smoothing the time envelope in the time direction can be calculated. In this case, the time-envelope information coding unit 22b may calculate information indicating the degree of rise of the time-envelope of the downsample signal as the time-envelope information, and is not limited to the above example. Then, the parameter is encoded. For example, the difference value or the absolute value of the parameter of the downsample signal and the core decoding signal is encoded. For example, if the degree of rise of the time envelope is expressed by whether it rises or not, it can be encoded with 1 bit. For example, the arbitrary time segment can be encoded with 1 bit. The method for encoding the time envelope information is not limited to the above example.

さらに例えば、時間包絡情報符号化部20gは時間包絡情報として立ち下がりの程度を表す情報を算出する。例えば、任意の時間セグメントt_t,E(l)≦i<t_t,E(l+1)内において、低周波数信号のサブバンド信号の時間包絡の時間方向の差分値の最小値を算出する。

さらには、式（２４）において、時間包絡に代えて当該時間包絡を時間方向に平滑化したパラメータの時間方向の差分値の最小値を算出できる。この場合、時間包絡情報符号化部22bは、時間包絡情報として当該ダウンサンプル信号の時間包絡の立ち下がりの程度を表す情報を算出すればよく、前記の例に限定されない。そして、前記パラメータを符号化する。例えば、ダウンサンプル信号とコア復号信号の当該パラメータの差分値またはその絶対値を符号化する。例えば、時間包絡の立ち下がりの程度を立ち下がりか否かで表現すれば1ビットで符号化でき、例えば、前記任意の時間セグメントについては1ビットで符号化できる。時間包絡情報の符号化方法は前記の例に限定されない。 Further, for example, the time-envelope information coding unit 20g calculates information indicating the degree of fall as time-envelope information. For example, within an arbitrary time segment t _{t, E} (l) ≤ i <t _{t, E} (l + 1), the minimum value of the difference value in the time direction of the time envelope of the subband signal of the low frequency signal is calculated. ..

Further, in the equation (24), the minimum value of the difference value in the time direction of the parameter obtained by smoothing the time envelope in the time direction can be calculated instead of the time envelope. In this case, the time-envelope information coding unit 22b may calculate as the time-envelope information information indicating the degree of the fall of the time-envelope of the downsample signal, and is not limited to the above example. Then, the parameter is encoded. For example, the difference value or the absolute value of the parameter of the downsample signal and the core decoding signal is encoded. For example, if the degree of the fall of the time envelope is expressed by whether or not it falls, it can be encoded with 1 bit. For example, the arbitrary time segment can be encoded with 1 bit. The method for encoding the time envelope information is not limited to the above example.

前記時間包絡情報として平坦の程度、立ち上がりの程度、及び立下りの程度を表す情報を算出する例において、ダウンサンプル信号及びコア復号信号の時間包絡のうち一方のみを用いる場合においては、他方の時間包絡の算出のみに係る各部及び各処理を省略することができる。 In the example of calculating the information representing the degree of flatness, the degree of rising edge, and the degree of falling edge as the time envelope information, when only one of the time envelopes of the downsample signal and the core decoding signal is used, the other time is used. Each part and each process related only to the calculation of the envelope can be omitted.

［第3の実施形態の音声符号化装置の第1の変形例］
図21は、第3の実施形態に係る音声符号化装置の第1の変形例22Aの構成を示す図である。 [First modification of the voice coding device of the third embodiment]
FIG. 21 is a diagram showing a configuration of a first modification 22A of the voice coding device according to the third embodiment.

図22は、第3の実施形態に係る音声符号化装置の第1の変形例22Aの動作を示すフローチャートである。 FIG. 22 is a flowchart showing the operation of the first modification 22A of the voice coding device according to the third embodiment.

時間包絡情報符号化部22bAは、時間包絡算出部22aにて算出されたダウンサンプル信号の時間包絡より時間包絡情報を算出し、当該時間包絡情報を符号化する（ステップS22-3a）。 The time envelope information coding unit 22bA calculates the time envelope information from the time envelope of the downsample signal calculated by the time envelope calculation unit 22a, and encodes the time envelope information (step S22-3a).

例えば、時間包絡情報として、時間包絡形状の平坦さの程度を表す情報を算出する。例えば、任意の時間セグメントt_t,E(l)≦i<t_t,E(l+1)内のダウンサンプル信号x_LO(i) (t_t,E(l)≦i<t_t,E(l+1))の時間包絡E_LO(i)を式（２１）により算出する。また時間包絡E_LO(i)の算出方法は式（２１）に限定されない。時間包絡E_LO(i)の分散またはそれに準ずるパラメータを算出し、当該パラメータを符号化する。さらに別の例では、時間包絡E_LO(i)の相加平均と相乗平均の比またはそれに準ずるパラメータを算出し、当該パラメータを符号化する。ダウンサンプル信号の時間包絡形状の平坦さの程度を表す情報の算出方法は上記の例に限定されない。 For example, as the time envelope information, information indicating the degree of flatness of the time envelope shape is calculated. For example, the downsampling signal in any time segment t _{t, E} (l) ≤ i <t _{t, E} (l + 1) x _LO (i) (t _{t, E} (l) ≤ i <t _{t, E} The time envelope E _LO (i) of (l + 1)) is calculated by Eq. (21). Further, the calculation method of the time envelope E _LO (i) is not limited to the equation (21). Calculate the variance of the time envelope E _LO (i) or a parameter equivalent thereto, and encode the parameter. In yet another example, the ratio of the arithmetic mean to the geometric mean of the time-envelope E _LO (i) or a parameter equivalent thereto is calculated and the parameter is encoded. The method of calculating the information indicating the degree of flatness of the time-envelope shape of the downsample signal is not limited to the above example.

さらに、例えば、時間包絡情報として、時間包絡形状の立ち上がりの程度を表す情報を算出する。例えば、時間包絡E_LO(i)のの時間方向の差分値を算出し、当該差分値の任意の時間セグメント内の最大値を算出し符号化する。ダウンサンプル信号の時間包絡形状を立ち上がりの程度を表す情報の算出方法は上記の例に限定されない。 Further, for example, as the time envelope information, information indicating the degree of rise of the time envelope shape is calculated. For example, the difference value in the time direction of the time envelope E _LO (i) is calculated, and the maximum value of the difference value in an arbitrary time segment is calculated and encoded. The method of calculating the information indicating the degree of rise of the time envelope shape of the downsample signal is not limited to the above example.

さらに、例えば、時間包絡情報として、時間包絡形状の立ち下がりの程度を表す情報を算出する。例えば、時間包絡E_LO(i)のの時間方向の差分値を算出し、当該差分値の任意の時間セグメント内の最小値を算出し符号化する。ダウンサンプル信号の時間包絡形状を立ち下がりの程度を表す情報の算出方法は上記の例に限定されない。 Further, for example, as the time envelope information, information indicating the degree of falling of the time envelope shape is calculated. For example, the difference value in the time direction of the time envelope E _LO (i) is calculated, and the minimum value of the difference value in an arbitrary time segment is calculated and encoded. The method of calculating the information indicating the degree of falling of the time envelope shape of the downsample signal is not limited to the above example.

［第3の実施形態の音声符号化装置の第2の変形例］
図23は、第3の実施形態に係る音声符号化装置の第2の変形例22Bの構成を示す図である。 [Second variant of the voice coding device of the third embodiment]
FIG. 23 is a diagram showing a configuration of a second modification 22B of the voice coding device according to the third embodiment.

図24は、第3の実施形態に係る音声符号化装置の第2の変形例22Bの動作を示すフローチャートである。 FIG. 24 is a flowchart showing the operation of the second modification 22B of the voice coding device according to the third embodiment.

時間包絡算出部22aBは、入力音声信号の時間包絡を算出する（ステップ22-1b）。 The time envelope calculation unit 22aB calculates the time envelope of the input voice signal (step 22-1b).

例えば、任意の時間セグメントt_t,E(l)≦i<t_t,E(l+1))内の前記入力信号x(i)の時間包絡E(i)を、当該時間セグメント内で正規化した入力信号のパワーとして算出できる。

入力信号の時間包絡は、入力信号の大きさの時間方向の変動がわかるパラメータであれば良く、前記の例に限定されない。 For example, the time envelope E (i) of the input signal x (i) in an arbitrary time segment t _{t, E} (l) ≤ i <t _{t, E} (l + 1)) is normalized in the time segment. It can be calculated as the power of the input signal.

The time envelope of the input signal is not limited to the above example, as long as it is a parameter that shows the fluctuation of the magnitude of the input signal in the time direction.

時間包絡情報符号化部22bBは、時間包絡算出部22aBにて算出された入力音声信号の時間包絡より時間包絡情報を算出し、当該時間包絡情報を符号化する（ステップS22-3b）。 The time envelope information coding unit 22bB calculates the time envelope information from the time envelope of the input voice signal calculated by the time envelope calculation unit 22aB, and encodes the time envelope information (step S22-3b).

例えば、時間包絡情報として、時間包絡形状の平坦さの程度を表す情報を算出する。例えば、任意の時間セグメントt_t,E(l)≦i<t_t,E(l+1)内の入力信号x(i) (t_t,E(l)≦i<t_t,E(l+1))の時間包絡E(i)を式（２５）により算出する。また時間包絡E(i)の算出方法は式（２５）に限定されない。時間包絡E(i)の分散またはそれに準ずるパラメータを算出し、当該パラメータを符号化する。さらに別の例では、時間包絡E(i)の相加平均と相乗平均の比またはそれに準ずるパラメータを算出し、当該パラメータを符号化する。入力信号の時間包絡形状の平坦さの程度を表す情報の算出方法は上記の例に限定されない。 For example, as the time envelope information, information indicating the degree of flatness of the time envelope shape is calculated. For example, the input signal x (i) (t _{t, E} (l) ≤ i <t _{t, E} (l) in any time segment t _{t, E} (l) ≤ i <t _{t, E} (l + 1) The time envelope E (i) of +1)) is calculated by Eq. (25). Further, the calculation method of the time envelope E (i) is not limited to the equation (25). Calculate the variance of the time envelope E (i) or a parameter equivalent thereto, and encode the parameter. In yet another example, the ratio of the arithmetic mean to the geometric mean of the time envelope E (i) or a parameter equivalent thereto is calculated and the parameter is encoded. The method of calculating the information indicating the degree of flatness of the time envelope shape of the input signal is not limited to the above example.

さらに、例えば、時間包絡情報として、時間包絡形状の立ち上がりの程度を表す情報を算出する。例えば、時間包絡E(i)のの時間方向の差分値を算出し、当該差分値の任意の時間セグメント内の最大値を算出し符号化する。入力信号の時間包絡形状を立ち上がりの程度を表す情報の算出方法は上記の例に限定されない。 Further, for example, as the time envelope information, information indicating the degree of rise of the time envelope shape is calculated. For example, the difference value in the time direction of the time envelope E (i) is calculated, and the maximum value of the difference value in an arbitrary time segment is calculated and encoded. The method of calculating the information indicating the degree of rise of the time envelope shape of the input signal is not limited to the above example.

さらに、例えば、時間包絡情報として、時間包絡形状の立ち下がりの程度を表す情報を算出する。例えば、時間包絡E(i)のの時間方向の差分値を算出し、当該差分値の任意の時間セグメント内の最小値を算出し符号化する。入力信号の時間包絡形状を立ち下がりの程度を表す情報の算出方法は上記の例に限定されない。 Further, for example, as the time envelope information, information indicating the degree of falling of the time envelope shape is calculated. For example, the difference value in the time direction of the time envelope E (i) is calculated, and the minimum value of the difference value in an arbitrary time segment is calculated and encoded. The method of calculating the information indicating the degree of falling of the time envelope shape of the input signal is not limited to the above example.

当該第3の実施形態の低周波数時間包絡形状決定部10eに対して、本発明の第1の実施形態の第1、第2、および第3の変形例が適用できることは明白である。 It is clear that the first, second, and third modifications of the first embodiment of the present invention can be applied to the low frequency time envelope shape determining unit 10e of the third embodiment.

［第4の実施形態］
図25は、第4の実施形態に係る音声復号装置13の構成を示す図である。音声復号装置13の通信装置は、下記音声符号化装置23から出力される多重化された符号化系列を受信し、更に、復号した音声信号を外部に出力する。音声復号装置13は、図25に示すように、機能的には、符号化系列逆多重化部10aA、コア復号部10b、分析フィルタバンク部10c、符号化系列解析部13c、高周波数時間包絡形状決定部13a、時間包絡修正部13b、高周波数信号生成部10g、復号/逆量子化部10h、周波数包絡調整部10i、及び合成フィルタバンク部10jを備える。 [Fourth Embodiment]
FIG. 25 is a diagram showing a configuration of the audio decoding device 13 according to the fourth embodiment. The communication device of the voice decoding device 13 receives the multiplexed coding sequence output from the following voice coding device 23, and further outputs the decoded voice signal to the outside. As shown in FIG. 25, the voice decoding device 13 functionally has a coded sequence demultiplexing section 10aA, a core decoding section 10b, an analysis filter bank section 10c, a coded sequence analysis section 13c, and a high frequency time wrapping shape. It includes a determination unit 13a, a time wrapping correction unit 13b, a high frequency signal generation unit 10g, a decoding / dequantization unit 10h, a frequency wrapping adjustment unit 10i, and a synthesis filter bank unit 10j.

図26は、第4の実施形態に係る音声復号装置13の動作を示すフローチャートである。 FIG. 26 is a flowchart showing the operation of the voice decoding device 13 according to the fourth embodiment.

符号化系列解析部13cは、符号化系列逆多重化部10aAで分割された符号化系列の帯域拡張部分を解析し、高周波数信号生成部10g、復号/逆量子化部10h、高周波数時間包絡形状決定部13aで必要な情報に分割する（ステップS13-3）。 The coded sequence analysis unit 13c analyzes the band expansion portion of the coded sequence divided by the coded sequence demultiplexing unit 10aA, and analyzes the high frequency signal generation unit 10g, the decoding / dequantization unit 10h, and the high frequency time entrapment. The shape determination unit 13a divides the frequency into necessary information (step S13-3).

高周波数時間包絡形状決定部13aは、符号化系列解析部13cから高周波時間包絡形状に関する情報を受け取り、当該情報に基づき高周波数信号の時間包絡形状を決定する（ステップS13-1）。例えば、高周波数信号の時間包絡形状を平坦と決定する。さらに、例えば、高周波数信号の時間包絡形状を立ち上がりと決定する。さらに、例えば、高周波数信号の時間包絡形状を立ち下がりと決定する。 The high frequency time envelope shape determination unit 13a receives information on the high frequency time envelope shape from the coded sequence analysis unit 13c, and determines the time envelope shape of the high frequency signal based on the information (step S13-1). For example, the time envelope shape of a high frequency signal is determined to be flat. Further, for example, the time envelope shape of the high frequency signal is determined to be the rising edge. Further, for example, the time-envelope shape of the high frequency signal is determined to be falling.

時間包絡修正部13bは、高周波数時間包絡形状決定部13aで決定した時間包絡形状に基づいて、分析フィルタバンク部10cから出力され、高周波数信号生成部10gにて高周波数信号の生成に利用する低周波数信号の複数のサブバンド信号の時間包絡の形状を修正する（ステップS13-2）。 The time wrapping correction unit 13b is output from the analysis filter bank unit 10c based on the time wrapping shape determined by the high frequency time wrapping shape determination unit 13a, and is used by the high frequency signal generation unit 10g to generate a high frequency signal. Correct the shape of the time wrapping of multiple subband signals of the low frequency signal (step S13-2).

例えば、前記高周波数信号の時間包絡形状が平坦と決定された場合、例えば、高周波数信号の生成に利用する低周波数信号に対して、低周波数時間包絡修正部10fが前記低周波数信号の時間包絡形状を平坦にする処理と同様の処理により、高周波数信号の生成に利用する低周波数信号の時間包絡形状を修正できる。 For example, when the time wrapping shape of the high frequency signal is determined to be flat, for example, the low frequency time wrapping correction unit 10f causes the time wrapping of the low frequency signal with respect to the low frequency signal used for generating the high frequency signal. By the same process as the process of flattening the shape, the time-wrapping shape of the low frequency signal used for generating the high frequency signal can be corrected.

さらに、例えば、前記高周波数信号の時間包絡形状が立ち上がりと決定された場合、例えば、低周波数時間包絡修正部10fが前記低周波数信号の時間包絡形状を立ち上がりにする処理と同様の処理により、高周波数信号の生成に利用する低周波数信号の時間包絡形状を修正できる。 Further, for example, when the time-envelope shape of the high-frequency signal is determined to rise, for example, the low-frequency time-envelope correction unit 10f performs a process similar to the process of making the time-envelope shape of the low-frequency signal rise. The time envelope shape of the low frequency signal used to generate the frequency signal can be modified.

さらに、例えば、前記高周波数信号の時間包絡形状が立ち下がりと決定された場合、例えば、低周波数時間包絡修正部10fが前記低周波数信号の時間包絡形状を立ち下がりにする処理と同様の処理により、高周波数信号の生成に利用する低周波数信号の時間包絡形状を修正できる。 Further, for example, when the time envelope shape of the high frequency signal is determined to be falling, for example, the process similar to the process in which the low frequency time envelope correction unit 10f makes the time envelope shape of the low frequency signal fall. , The time envelope shape of the low frequency signal used to generate the high frequency signal can be modified.

高周波数信号の生成に利用する低周波数信号の時間包絡形状を修正する処理は、上記の例に限定されない。 The process of modifying the time envelope shape of the low frequency signal used to generate the high frequency signal is not limited to the above example.

図27は、第4の実施形態に係る音声符号化装置23の構成を示す図である。音声符号化装置23の通信装置は、符号化の対象となる音声信号を外部から受信し、更に、符号化された符号化系列を外部に出力する。音声符号化装置23は、図27に示すように、機能的には、ダウンサンプリング部20a、コア符号化部20b、分析フィルタバンク部20c及び20c1、制御パラメータ符号化部20d、包絡算出部20e、量子化/符号化部20f、時間包絡情報符号化部23a、符号化系列多重化部20h、サブバンド信号パワー算出部20j、及びコア復号信号生成部20iを備える。 FIG. 27 is a diagram showing a configuration of the voice coding device 23 according to the fourth embodiment. The communication device of the voice coding device 23 receives the voice signal to be coded from the outside, and further outputs the coded coding sequence to the outside. As shown in FIG. 27, the voice coding device 23 functionally includes a downsampling unit 20a, a core coding unit 20b, an analysis filter bank unit 20c and 20c1, a control parameter coding unit 20d, and an entrapment calculation unit 20e. It includes a quantization / coding unit 20f, a time-wrapping information coding unit 23a, a coding sequence multiplexing unit 20h, a subband signal power calculation unit 20j, and a core decoding signal generation unit 20i.

図28は、第4の実施形態に係る音声符号化装置23の動作を示すフローチャートである。 FIG. 28 is a flowchart showing the operation of the voice coding device 23 according to the fourth embodiment.

時間包絡情報符号化部23aは、高周波数信号の生成に利用する低周波数信号の時間包絡と高周波数信号の時間包絡のうち少なくとも一つ以上を算出し、さらにサブバンド信号パワー算出部20jにて算出されたコア復号信号のサブバンド信号のパワーを用いてコア復号信号の時間包絡を算出し、当該低周波数信号の時間包絡及び高周波数信号の時間包絡のうち少なくとも一つ以上とコア復号信号の時間包絡より時間包絡情報を符号化する（ステップS23-1）。低周波数信号の時間包絡は、包絡算出部20eにて算出した低周波数信号のサブバンド信号のパワーを用いて低周波数信号の時間包絡を算出する。高周波数信号の時間包絡は、包絡算出部20eにて算出した高周波数信号のサブバンド信号のパワーを用いて高周波数信号の時間包絡を算出する。当該処理において、低周波数信号のサブバンド信号のパワーが算出されていない場合は、時間包絡情報符号化部23aにて低周波数信号のサブバンド信号のパワーを算出でき、低周波数信号のサブバンド信号のパワーがどこで算出されるかは限定されない。さらには、高周波数信号のサブバンド信号のパワーが算出されていない場合は、時間包絡情報符号化部23aにて高周波数信号のサブバンド信号のパワーを算出でき、高周波数信号のサブバンド信号のパワーがどこで算出されるかは限定されない。 The time wrapping information coding unit 23a calculates at least one of the time wrapping of the low frequency signal and the time wrapping of the high frequency signal used to generate the high frequency signal, and the subband signal power calculation unit 20j further calculates. The time wrapping of the core decoding signal is calculated using the power of the subband signal of the calculated core decoding signal, and at least one of the time wrapping of the low frequency signal and the time wrapping of the high frequency signal and the core decoding signal Encode the time-related information from the time-related (step S23-1). For the time envelope of the low frequency signal, the time envelope of the low frequency signal is calculated by using the power of the subband signal of the low frequency signal calculated by the envelope calculation unit 20e. For the time envelope of the high frequency signal, the time envelope of the high frequency signal is calculated by using the power of the subband signal of the high frequency signal calculated by the envelope calculation unit 20e. If the power of the subband signal of the low frequency signal is not calculated in the process, the power of the subband signal of the low frequency signal can be calculated by the time wrapping information coding unit 23a, and the subband signal of the low frequency signal is calculated. There is no limit to where the power of is calculated. Furthermore, when the power of the subband signal of the high frequency signal is not calculated, the power of the subband signal of the high frequency signal can be calculated by the time wrapping information coding unit 23a, and the subband signal of the high frequency signal can be calculated. There is no limit to where the power is calculated.

例えば、時間包絡情報符号化部20gが前記低周波数信号の時間包絡を算出する処理と同様の処理により、当該高周波数信号の生成に利用する低周波数信号の時間包絡を算出できる。高周波数信号の生成に利用する低周波数信号のサブバンド信号の時間包絡は、当該低周波数信号のサブバンド信号の大きさの時間方向の変動がわかるパラメータであれば良く、前記の例に限定されない。 For example, the time envelope of the low frequency signal used for generating the high frequency signal can be calculated by the same process as the process in which the time envelope information coding unit 20g calculates the time envelope of the low frequency signal. The time envelope of the subband signal of the low frequency signal used for generating the high frequency signal may be a parameter that shows the variation in the magnitude of the subband signal of the low frequency signal in the time direction, and is not limited to the above example. ..

また、例えば、時間包絡情報符号化部21aが前記高周波数信号の時間包絡を算出する処理と同様の処理により、当該高周波数信号の時間包絡を算出できる。高周波数信号のサブバンド信号の時間包絡は、当該高周波数信号のサブバンド信号の大きさの時間方向の変動がわかるパラメータであれば良く、前記の例に限定されない。 Further, for example, the time envelope of the high frequency signal can be calculated by the same process as the process in which the time envelope information coding unit 21a calculates the time envelope of the high frequency signal. The time envelope of the subband signal of the high frequency signal may be any parameter as long as it is a parameter that shows the variation in the magnitude of the subband signal of the high frequency signal in the time direction, and is not limited to the above example.

例えば、時間包絡情報符号化部20gが時間包絡情報として平坦の程度を表す情報を算出する処理において、前記低周波数信号サブバンド信号の時間包絡の代わりに、当該高周波数信号の生成に利用する低周波数信号のサブバンド信号の時間包絡を用いることにより、時間包絡情報として平坦の程度を表す情報を算出でき、また当該時間包絡情報を符号化できる。さらには、例えば、時間包絡情報符号化部20gが時間包絡情報として平坦の程度を表す情報を算出する処理において、前記低周波数信号サブバンド信号の時間包絡の代わりに、当該高周波数信号のサブバンド信号の時間包絡を用いることにより、時間包絡情報として平坦の程度を表す情報を算出でき、また当該時間包絡情報を符号化できる。例えば、時間包絡の平坦の程度を平坦か否かで表現すれば1ビットで符号化できる。 For example, in a process in which the time-wrapping information coding unit 20g calculates information indicating the degree of flatness as time-wrapping information, the low frequency signal used to generate the high-frequency signal instead of the time-wrapping of the low-frequency signal subband signal. By using the time wrapping of the subband signal of the frequency signal, information indicating the degree of flatness can be calculated as the time wrapping information, and the time wrapping information can be encoded. Further, for example, in the process of calculating the information indicating the degree of flatness as the time-envelope information by the time-envelope information coding unit 20g, the sub-band of the high-frequency signal is replaced with the time-envelope of the low-frequency signal sub-band signal. By using the time envelope of the signal, information indicating the degree of flatness can be calculated as the time envelope information, and the time envelope information can be encoded. For example, if the degree of flatness of the time envelope is expressed by whether it is flat or not, it can be encoded with 1 bit.

さらに、例えば、時間包絡情報符号化部20gが時間包絡情報として立ち上がりの程度を表す情報を算出する処理において、前記低周波数信号サブバンド信号の時間包絡の代わりに、当該高周波数信号の生成に利用する低周波数信号のサブバンド信号の時間包絡を用いることにより、時間包絡情報として立ち上がりの程度を表す情報を算出でき、また当該時間包絡情報を符号化できる。さらには、例えば、時間包絡情報符号化部20gが時間包絡情報として立ち上がりの程度を表す情報を算出する処理において、前記低周波数信号サブバンド信号の時間包絡の代わりに、当該高周波数信号のサブバンド信号の時間包絡を用いることにより、時間包絡情報として立ち上がりの程度を表す情報を算出でき、また当該時間包絡情報を符号化できる。例えば、時間包絡の立ち上がりの程度を立ち上がりか否かで表現すれば1ビットで符号化できる。 Further, for example, in a process in which the time-wrapping information coding unit 20g calculates information indicating the degree of rise as time-wrapping information, it is used to generate the high-frequency signal instead of the time-wrapping of the low-frequency signal subband signal. By using the time wrapping of the subband signal of the low frequency signal, it is possible to calculate the information indicating the degree of rise as the time wrapping information, and to encode the time wrapping information. Further, for example, in a process in which the time-wrapping information coding unit 20g calculates information indicating the degree of rise as time-wrapping information, instead of the time-wrapping of the low-frequency signal sub-band signal, the sub-band of the high-frequency signal is used. By using the time wrapping of the signal, information indicating the degree of rise can be calculated as the time wrapping information, and the time wrapping information can be encoded. For example, if the degree of rise of the time envelope is expressed by whether it rises or not, it can be encoded with 1 bit.

さらに、例えば、時間包絡情報符号化部20gが時間包絡情報として立ち下がりの程度を表す情報を算出する処理において、前記低周波数信号サブバンド信号の時間包絡の代わりに、当該高周波数信号の生成に利用する低周波数信号のサブバンド信号の時間包絡を用いることにより、時間包絡情報として立ち下がりの程度を表す情報を算出でき、また当該時間包絡情報を符号化できる。さらには、例えば、時間包絡情報符号化部20gが時間包絡情報として立ち下がりの程度を表す情報を算出する処理において、前記低周波数信号サブバンド信号の時間包絡の代わりに、当該高周波数信号のサブバンド信号の時間包絡を用いることにより、時間包絡情報として立ち下がりの程度を表す情報を算出でき、また当該時間包絡情報を符号化できる。例えば、時間包絡の立ち下がりの程度を立ち下がりか否かで表現すれば1ビットで符号化できる。 Further, for example, in a process in which the time-wrapping information coding unit 20g calculates information indicating the degree of fall as time-wrapping information, instead of the time-wrapping of the low-frequency signal subband signal, the high-frequency signal is generated. By using the time wrapping of the subband signal of the low frequency signal to be used, information indicating the degree of falling can be calculated as the time wrapping information, and the time wrapping information can be encoded. Further, for example, in a process in which the time-wrapping information coding unit 20g calculates information indicating the degree of fall as time-wrapping information, instead of the time-wrapping of the low-frequency signal sub-band signal, the sub of the high-frequency signal is used. By using the time wrapping of the band signal, information indicating the degree of falling can be calculated as the time wrapping information, and the time wrapping information can be encoded. For example, if the degree of the fall of the time envelope is expressed by whether it falls or not, it can be encoded with 1 bit.

なお、時間包絡情報の算出方法、及び符号化方法は前記の例に限定されない。 The time-envelope information calculation method and coding method are not limited to the above examples.

［第4の実施形態の音声復号装置の第1の変形例］
図29は、第4の実施形態に係る音声復号装置の第1の変形例13Aの構成を示す図である。 [First variant of the audio decoding device of the fourth embodiment]
FIG. 29 is a diagram showing a configuration of a first modification 13A of the audio decoding device according to the fourth embodiment.

図30は、第4の実施形態に係る音声復号装置の第1の変形例13Aの動作を示すフローチャートである。 FIG. 30 is a flowchart showing the operation of the first modification 13A of the audio decoding device according to the fourth embodiment.

高周波数時間包絡形状決定部13aAは、コア復号部10bから低周波数信号を受け取り、当該低周波数信号に基づいて高周波数時間包絡形状を決定する（ステップS13-1a）。 The high frequency time envelope shape determining unit 13aA receives the low frequency signal from the core decoding unit 10b and determines the high frequency time envelope shape based on the low frequency signal (step S13-1a).

例えば、低周波数信号の時間包絡を算出し、当該低周波数時間包絡の形状に基づいて高周波数時間包絡形状を決定する。さらに、例えば、低周波数信号に所定の処理を施した信号の時間包絡を算出し、当該処理済低周波数信号の時間包絡の形状に基づいて高周波数時間包絡形状を決定する。前記所定の処理は、例えばハイパスフィルタ処理であるが、これに限定されない。 For example, the time envelope of the low frequency signal is calculated, and the high frequency time envelope shape is determined based on the shape of the low frequency time envelope. Further, for example, the time envelope of a signal obtained by subjecting a low frequency signal to a predetermined process is calculated, and the high frequency time envelope shape is determined based on the shape of the time envelope of the processed low frequency signal. The predetermined process is, for example, a high-pass filter process, but is not limited thereto.

例えば、高周波数信号の時間包絡形状を平坦と決定する。例えば、低周波数時間包絡形状決定部10eAが前記低周波数信号の時間包絡形状を平坦と決定する処理と同様に高周波数信号の時間包絡形状を平坦と決定できる。さらに、低周波数時間包絡形状決定部10eAが前記低周波数信号の時間包絡形状を平坦と決定する処理において、前記低周波数信号の時間包絡の代わりに、前記処理済低周波数信号の時間包絡を用いて、高周波数信号の時間包絡形状を平坦と決定できる。高周波数信号の時間包絡形状を平坦と決定する処理は上記の例に限定されない。 For example, the time envelope shape of a high frequency signal is determined to be flat. For example, the low frequency time envelope shape determining unit 10eA can determine the time envelope shape of the high frequency signal as flat in the same manner as the process of determining the time envelope shape of the low frequency signal as flat. Further, in the process in which the low frequency time envelope shape determining unit 10eA determines that the time envelope shape of the low frequency signal is flat, the time envelope of the processed low frequency signal is used instead of the time envelope of the low frequency signal. , The time envelope shape of the high frequency signal can be determined to be flat. The process of determining the time envelope shape of a high frequency signal to be flat is not limited to the above example.

さらに、例えば、高周波数信号の時間包絡形状を立ち上がりと決定する。例えば、低周波数時間包絡形状決定部10eAが前記低周波数信号の時間包絡形状を立ち上がりと決定する処理と同様に高周波数信号の時間包絡形状を立ち上がりと決定できる。さらに、低周波数時間包絡形状決定部10eAが前記低周波数信号の時間包絡形状を立ち上がりと決定する処理において、前記低周波数信号の時間包絡の代わりに、前記処理済低周波数信号の時間包絡を用いて、高周波数信号の時間包絡形状を立ち上がりと決定できる。高周波数信号の時間包絡形状を立ち上がりと決定する処理は上記の例に限定されない。 Further, for example, the time envelope shape of the high frequency signal is determined to be the rising edge. For example, the low frequency time envelope shape determining unit 10eA can determine the time envelope shape of the high frequency signal as the rising edge in the same manner as the process of determining the time envelope shape of the low frequency signal as the rising edge. Further, in the process in which the low frequency time envelope shape determining unit 10eA determines the time envelope shape of the low frequency signal as the rising edge, the time envelope of the processed low frequency signal is used instead of the time envelope of the low frequency signal. , The time-envelope shape of the high frequency signal can be determined as the rising edge. The process of determining the time envelope shape of the high frequency signal as the rising edge is not limited to the above example.

さらに、例えば、高周波数信号の時間包絡形状を立ち下がりと決定する。例えば、低周波数時間包絡形状決定部10eAが前記低周波数信号の時間包絡形状を立ち下がりと決定する処理と同様に高周波数信号の時間包絡形状を立ち下がりと決定できる。さらに、低周波数時間包絡形状決定部10eAが前記低周波数信号の時間包絡形状を立ち下がりと決定する処理において、前記低周波数信号の時間包絡の代わりに、前記処理済低周波数信号の時間包絡を用いて、高周波数信号の時間包絡形状を立ち下がりと決定できる。高周波数信号の時間包絡形状を立ち下がりと決定する処理は上記の例に限定されない。 Further, for example, the time-envelope shape of the high frequency signal is determined to be falling. For example, the low frequency time envelope shape determining unit 10eA can determine the time envelope shape of the high frequency signal as the falling edge in the same manner as the process of determining the time envelope shape of the low frequency signal as the falling edge. Further, in the process in which the low frequency time envelope shape determining unit 10eA determines the time envelope shape of the low frequency signal as a falling edge, the time envelope of the processed low frequency signal is used instead of the time envelope of the low frequency signal. Therefore, the time-envelope shape of the high frequency signal can be determined to be the falling edge. The process of determining the time-envelope shape of a high-frequency signal as a falling edge is not limited to the above example.

［第4の実施形態の音声復号装置の第2の変形例］
図31は、第4の実施形態に係る音声復号装置の第2の変形例13Bの構成を示す図である。 [Second variant of the audio decoding device of the fourth embodiment]
FIG. 31 is a diagram showing a configuration of a second modification 13B of the audio decoding device according to the fourth embodiment.

第4の実施形態に係る音声復号装置の第1の変形例13Aとの相違点は、高周波数時間包絡形状決定部13aBは、分析フィルタバンク部10cから低周波数信号の複数のサブバンド信号を受け取り、当該低周波数信号の複数のサブバンド信号に基づいて高周波数信号の時間包絡形状を決定する点である（ステップS13-1aに相当の処理）。 The difference from the first modification 13A of the voice decoding apparatus according to the fourth embodiment is that the high frequency time entrainment shape determining unit 13aB receives a plurality of subband signals of low frequency signals from the analysis filter bank unit 10c. , The point is that the time entrainment shape of the high frequency signal is determined based on the plurality of subband signals of the low frequency signal (process corresponding to step S13-1a).

例えば、低周波数信号の少なくとも一つ以上のサブバンド信号の時間包絡を算出し、当該低周波数サブバンド信号時間包絡の形状に基づいて高周波数時間包絡形状を決定する。 For example, the time envelope of at least one or more subband signals of the low frequency signal is calculated, and the high frequency time envelope shape is determined based on the shape of the low frequency subband signal time envelope.

例えば、高周波数信号の時間包絡形状を平坦と決定する。例えば、低周波数時間包絡形状決定部10eBが前記低周波数信号の時間包絡形状を平坦と決定する処理と同様にして、高周波数信号の時間包絡形状を平坦と決定できる。この際、周波数帯域の境界を表すB_LO(m)は、例えば、比較的高い周波数の周波数帯域のみを定義するなどとして、低周波数時間包絡形状決定部10eBと異ならせることができる。高周波数信号の時間包絡形状を平坦と決定する処理は上記の例に限定されない。 For example, the time envelope shape of a high frequency signal is determined to be flat. For example, the time envelope shape of the high frequency signal can be determined to be flat in the same manner as the process in which the low frequency time envelope shape determining unit 10eB determines the time envelope shape of the low frequency signal to be flat. At this time, the B _LO (m) representing the boundary of the frequency band can be different from the low frequency time envelope shape determining unit 10eB, for example, by defining only the frequency band having a relatively high frequency. The process of determining the time envelope shape of a high frequency signal to be flat is not limited to the above example.

さらに、例えば、高周波数信号の時間包絡形状を立ち上がりと決定する。例えば、低周波数時間包絡形状決定部10eBが前記低周波数信号の時間包絡形状を立ち上がりと決定する処理と同様にして、高周波数信号の時間包絡形状を立ち上がりと決定できる。この際、周波数帯域の境界を表すB_LO(m)は、例えば、比較的高い周波数の周波数帯域のみを定義するなどとして、低周波数時間包絡形状決定部10eBと異ならせることができる。高周波数信号の時間包絡形状を立ち上がりと決定する処理は上記の例に限定されない。 Further, for example, the time envelope shape of the high frequency signal is determined to be the rising edge. For example, the time envelope shape of the high frequency signal can be determined as the rising edge in the same manner as the process in which the low frequency time envelope shape determining unit 10eB determines the time envelope shape of the low frequency signal as the rising edge. At this time, the B _LO (m) representing the boundary of the frequency band can be different from the low frequency time envelope shape determining unit 10eB, for example, by defining only the frequency band having a relatively high frequency. The process of determining the time envelope shape of the high frequency signal as the rising edge is not limited to the above example.

さらに、例えば、高周波数信号の時間包絡形状を立ち下がりと決定する。例えば、低周波数時間包絡形状決定部10eBが前記低周波数信号の時間包絡形状を立ち下がりと決定する処理と同様にして、高周波数信号の時間包絡形状を立ち下がりと決定できる。この際、周波数帯域の境界を表すB_LO(m)は、例えば、比較的高い周波数の周波数帯域のみを定義するなどとして、低周波数時間包絡形状決定部10eBと異ならせることができる。高周波数信号の時間包絡形状を立ち下がりと決定する処理は上記の例に限定されない。 Further, for example, the time-envelope shape of the high frequency signal is determined to be falling. For example, the time envelope shape of the high frequency signal can be determined to be falling in the same manner as the process in which the low frequency time envelope shape determining unit 10eB determines the time envelope shape of the low frequency signal as falling. At this time, the B _LO (m) representing the boundary of the frequency band can be different from the low frequency time envelope shape determining unit 10eB, for example, by defining only the frequency band having a relatively high frequency. The process of determining the time-envelope shape of a high-frequency signal as a falling edge is not limited to the above example.

［第4の実施形態の音声復号装置の第3の変形例］
図32は、第4の実施形態に係る音声復号装置の第3の変形例13Cの構成を示す図である。 [Third variant of the audio decoding device of the fourth embodiment]
FIG. 32 is a diagram showing a configuration of a third modification 13C of the audio decoding device according to the fourth embodiment.

高周波数時間包絡形状決定部13aCは、符号化系列解析部13cから高周波時間包絡形状に関する情報、コア復号部10bから低周波数信号、分析フィルタバンク部10cから低周波数信号の複数のサブバンド信号のうち少なくとも一つを受け取り、高周波数信号の時間包絡形状を決定する（ステップS13-1に相当の処理）。 The high frequency time entrainment shape determination unit 13aC is composed of information on the high frequency time entrapment shape from the coded sequence analysis unit 13c, the low frequency signal from the core decoding unit 10b, and the low frequency signal from the analysis filter bank unit 10c. Receive at least one and determine the time-enclosed shape of the high frequency signal (process corresponding to step S13-1).

例えば、高周波数信号の時間包絡形状を平坦と決定する。この場合、上記第4の実施形態の音声復号装置、当該復号装置の第1及び第2の変形例にて記載の高周波数信号の時間包絡形状を平坦と決定する方法を少なくとも一つ以上組み合わせて時間包絡形状を平坦と決定する。高周波数信号の時間包絡形状を平坦と決定する方法は上記に限定されない。 For example, the time envelope shape of a high frequency signal is determined to be flat. In this case, at least one combination of the voice decoding device of the fourth embodiment and the method of determining the time envelope shape of the high frequency signal as flat according to the first and second modifications of the decoding device. Determine the time envelope shape to be flat. The method for determining the time envelope shape of a high frequency signal as flat is not limited to the above.

また、例えば、高周波数信号の時間包絡形状を立ち上がりと決定する。この場合、上記第4の実施形態の音声復号装置、当該復号装置の第1及び第2の変形例にて記載の高周波数信号の時間包絡形状を立ち上がりと決定する方法を少なくとも一つ以上組み合わせて時間包絡形状を立ち上がりと決定する。高周波数信号の時間包絡形状を立ち上がりと決定する方法は上記に限定されない。 Further, for example, the time-envelope shape of the high frequency signal is determined as the rising edge. In this case, at least one combination of the voice decoding device of the fourth embodiment and the method of determining the time envelope shape of the high frequency signal as the rising edge described in the first and second modifications of the decoding device is combined. The time envelope shape is determined to be the rising edge. The method for determining the time envelope shape of the high frequency signal as the rising edge is not limited to the above.

さらに、例えば、高周波数信号の時間包絡形状を立ち下がりと決定する。この場合、上記第4の実施形態の音声復号装置、当該復号装置の第1及び第2の変形例にて記載の高周波数信号の時間包絡形状を立ち下がりと決定する方法を少なくとも一つ以上組み合わせて時間包絡形状を立ち下がりと決定する。高周波数信号の時間包絡形状を立ち下がりと決定する方法は上記に限定されない。 Further, for example, the time-envelope shape of the high frequency signal is determined to be falling. In this case, at least one combination of the voice decoding device of the fourth embodiment and the method of determining the time envelope shape of the high frequency signal as the falling edge described in the first and second modifications of the decoding device. The time-envelope shape is determined to be falling. The method for determining the time-envelope shape of a high-frequency signal as a falling edge is not limited to the above.

［第4の実施形態の音声符号化装置の第1の変形例］
図33は、第4の実施形態に係る音声符号化装置の第1の変形例23Aの構成を示す図である。 [First modification of the voice coding device of the fourth embodiment]
FIG. 33 is a diagram showing a configuration of a first modification 23A of the voice coding device according to the fourth embodiment.

図34は、第4の実施形態に係る音声符号化装置の第1の変形例23Aの動作を示すフローチャートである。 FIG. 34 is a flowchart showing the operation of the first modification 23A of the voice coding device according to the fourth embodiment.

時間包絡情報符号化部23aAは、低周波数信号の時間包絡と高周波数信号の時間包絡のうち少なくとも一つ以上を算出し、当該低周波数信号及び高周波数信号の時間包絡のうち少なくとも一つ以上より時間包絡情報を算出し符号化する（ステップS23-1a）。低周波数信号の時間包絡は、包絡算出部20eにて算出した低周波数信号のサブバンド信号のパワーを用いて低周波数信号の時間包絡を算出する。高周波数信号の時間包絡は、包絡算出部20eにて算出した高周波数信号のサブバンド信号のパワーを用いて高周波数信号の時間包絡を算出する。当該処理において、低周波数信号のサブバンド信号のパワーが算出されていない場合は、時間包絡情報符号化部23aAにて低周波数信号のサブバンド信号のパワーを算出してもよく、低周波数信号のサブバンド信号のパワーがどこで算出されるかは限定されない。さらには、高周波数信号のサブバンド信号のパワーが算出されていない場合は、時間包絡情報符号化部23aAにて高周波数信号のサブバンド信号のパワーを算出してもよく、高周波数信号のサブバンド信号のパワーがどこで算出されるかは限定されない。 The time envelope information coding unit 23aA calculates at least one of the time envelope of the low frequency signal and the time envelope of the high frequency signal, and from at least one of the time envelopes of the low frequency signal and the high frequency signal. The time envelope information is calculated and encoded (step S23-1a). For the time envelope of the low frequency signal, the time envelope of the low frequency signal is calculated by using the power of the subband signal of the low frequency signal calculated by the envelope calculation unit 20e. For the time envelope of the high frequency signal, the time envelope of the high frequency signal is calculated by using the power of the subband signal of the high frequency signal calculated by the envelope calculation unit 20e. When the power of the subband signal of the low frequency signal is not calculated in the processing, the power of the subband signal of the low frequency signal may be calculated by the time wrapping information coding unit 23aA, and the power of the subband signal of the low frequency signal may be calculated. Where the power of the subband signal is calculated is not limited. Furthermore, when the power of the sub-band signal of the high-frequency signal has not been calculated, the power of the sub-band signal of the high-frequency signal may be calculated by the time-wrapping information coding unit 23aA, and the sub-band signal of the high-frequency signal may be calculated. There is no limit to where the power of the band signal is calculated.

例えば、時間包絡情報として、時間包絡形状の平坦さの程度を表す情報を算出する。例えば、任意の時間セグメントt_E(l)≦i<t_E(l+1)内でB_LO(m) (m=0,…,M_LO, M_LO≧1) (B_LO(0)≧0, B_LO(M_LO)<k_x)で境界を表されるM_LO個の周波数帯域に分割し、m番目の周波数帯域に含まれる低周波数信号のサブバンド信号X_LO(k,i) (B_LO(m)≦k<B_LO(m+1), t_E(l)≦i<t_E(l+1))の時間包絡E_LO(k,i)を式（７）により算出する。また時間包絡E_LO(k,i)の算出方法は式（７）に限定されない。時間包絡E_LO(k,i)の分散またはそれに準ずるパラメータを算出し、当該パラメータを符号化する。さらに別の例では、時間包絡E_LO(k,i)の相加平均と相乗平均の比またはそれに準ずるパラメータを算出し、当該パラメータを符号化する。さらには、例えば、任意の時間セグメントt_E(l)≦i<t_E(l+1)内でB_HI(m) (m=0,…,M_HI, M_H≧1) (B_HI(0)≧k_x, B_HI(M_HI)<k_h)で境界を表されるM_HI個の周波数帯域に分割し、m番目の周波数帯域に含まれる高周波数信号のサブバンド信号X_HI(k,i) (B_HI(m)≦k<B_HI(m+1), t_E(l)≦i<t_E(l+1))の時間包絡E_HI(k,i)を式（１１）により算出する。また時間包絡E_HI(k,i)の算出方法は式（１１）に限定されない。時間包絡E_HI(k,i)の分散またはそれに準ずるパラメータを算出し、当該パラメータを符号化する。さらに別の例では、時間包絡E_HI(k,i)の相加平均と相乗平均の比またはそれに準ずるパラメータを算出し、当該パラメータを符号化する。時間包絡形状の平坦さの程度を表す情報の算出方法は上記の例に限定されない。 For example, as the time envelope information, information indicating the degree of flatness of the time envelope shape is calculated. For example, within any time segment t _E (l) ≤ i <t _E (l + 1), B _LO (m) (m = 0,…, M _LO , M _LO ≥ 1) (B _LO (0) ≥ It is divided into M _LO frequency bands whose boundaries are represented by 0, B _LO (M _LO ) <k _x ), and the subband signal X _LO (k, i) of the low frequency signal included in the mth frequency band. Calculate the time-wrapping E _LO (k, i) of (B _LO (m) ≤ k <B _LO (m + 1), t _E (l) ≤ i <t _E (l + 1)) by Eq. (7). To do. Moreover, the calculation method of the time envelope E _LO (k, i) is not limited to the equation (7). Calculate the variance of the time envelope E _LO (k, i) or a parameter equivalent thereto, and encode the parameter. In yet another example, the ratio of the arithmetic mean to the geometric mean of the time-envelope _ELO (k, i) or a parameter equivalent thereto is calculated and the parameter is encoded. Furthermore, for example, within an arbitrary time segment t _E (l) ≤ i <t _E (l + 1), B _HI (m) (m = 0,…, M _HI , M _H ≥ 1) (B _HI ( 0) ≧ k _x , B _HI (M _HI ) <k _h ) is divided into M _HI frequency bands whose boundaries are represented by the subband signal X _HI (subband signal of the high frequency signal included in the mth frequency band). The time wrapping E _HI (k, i) of k, i) (B _HI (m) ≤ k <B _HI (m + 1), t _E (l) ≤ i <t _E (l + 1)) Calculated according to 11). Moreover, the calculation method of the time envelope E _HI (k, i) is not limited to the equation (11). Calculate the variance of the time-envelope E _HI (k, i) or a parameter equivalent thereto, and encode the parameter. In yet another example, the ratio of the arithmetic mean to the geometric mean of the time-envelope E _HI (k, i) or a parameter equivalent thereto is calculated and the parameter is encoded. The method of calculating the information indicating the degree of flatness of the time envelope shape is not limited to the above example.

さらに、例えば、時間包絡情報として、時間包絡形状の立ち上がりの程度を表す情報を算出する。例えば、時間包絡E_LO(k,i)の時間方向の差分値を算出し、当該差分値の任意の時間セグメント内の最大値を算出し符号化する。さらには、例えば、時間包絡E_HI(k,i)の時間方向の差分値を算出し、当該差分値の任意の時間セグメント内の最大値を算出し符号化する。時間包絡形状を立ち上がりの程度を表す情報の算出方法は上記の例に限定されない。 Further, for example, as the time envelope information, information indicating the degree of rise of the time envelope shape is calculated. For example, the difference value in the time direction of the time envelope E _LO (k, i) is calculated, and the maximum value of the difference value in an arbitrary time segment is calculated and encoded. Further, for example, the difference value in the time direction of the time envelope E _HI (k, i) is calculated, and the maximum value of the difference value in an arbitrary time segment is calculated and encoded. The method of calculating the information indicating the degree of rise of the time envelope shape is not limited to the above example.

さらに、例えば、時間包絡情報として、時間包絡形状の立ち下がりの程度を表す情報を算出する。例えば、時間包絡E_LO(k,i)のの時間方向の差分値を算出し、当該差分値の任意の時間セグメント内の最小値を算出し符号化する。さらには、例えば、時間包絡E_HI(k,i)のの時間方向の差分値を算出し、当該差分値の任意の時間セグメント内の最小値を算出し符号化する。 Further, for example, as the time envelope information, information indicating the degree of falling of the time envelope shape is calculated. For example, the difference value in the time direction of the time envelope E _LO (k, i) is calculated, and the minimum value of the difference value in an arbitrary time segment is calculated and encoded. Further, for example, the difference value in the time direction of the time envelope E _HI (k, i) is calculated, and the minimum value of the difference value in an arbitrary time segment is calculated and encoded.

なお、時間包絡形状を立ち下がりの程度を表す情報の算出方法は上記の例に限定されない。前記時間包絡情報として平坦の程度、立ち上がりの程度、及び立下りの程度を表す情報を算出する例において、低周波数信号及び高周波数信号のサブバンド信号の時間包絡のうち一方のみを用いる場合においては、他方の時間包絡の算出のみに係る各部及び各処理を省略することができる。 The method of calculating the information indicating the degree of falling of the time envelope shape is not limited to the above example. In the example of calculating the information indicating the degree of flatness, the degree of rising edge, and the degree of falling edge as the time envelope information, when only one of the time envelopes of the subband signal of the low frequency signal and the high frequency signal is used. , Each part and each process related only to the calculation of the other time envelope can be omitted.

［第5の実施形態］
図35は、第5の実施形態に係る音声復号装置14の構成を示す図である。音声復号装置14の通信装置は、下記音声符号化装置24から出力される多重化された符号化系列を受信し、更に、復号した音声信号を外部に出力する。音声復号装置14は、図35に示すように、機能的には、符号化系列逆多重化部10aA、コア復号部10b、分析フィルタバンク部10c、符号化系列解析部13c、高周波数信号生成部10g、高周波数時間包絡形状決定部13a、時間包絡修正部14a、復号/逆量子化部10h、周波数包絡調整部10i、及び合成フィルタバンク部10jを備える。 [Fifth Embodiment]
FIG. 35 is a diagram showing a configuration of the audio decoding device 14 according to the fifth embodiment. The communication device of the voice decoding device 14 receives the multiplexed coding sequence output from the following voice coding device 24, and further outputs the decoded voice signal to the outside. As shown in FIG. 35, the voice decoding device 14 functionally has a coded sequence demultiplexing section 10aA, a core decoding section 10b, an analysis filter bank section 10c, a coded sequence analysis section 13c, and a high frequency signal generation section. It includes 10 g, a high frequency time wrapping shape determination unit 13a, a time wrapping correction unit 14a, a decoding / dequantization unit 10h, a frequency wrapping adjustment unit 10i, and a synthesis filter bank unit 10j.

図36は、第5の実施形態に係る音声復号装置14の動作を示すフローチャートである。 FIG. 36 is a flowchart showing the operation of the voice decoding device 14 according to the fifth embodiment.

時間包絡修正部14aは、高周波数時間包絡形状決定部13aで決定した時間包絡形状に基づいて、高周波数信号生成部10gから出力される高周波数信号の複数のサブバンド信号の時間包絡の形状を修正する（ステップS14-1）。 The time envelope correction unit 14a determines the shape of the time envelope of a plurality of subband signals of the high frequency signal output from the high frequency signal generation unit 10g based on the time envelope shape determined by the high frequency time envelope shape determination unit 13a. Correct (step S14-1).

例えば、任意の時間セグメントt_E(l)≦i<t_E(l+1)内でB_gen,HI(m) (m=0,…,M_gen,HI, M_gen,HI≧1) (B_gen,HI(0)≧k_x, B_gen,HI(M_gen,HI)<k_h)で境界を表されるM_HI個の周波数帯域に分割し、m番目の周波数帯域に含まれる高周波数信号生成部10gから出力される高周波数信号のサブバンド信号X_gen,HI(k,i) (B_HI(m)≦k<B_HI(m+1), t_E(l)≦i<t_E(l+1))に対して、所定の関数F(X_gen,HI(k,i))を用いて以下の式（２６）

により得られるX’_gen,HI(k,i)を時間包絡形状が修正された高周波数信号のサブバンド信号として出力する。 For example, within any time segment t _E (l) ≤ i <t _E (l + 1), B _{gen, HI} (m) (m = 0,…, M _{gen, HI} , M _{gen, HI} ≥ 1) ( It is divided into M _HI frequency bands whose boundaries are represented by B _{gen, HI} (0) ≥ k _x , B _{gen, HI} (M _{gen, HI} ) <k _h ), and the height included in the mth frequency band. Sub-band signal of high frequency signal output from frequency signal generator 10g X _{gen, HI} (k, i) (B _HI (m) ≤ k <B _HI (m + 1), t _E (l) ≤ i < For t _E (l + 1)), the following equation (26) is used using the predetermined function F (X _{gen, HI} (k, i)).

_{X'gen, HI} (k, i) obtained by is output as a subband signal of a high frequency signal with a corrected time envelope shape.

例えば、前記高周波数信号の時間包絡形状が平坦と決定された場合、以下の処理により、当該高周波数信号の時間包絡形状を修正できる。例えば、当該サブバンド信号X_gen,HI(k,i)をB_gen,HI(m) (m=0,…,M_HI, M_HI≧1) (B_gen,HI(0)≧k_x, B_gen,HI(M_HI)<k_h)で境界を表されるM_HI個の周波数帯域に分割し、m番目の周波数帯域に含まれるサブバンド信号X_gen,HI(k,i) (B_HI(m)≦k<B_HI(m+1), t_E(l)≦i<t_E(l+1))に対して、所定の関数F(X_gen,HI(k,i))を、

（これらを式（２７）という。）
として、X’_gen,HI(k,i)を時間包絡形状が修正された高周波数信号のサブバンド信号として出力する。
また別の例によれば、所定の関数F(X_gen,HI(k,i))を、サブバンド信号X_gen,HI(k,i)に対して平滑化フィルタ処理を施す

(N_filt≧1)で定義して、X’_gen,HI(k,i)を時間包絡形状が修正された高周波数信号のサブバンド信号として出力する。さらに、前記B_gen,HI(m)を用いて境界が表される各周波数帯域内で、フィルタ処理前後のサブバンド信号のパワーをあわせるように処理できる。
また別の例によれば、前記B_gen,HI(m)を用いて境界が表される各周波数帯域内で、サブバンド信号X_gen,HI(k,i)を周波数方向に線形予測して線形予測係数α_p(m) (m=0,…,M_HI-1)を得て、所定の関数F(X_gen,HI(k,i))をサブバンド信号X_gen,HI(k,i)に対して線形予測逆フィルタ処理を施す

(N_pred≧1)で定義して、X’_gen,HI(k,i)を時間包絡形状が修正された高周波数信号のサブバンド信号として出力する。 For example, when the time envelope shape of the high frequency signal is determined to be flat, the time envelope shape of the high frequency signal can be corrected by the following processing. For example, the subband signal X _{gen, HI} (k, i) is changed to B _{gen, HI} (m) (m = 0,…, M _HI , M _HI ≧ 1) (B _{gen, HI} (0) ≧ k _x , A subband signal X _{gen, HI} (k, i) (B) divided into M _HI frequency bands whose boundaries are represented by B _{gen, HI} (M _HI ) <k _h ) and included in the mth frequency band. _{For HI} (m) ≤ k <B _HI (m + 1), t _E (l) ≤ i <t _E (l + 1)), the given function F (X _{gen, HI} (k, i)) ,

(These are called equation (27).)
As, _{X'gen, HI} (k, i) is output as a subband signal of a high frequency signal with a corrected time envelope shape.
According to another example, the predetermined function F (X _{gen, HI} (k, i)) is subjected to smoothing filtering on the subband signal X _{gen, HI} (k, i).

Defined by (N _filt ≧ 1), _{X'gen, HI} (k, i) is output as a subband signal of a high frequency signal with a corrected time envelope shape. Further, the B _{gen and HI} (m) can be used to match the powers of the subband signals before and after the filtering within each frequency band whose boundary is represented.
According to another example, the subband signals X _{gen, HI} (k, i) are linearly predicted in the frequency direction within each frequency band whose boundary is represented by using the B _{gen, HI} (m). Obtain the linear prediction coefficient α _p (m) (m = 0,…, M _HI -1) and _{apply the} predetermined function F (X _{gen, HI} (k, i)) to the subband signal X _{gen, HI} (k, k, Perform linear prediction inverse filtering on i)

Defined by (N _pred ≥ 1), _{X'gen, HI} (k, i) is output as a subband signal of a high frequency signal with a corrected time envelope shape.

上記の時間包絡形状を平坦に修正する処理の例は、それぞれを組み合わせて実施できる。時間包絡修正部14aは、高周波数信号の複数のサブバンド信号の時間包絡の形状を平坦に修正する処理を実施し、上記の例に限定されない。 The above-mentioned example of the process of correcting the time envelope shape flatly can be carried out in combination with each other. The time envelope correction unit 14a performs a process of flatly correcting the shape of the time envelope of a plurality of subband signals of the high frequency signal, and is not limited to the above example.

さらには、例えば、前記高周波数信号の時間包絡形状が立ち上がりと決定された場合、以下の処理により、当該高周波数信号の時間包絡形状を修正できる。例えば、所定の関数F(X_gen,HI(k,i))をiに対して単調増加する関数incr(i)を用いて

で定義して、X’_gen,HI(k,i)を時間包絡形状が修正された高周波数信号のサブバンド信号として出力する。さらに、前記B_gen,HI(m)を用いて境界が表される各周波数帯域内で、時間包絡形状の修正前後のサブバンド信号のパワーをあわせるように処理できる。 Further, for example, when the time envelope shape of the high frequency signal is determined to be rising, the time envelope shape of the high frequency signal can be corrected by the following processing. For example, using the function incr (i) that monotonically increases a given function F (X _{gen, HI} (k, i)) with respect to i.

Defined in, _{X'gen, HI} (k, i) is output as a subband signal of a high frequency signal with a corrected time envelope shape. Further, the B _{gen and HI} (m) can be used to match the powers of the subband signals before and after the correction of the time envelope shape within each frequency band whose boundary is represented.

時間包絡修正部14aは、高周波数信号の複数のサブバンド信号の時間包絡の形状を立ち上がりに修正する処理を実施し、上記の例に限定されない。 The time envelope correction unit 14a performs a process of correcting the shape of the time envelope of a plurality of subband signals of the high frequency signal at the rising edge, and is not limited to the above example.

さらには、例えば、前記高周波数信号の時間包絡形状が立ち下がりと決定された場合、以下の処理により、当該高周波数信号の時間包絡形状を修正できる。例えば、所定の関数F(X_gen,HI(k,i))を、iに対して単調減少する関数decr(i)を用いて

で定義して、X’_gen,HI(k,i)を時間包絡形状が修正された高周波数信号のサブバンド信号として出力する。さらに、前記B_gen,HI(m)を用いて境界が表される各周波数帯域内で、時間包絡形状の修正前後のサブバンド信号のパワーをあわせるように処理できる。 Further, for example, when the time envelope shape of the high frequency signal is determined to be falling, the time envelope shape of the high frequency signal can be corrected by the following processing. For example, using the function decr (i) that monotonically decreases a given function F (X _{gen, HI} (k, i)) with respect to i.

Defined in, _{X'gen, HI} (k, i) is output as a subband signal of a high frequency signal with a corrected time envelope shape. Further, it is possible to process so as to match the powers of the subband signals before and after the correction of the time envelope shape within each frequency band whose boundary is represented by using the B _{gen and HI} (m).

時間包絡修正部14aは、高周波数信号の複数のサブバンド信号の時間包絡の形状を立ち下がりに修正する処理を実施し、上記の例に限定されない。 The time envelope correction unit 14a performs a process of correcting the shape of the time envelope of a plurality of subband signals of the high frequency signal to a falling edge, and is not limited to the above example.

なお、本実施形態における周波数包絡調整部10iを、“ISO/IEC 14496-3”に規定される“SBR”および“Low Delay SBR”における“HF adjustment”にて実現される場合は、時間包絡修正部14aの処理を周波数包絡調整部10iにおいて行うことで演算量の削減ができる。具体的には、例えば、式（２７）により時間包絡形状を修正する際に、式（２７）内の高周波数信号のサブバンド信号のパワー

の算出は、前記“HF adjustment”において算出されるために省略できる。さらに、前記“HF adjustment”にて“interpolation”を利用しない場合（すなわち、bs_interpol_freq=0の場合）は、式（２７）内の高周波数信号のサブバンド信号のパワーの周波数方向の和

の算出は、前記“HF adjustment”において算出されるため、さらに省略できる。 If the frequency envelope adjustment unit 10i in this embodiment is realized by "HF adjustment" in "SBR" and "Low Delay SBR" specified in "ISO / IEC 14496-3", time envelope correction is performed. The amount of calculation can be reduced by performing the processing of the part 14a in the frequency envelope adjusting part 10i. Specifically, for example, when the time envelope shape is corrected by the equation (27), the power of the subband signal of the high frequency signal in the equation (27).

Can be omitted because it is calculated in the above-mentioned "HF adjustment". Further, when "interpolation" is not used in the "HF adjustment" (that is, when bs_interpol_freq = 0), the sum of the powers of the subband signals of the high frequency signal in the equation (27) in the frequency direction.

Since the calculation of is calculated in the above-mentioned "HF adjustment", it can be further omitted.

一方、前記“HF adjustment”において前記“interpolation”を利用し時間方向の和、

が算出される場合には、上記和を、前記“HF adjustment”において算出される

の代替量、あるいは近似量として用いることができ、上記和の算出を省略することにより演算量が削減できる。 On the other hand, in the "HF adjustment", the sum in the time direction using the "interpolation",

Is calculated, the sum is calculated in the "HF adjustment".

It can be used as an alternative amount or an approximate amount of, and the calculation amount can be reduced by omitting the calculation of the sum.

さらに、時間包絡修正部14aの他の例においても、同様に一部の演算を省略できることは明白である。 Furthermore, it is clear that some operations can be omitted in other examples of the time envelope correction unit 14a as well.

なお、本実施形態に係る音声復号装置14の高周波数時間包絡形状決定部13aに対して、本発明の第4の実施形態の音声復号装置の第1、第2、及び第3の変形例が適用できることは明白である。 It should be noted that the first, second, and third modifications of the voice decoding device according to the fourth embodiment of the present invention are provided with respect to the high frequency time envelope shape determining unit 13a of the voice decoding device 14 according to the present invention. It is clear that it is applicable.

図37は、第5の実施形態に係る音声符号化装置24の構成を示す図である。音声符号化装置24の通信装置は、符号化の対象となる音声信号を外部から受信し、更に、符号化された符号化系列を外部に出力する。音声符号化装置24は、図37に示すように、機能的には、ダウンサンプリング部20a、コア符号化部20b、分析フィルタバンク部20c、制御パラメータ符号化部20d、包絡算出部20e、量子化/符号化部20f、擬似高周波数信号生成部24a、サブバンド信号パワー算出部24b、時間包絡情報符号化部24c、及び符号化系列多重化部20hを備える。 FIG. 37 is a diagram showing a configuration of the voice coding device 24 according to the fifth embodiment. The communication device of the voice coding device 24 receives the voice signal to be coded from the outside, and further outputs the coded coding sequence to the outside. As shown in FIG. 37, the voice coding device 24 functionally includes a downsampling unit 20a, a core coding unit 20b, an analysis filter bank unit 20c, a control parameter coding unit 20d, an encapsulation calculation unit 20e, and a quantization unit. It includes a / coding unit 20f, a pseudo high frequency signal generation unit 24a, a subband signal power calculation unit 24b, a time wrapping information coding unit 24c, and a coding sequence multiplexing unit 20h.

図38は、第5の実施形態に係る音声符号化装置24の動作を示すフローチャートである。 FIG. 38 is a flowchart showing the operation of the voice coding device 24 according to the fifth embodiment.

擬似高周波数信号生成部24aは、分析フィルタバンク部20cで得られる入力音声信号の低周波数信号のサブバンド信号と、制御パラメータ符号化部20dで得られる高周波数信号を生成するために必要な制御パラメータに基づいて、擬似高周波数信号を生成する（ステップS24-1）。当該擬似高周波数信号の生成処理は、高周波数信号生成部10gにおける処理と同様に行われるが、高周波数信号生成部10gではコア復号部10bにて復号された低周波数信号のサブバンド信号から生成されるのに対し、擬似高周波数信号生成部24aでは入力音声信号の低周波数信号のサブバンド信号から生成される点が異なる。なお、擬似高周波数信号生成部24aでは、演算量の削減を目的として、高周波数信号生成部10gでの処理の一部を省略できる。例えば、生成される高周波数信号のトーナリティの調整処理を省略できる。 The pseudo high frequency signal generation unit 24a is a control required to generate a subband signal of a low frequency signal of the input audio signal obtained by the analysis filter bank unit 20c and a high frequency signal obtained by the control parameter coding unit 20d. Generate a pseudo high frequency signal based on the parameters (step S24-1). The pseudo high frequency signal generation processing is performed in the same manner as the processing in the high frequency signal generation unit 10g, but the high frequency signal generation unit 10g generates the pseudo high frequency signal from the subband signal of the low frequency signal decoded by the core decoding unit 10b. On the other hand, the pseudo high frequency signal generation unit 24a is different in that it is generated from the subband signal of the low frequency signal of the input audio signal. In the pseudo high frequency signal generation unit 24a, a part of the processing by the high frequency signal generation unit 10g can be omitted for the purpose of reducing the amount of calculation. For example, the process of adjusting the tonality of the generated high frequency signal can be omitted.

サブバンド信号パワー算出部24bは、擬似高周波数信号生成部24aにて生成された擬似高周波数信号のサブバンド信号のパワーを算出する（ステップS24-2）。 The subband signal power calculation unit 24b calculates the power of the subband signal of the pseudo high frequency signal generated by the pseudo high frequency signal generation unit 24a (step S24-2).

時間包絡情報符号化部24cは、包絡算出部20eにて算出した高周波数信号のサブバンド信号のパワーを用いて高周波数信号の時間包絡を算出し、サブバンド信号パワー算出部24bにて算出した擬似高周波数信号のサブバンド信号のパワーを用いて擬似高周波数信号の時間包絡を算出し、当該高周波数信号の時間包絡と擬似高周波数信号の時間包絡より時間包絡情報を算出し符号化する（ステップS24-3）。当該処理において、高周波数信号のサブバンド信号のパワーが算出されていない場合は、時間包絡情報符号化部24cにて高周波数信号のサブバンド信号のパワーを算出でき、高周波数信号のサブバンド信号のパワーがどこで算出されるかは限定されない。 The time wrapping information coding unit 24c calculates the time wrapping of the high frequency signal using the power of the subband signal of the high frequency signal calculated by the wrapping calculation unit 20e, and calculates it by the subband signal power calculation unit 24b. The time wrapping of the quasi-high frequency signal is calculated using the power of the subband signal of the quasi-high frequency signal, and the time wrapping information is calculated and encoded from the time wrapping of the high frequency signal and the time wrapping of the pseudo high frequency signal ( Step S24-3). If the power of the sub-band signal of the high-frequency signal is not calculated in the processing, the power of the sub-band signal of the high-frequency signal can be calculated by the time-wrapping information coding unit 24c, and the sub-band signal of the high-frequency signal can be calculated. There is no limit to where the power of is calculated.

例えば、時間包絡情報符号化部21aが前記高周波数信号の時間包絡を算出する処理と同様の処理により、当該高周波数信号の時間包絡を算出できる。高周波数信号のサブバンド信号の時間包絡は、当該高周波数信号のサブバンド信号の大きさの時間方向の変動がわかるパラメータであれば良く、前記の例に限定されない。 For example, the time envelope of the high frequency signal can be calculated by the same process as the process in which the time envelope information coding unit 21a calculates the time envelope of the high frequency signal. The time envelope of the subband signal of the high frequency signal may be any parameter as long as it is a parameter that shows the variation in the magnitude of the subband signal of the high frequency signal in the time direction, and is not limited to the above example.

例えば、任意の時間セグメントt_E(l)≦i<t_E(l+1)内でB_sim,gen,HI(m) (m=0,…,M_sim,gen,HI, M_sim,gen,HI≧1) (B_sim,gen,HI(0)≧k_x, B_sim,gen,HI(M_sim,gen,HI)<k_h)で境界を表されるM_sim,gen,HI個の周波数帯域に分割し、m番目の周波数帯域に含まれる擬似高周波数信号のサブバンド信号X_sim,gen,HI(k,i) (B_sim,gen,HI(m)≦k<B_sim,gen,HI(m+1), t_E(l)≦i<t_E(l+1))の時間包絡E_sim,gen,HI(k,i)を算出する。

擬似高周波数信号のサブバンド信号の時間包絡は、擬似高周波数信号のサブバンド信号の大きさの時間方向の変動がわかるパラメータであれば良く、前記の例に限定されない。 For example, within any time segment t _E (l) ≤ i <t _E (l + 1) B _{sim, gen, HI} (m) (m = 0,…, M _{sim, gen, HI} , M _{sim, gen , HI} ≧ 1) (B _{sim, gen, HI} (0) ≧ k _x , B _{sim, gen, HI} (M _{sim, gen, HI} ) <k _h ) M _{sim, gen, HI} Subband signal of pseudo high frequency signal included in the mth frequency band X _{sim, gen, HI} (k, i) (B _{sim, gen, HI} (m) ≤ k <B _sim, Calculate the time wrapping E _{sim, gen, HI} (k, i) of _{gen, HI} (m + 1), t _E (l) ≤ i <t _E (l + 1)).

The time envelope of the subband signal of the pseudo high frequency signal is not limited to the above example, as long as it is a parameter that shows the fluctuation of the magnitude of the subband signal of the pseudo high frequency signal in the time direction.

例えば、時間包絡情報符号化部20gが時間包絡情報として平坦の程度を表す情報を算出する処理において、前記低周波数信号のサブバンド信号の時間包絡の代わりに当該高周波数信号のサブバンド信号の時間包絡を用い、さらに前記コア復号信号のサブバンド信号の時間包絡の代わりに当該擬似高周波数信号のサブバンド信号の時間包絡を用いることにより、時間包絡情報として平坦の程度を表す情報を算出でき、また当該時間包絡情報を符号化できる。例えば、時間包絡の平坦の程度を平坦か否かで表現すれば1ビットで符号化でき、例えば、前記任意の時間セグメント内において前記M_sim,gen,HI個の周波数帯域毎に当該情報をM_sim,gen,HIビットで符号化できる。 For example, in a process in which the time-wrapping information coding unit 20g calculates information indicating the degree of flatness as time-wrapping information, the time of the sub-band signal of the high-frequency signal is replaced with the time-wrapping of the sub-band signal of the low-frequency signal. By using the wrapping and further using the time wrapping of the subband signal of the pseudo high frequency signal instead of the time wrapping of the subband signal of the core decoding signal, information indicating the degree of flatness can be calculated as the time wrapping information. Moreover, the time-related information can be encoded. For example, if the degree of flatness of the time envelope is expressed by whether it is flat or not, it can be encoded with 1 bit. For example, the information is M _{sim, gen, HI for} each frequency band of M _{sim, gen, HI} in the arbitrary time segment. _It can be encoded with _{sim, gen, HI} bits.

さらに、例えば、時間包絡情報符号化部20gが時間包絡情報として立ち上がりの程度を表す情報を算出する処理において、前記低周波数信号のサブバンド信号の時間包絡の代わりに当該高周波数信号のサブバンド信号の時間包絡を用い、さらに前記コア復号信号のサブバンド信号の時間包絡の代わりに当該擬似高周波数信号のサブバンド信号の時間包絡を用いることにより、時間包絡情報として立ち上がりの程度を表す情報を算出でき、また当該時間包絡情報を符号化できる。例えば、時間包絡の立ち上がりの程度を立ち上がりか否かで表現すれば1ビットで符号化でき、例えば、前記任意の時間セグメント内において前記M_sim,gen,HI個の周波数帯域毎に当該情報をM_sim,gen,HIビットで符号化できる。 Further, for example, in a process in which the time-wrapping information coding unit 20g calculates information indicating the degree of rise as time-wrapping information, the sub-band signal of the high-frequency signal is replaced with the time-wrapping of the sub-band signal of the low-frequency signal. By using the time wrapping of the above and using the time wrapping of the subband signal of the pseudo high frequency signal instead of the time wrapping of the subband signal of the core decoding signal, information indicating the degree of rise is calculated as the time wrapping information. And the time wrapping information can be encoded. For example, if the degree of rise of the time envelope is expressed by whether it rises or not, it can be encoded with 1 bit. For example, the information is M _{sim, gen, HI for} each frequency band of M _{sim, gen, HI} in the arbitrary time segment. _It can be encoded with _{sim, gen, HI} bits.

さらに、例えば、時間包絡情報符号化部20gが時間包絡情報として立ち下がりの程度を表す情報を算出する処理において、前記低周波数信号のサブバンド信号の時間包絡の代わりに当該高周波数信号のサブバンド信号の時間包絡を用い、さらに前記コア復号信号のサブバンド信号の時間包絡の代わりに当該擬似高周波数信号のサブバンド信号の時間包絡を用いることにより、時間包絡情報として立ち下がりの程度を表す情報を算出でき、また当該時間包絡情報を符号化できる。例えば、時間包絡の立ち下がりの程度を立ち下がりか否かで表現すれば1ビットで符号化でき、例えば、前記任意の時間セグメント内において前記M_sim,gen,HI個の周波数帯域毎に当該情報をM_sim,gen,HIビットで符号化できる。 Further, for example, in a process in which the time-wrapping information coding unit 20g calculates information indicating the degree of fall as time-wrapping information, the sub-band of the high-frequency signal is replaced with the time-wrapping of the sub-band signal of the low-frequency signal. By using the time wrapping of the signal and further using the time wrapping of the subband signal of the pseudo high frequency signal instead of the time wrapping of the subband signal of the core decoding signal, information indicating the degree of falling is used as the time wrapping information. Can be calculated, and the time-related information can be encoded. For example, if the degree of the fall of the time envelope is expressed by whether or not it falls, it can be encoded with 1 bit. For example, the information can be encoded for each of the M _{sim, gen, and HI} frequency bands in the arbitrary time segment. Can be encoded with M _{sim, gen, HI} bits.

なお、時間包絡情報の算出方法、及び符号化方法は前記の例に限定されない。また、本実施形態の音声符号化装置に対して、本発明の第4の実施形態の音声符号化装置の第1の変形例が適用できることは明白である。 The time-envelope information calculation method and coding method are not limited to the above examples. Further, it is clear that the first modification of the voice coding device of the fourth embodiment of the present invention can be applied to the voice coding device of the present embodiment.

［第5の実施形態の音声復号装置の第1の変形例］
図39は、第5の実施形態に係る音声復号装置の第1の変形例14Aの構成を示す図である。 [First variant of the audio decoding device of the fifth embodiment]
FIG. 39 is a diagram showing a configuration of a first modification 14A of the audio decoding device according to the fifth embodiment.

図40は、第5の実施形態に係る音声復号装置の第1の変形例14Aの動作を示すフローチャートである。 FIG. 40 is a flowchart showing the operation of the first modification 14A of the audio decoding device according to the fifth embodiment.

高周波数時間包絡形状決定部14bは、符号化系列解析部13cから高周波時間包絡形状に関する情報、コア復号部10bから低周波数信号、分析フィルタバンク部10cから低周波数信号の複数のサブバンド信号、高周波数信号生成部10gから高周波数信号の複数のサブバンド信号、のうち少なくとも一つを受け取り、高周波数信号の時間包絡形状を決定する（ステップS14-2）。例えば、高周波数信号の時間包絡形状を平坦と決定する。さらに、例えば、高周波数信号の時間包絡形状を立ち上がりと決定する。さらに、例えば、高周波数信号の時間包絡形状を立ち下がりと決定する。本発明第4の実施形態に係る音声復号装置の第3の変形例13Cの高周波数時間包絡形状決定部13aCとの相違点は、入力として高周波数信号生成部10gから高周波数信号の複数のサブバンド信号も許容される点であり、当該高周波数信号のサブバンド信号からも、低周波数信号のサブバンド信号と同様の方法により、高周波数時間包絡形状を決定することができる。 The high frequency time entrainment shape determination unit 14b contains information on the high frequency time entrapment shape from the coded sequence analysis unit 13c, low frequency signals from the core decoding unit 10b, multiple subband signals of low frequency signals from the analysis filter bank unit 10c, and high frequencies. At least one of a plurality of subband signals of the high frequency signal is received from the frequency signal generator 10g, and the time entrainment shape of the high frequency signal is determined (step S14-2). For example, the time envelope shape of a high frequency signal is determined to be flat. Further, for example, the time envelope shape of the high frequency signal is determined to be the rising edge. Further, for example, the time-envelope shape of the high frequency signal is determined to be falling. The difference from the high frequency time entrainment shape determination unit 13aC of the third modification 13C of the voice decoding device according to the fourth embodiment of the present invention is that a plurality of subs of the high frequency signal are input from the high frequency signal generation unit 10g. A band signal is also an acceptable point, and the high frequency time entrainment shape can be determined from the subband signal of the high frequency signal by the same method as the subband signal of the low frequency signal.

［第6の実施形態］
図41は、第6の実施形態に係る音声復号装置15の構成を示す図である。音声復号装置15の通信装置は、下記音声符号化装置25から出力される多重化された符号化系列を受信し、更に、復号した音声信号を外部に出力する。音声復号装置15は、図41に示すように、機能的には、符号化系列逆多重化部10aA、コア復号部10b、分析フィルタバンク部10c、符号化系列解析部13c、高周波数信号生成部10g、復号/逆量子化部10h、周波数包絡調整部10i、高周波数時間包絡形状決定部13a、時間包絡修正部15a、及び合成フィルタバンク部10jを備える。 [Sixth Embodiment]
FIG. 41 is a diagram showing a configuration of the voice decoding device 15 according to the sixth embodiment. The communication device of the voice decoding device 15 receives the multiplexed coding sequence output from the following voice coding device 25, and further outputs the decoded voice signal to the outside. As shown in FIG. 41, the voice decoding device 15 functionally has a coded sequence demultiplexing section 10aA, a core decoding section 10b, an analysis filter bank section 10c, a coded sequence analysis section 13c, and a high frequency signal generation section. It includes 10 g, a decoding / dequantization unit 10h, a frequency wrapping adjustment unit 10i, a high frequency time wrapping shape determination unit 13a, a time wrapping correction unit 15a, and a synthetic filter bank unit 10j.

図42は、第6の実施形態に係る音声復号装置15の動作を示すフローチャートである。 FIG. 42 is a flowchart showing the operation of the voice decoding device 15 according to the sixth embodiment.

時間包絡修正部15aは、高周波数時間包絡形状決定部13aで決定した時間包絡形状に基づいて、周波数包絡調整部10iから出力される高周波数信号の複数のサブバンド信号の時間包絡の形状を修正する（ステップS15-1）。 The time envelope correction unit 15a corrects the shape of the time envelope of a plurality of subband signals of the high frequency signal output from the frequency envelope adjustment unit 10i based on the time envelope shape determined by the high frequency time envelope shape determination unit 13a. (Step S15-1).

例えば、任意の時間セグメントt_E(l)≦i<t_E(l+1)内でB_HI(m) (m=0,…,M_HI, M_HI≧1) (B_HI(0)≧k_x, B_HI(M_HI)<k_h)で境界を表されるM_HI個の周波数帯域に分割し、m番目の周波数帯域に含まれる周波数包絡調整部10iから出力される高周波数信号のサブバンド信号X_adj,HI(k,i) (B_adj,HI(m)≦k<B_adj,HI(m+1), t_E(l)≦i<t_E(l+1))に対して、所定の関数F(X_adj,HI(k,i))を用いて以下の式（３７）

により得られるX’_adj,HI(k,i)を時間包絡形状が修正された高周波数信号のサブバンド信号として出力する。 For example, within any time segment t _E (l) ≤ i <t _E (l + 1), B _HI (m) (m = 0,…, M _HI , M _HI ≥ 1) (B _HI (0) ≥ The high frequency signal output from the frequency wrapping adjustment unit 10i included in the mth frequency band divided into M _HI frequency bands whose boundaries are represented by k _x , B _HI (M _HI ) <k _h ). Subband signal X _{adj, HI} (k, i) (B _{adj, HI} (m) ≤ k <B _{adj, HI} (m + 1), t _E (l) ≤ i <t _E (l + 1)) On the other hand, using the predetermined function F (X _{adj, HI} (k, i)), the following equation (37)

_{X'adj, HI} (k, i) obtained by is output as a subband signal of a high frequency signal with a corrected time envelope shape.

例えば、前記高周波数信号の時間包絡形状が平坦と決定された場合、以下の処理により、当該高周波数信号の時間包絡形状を修正できる。例えば、時間包絡修正部14aにおける時間包絡形状を平坦に修正する処理において、高周波数信号生成部10gから出力される高周波数信号のサブバンド信号の代わりに、当該周波数包絡調整部10iから出力される高周波数信号のサブバンド信号X_adj,HI(k,i)を用いることにより、当該周波数包絡調整部10iから出力される高周波数信号のサブバンド信号X_adj,HI(k,i)の時間包絡形状を平坦に修正できる。時間包絡修正部15aは、高周波数信号の複数のサブバンド信号の時間包絡の形状を平坦に修正する処理を実施し、上記の例に限定されない。 For example, when the time envelope shape of the high frequency signal is determined to be flat, the time envelope shape of the high frequency signal can be corrected by the following processing. For example, in the process of flattening the time wrapping shape in the time wrapping correction unit 14a, the frequency wrapping adjustment unit 10i outputs the subband signal of the high frequency signal instead of the high frequency signal generation unit 10g. high frequency signals of the sub-band signals X _adj, by using _HI (k, i), the high frequency signal of the sub-band signals X _adj output from the frequency envelope adjuster _10i, the time envelope of _HI (k, i) The shape can be modified flat. The time envelope correction unit 15a performs a process of flattening the shape of the time envelope of a plurality of subband signals of the high frequency signal, and is not limited to the above example.

さらに、例えば、前記高周波数信号の時間包絡形状が立ち上がりと決定された場合、以下の処理により、当該高周波数信号の時間包絡形状を修正できる。例えば、時間包絡修正部14aにおける時間包絡形状を立ち上がりに修正する処理において、高周波数信号生成部10gから出力される高周波数信号のサブバンド信号の代わりに、当該周波数包絡調整部10iから出力される高周波数信号のサブバンド信号X_adj,HI(k,i)を用いることにより、当該周波数包絡調整部10iから出力される高周波数信号のサブバンド信号X_adj,HI(k,i)の時間包絡形状を立ち上がりに修正できる。時間包絡修正部15aは、高周波数信号の複数のサブバンド信号の時間包絡の形状を立ち上がりに修正する処理を実施し、上記の例に限定されない。 Further, for example, when the time envelope shape of the high frequency signal is determined to be rising, the time envelope shape of the high frequency signal can be corrected by the following processing. For example, in the process of correcting the time wrapping shape at the rising edge in the time wrapping correction unit 14a, the frequency wrapping adjustment unit 10i outputs the subband signal of the high frequency signal instead of the high frequency signal generation unit 10g. high frequency signals of the sub-band signals X _adj, by using _HI (k, i), the high frequency signal of the sub-band signals X _adj output from the frequency envelope adjuster _10i, the time envelope of _HI (k, i) The shape can be modified to rise. The time envelope correction unit 15a performs a process of correcting the shape of the time envelope of a plurality of subband signals of the high frequency signal at the rising edge, and is not limited to the above example.

さらに、例えば、前記高周波数信号の時間包絡形状が立ち下がりと決定された場合、以下の処理により、当該高周波数信号の時間包絡形状を修正できる。例えば、時間包絡修正部14aにおける時間包絡形状を立ち下がりに修正する処理において、高周波数信号生成部10gから出力される高周波数信号のサブバンド信号の代わりに、当該周波数包絡調整部10iから出力される高周波数信号のサブバンド信号X_adj,HI(k,i)を用いることにより、当該周波数包絡調整部10iから出力される高周波数信号のサブバンド信号X_adj,HI(k,i)の時間包絡形状を立ち下がりに修正できる。時間包絡修正部15aは、高周波数信号の複数のサブバンド信号の時間包絡の形状を立ち下がりに修正する処理を実施し、上記の例に限定されない。 Further, for example, when the time envelope shape of the high frequency signal is determined to be falling, the time envelope shape of the high frequency signal can be corrected by the following processing. For example, in the process of correcting the time wrapping shape to the falling edge in the time wrapping correction unit 14a, the frequency wrapping adjustment unit 10i outputs the high frequency signal instead of the high frequency signal subband signal output from the high frequency signal generation unit 10g. high frequency signals of the sub-band signals X _{adj that,} by using _HI (k, i), the frequency envelope adjuster 10i high frequency signal of the sub-band signals X _adj output _{from, HI} (k, i) of the time The wrapping shape can be modified to fall. The time envelope correction unit 15a performs a process of correcting the shape of the time envelope of a plurality of subband signals of the high frequency signal to a falling edge, and is not limited to the above example.

なお、本実施形態に係る音声復号装置15の高周波数時間包絡形状決定部13aに対して、本発明の第4の実施形態の音声復号装置の第1、第2、及び第3の変形例、及び本発明第5の実施形態の音声復号装置の第1の変形例が適用できることは明白である。 It should be noted that the first, second, and third modifications of the voice decoding device according to the fourth embodiment of the present invention are provided with respect to the high frequency time-enclosed shape determining unit 13a of the voice decoding device 15 according to the present invention. And it is clear that the first modification of the voice decoding apparatus of the fifth embodiment of the present invention can be applied.

図43は、第6の実施形態に係る音声符号化装置25の構成を示す図である。音声符号化装置25の通信装置は、符号化の対象となる音声信号を外部から受信し、更に、符号化された符号化系列を外部に出力する。音声符号化装置25は、図43に示すように、機能的には、ダウンサンプリング部20a、コア符号化部20b、分析フィルタバンク部20c、制御パラメータ符号化部20d、包絡算出部20e、量子化/符号化部20f、擬似高周波数信号生成部24a、サブバンド信号パワー算出部24b、周波数包絡調整部25a、時間包絡情報符号化部25b、及び符号化系列多重化部20hを備える。 FIG. 43 is a diagram showing a configuration of the voice coding device 25 according to the sixth embodiment. The communication device of the voice coding device 25 receives the voice signal to be coded from the outside, and further outputs the coded coding sequence to the outside. As shown in FIG. 43, the voice coding device 25 functionally includes a downsampling unit 20a, a core coding unit 20b, an analysis filter bank unit 20c, a control parameter coding unit 20d, an encapsulation calculation unit 20e, and a quantization unit. / It includes a coding unit 20f, a pseudo high frequency signal generation unit 24a, a subband signal power calculation unit 24b, a frequency wrapping adjustment unit 25a, a time wrapping information coding unit 25b, and a coding sequence multiplexing unit 20h.

図44は、第6の実施形態に係る音声符号化装置25の動作を示すフローチャートである。 FIG. 44 is a flowchart showing the operation of the voice coding device 25 according to the sixth embodiment.

周波数包絡調整部25aは、制御パラメータ符号化部20dで得られる高周波数信号の周波数包絡調整に必要な制御パラメータと、量子化/符号化部20fで量子化された高周波数信号に対するゲインおよびノイズ信号の大きさに基づいて、擬似高周波数信号生成部24aで生成された擬似高周波数信号の周波数包絡を調整する（ステップS25-1）。当該擬似高周波数信号の周波数包絡調整処理は、周波数包絡調整部10iにおける処理と同様に行われるが、周波数包絡調整部10iでは高周波数信号生成部10gにて生成された高周波数信号のサブバンド信号に対して行うのに対し、周波数包絡調整部25aでは擬似高周波数信号生成部24aにて生成された擬似高周波数信号のサブバンド信号に対して行う点が異なる。なお、周波数包絡調整部25aでは、演算量の削減を目的として、周波数包絡調整部10iでの処理の一部を省略できる。例えば、正弦波信号の付加の処理を省略できる。さらには、例えば、ノイズ信号の付加の処理を省略できる。この場合、ノイズ信号の大きさを調整する処理も省略できる。 The frequency wrapping adjustment unit 25a contains control parameters required for frequency wrapping adjustment of the high frequency signal obtained by the control parameter coding unit 20d, and a gain and noise signal for the high frequency signal quantized by the quantization / coding unit 20f. The frequency wrapping of the pseudo high frequency signal generated by the pseudo high frequency signal generation unit 24a is adjusted based on the magnitude of (step S25-1). The frequency wrapping adjustment processing of the pseudo high frequency signal is performed in the same manner as the processing in the frequency wrapping adjustment unit 10i, but the frequency wrapping adjustment unit 10i is a subband signal of the high frequency signal generated by the high frequency signal generation unit 10g. The difference is that the frequency wrapping adjustment unit 25a performs the operation on the subband signal of the pseudo high frequency signal generated by the pseudo high frequency signal generation unit 24a. In the frequency envelope adjusting unit 25a, a part of the processing in the frequency envelope adjusting unit 10i can be omitted for the purpose of reducing the amount of calculation. For example, the processing of adding a sine wave signal can be omitted. Further, for example, the processing of adding a noise signal can be omitted. In this case, the process of adjusting the magnitude of the noise signal can also be omitted.

時間包絡情報符号化部25bは、包絡算出部20eにて算出した高周波数信号のサブバンド信号のパワーを用いて高周波数信号の時間包絡を算出し、サブバンド信号パワー算出部24bにて算出した周波数包絡調整された擬似高周波数信号のサブバンド信号のパワーを用いて擬似高周波数信号の時間包絡を算出し、当該高周波数信号の時間包絡と擬似高周波数信号の時間包絡より時間包絡情報を符号化する（ステップS25-2）。当該処理において、高周波数信号のサブバンド信号のパワーが算出されていない場合は、時間包絡情報符号化部25bにて高周波数信号のサブバンド信号のパワーを算出でき、高周波数信号のサブバンド信号のパワーがどこで算出されるかは限定されない。 The time wrapping information coding unit 25b calculates the time wrapping of the high frequency signal using the power of the subband signal of the high frequency signal calculated by the wrapping calculation unit 20e, and calculates it by the subband signal power calculation unit 24b. The time wrapping of the quasi-high frequency signal is calculated using the power of the subband signal of the quasi-high frequency signal adjusted for frequency wrapping, and the time wrapping information is coded from the time wrapping of the high frequency signal and the time wrapping of the pseudo high frequency signal. (Step S25-2). When the power of the sub-band signal of the high-frequency signal is not calculated in the processing, the power of the sub-band signal of the high-frequency signal can be calculated by the time-wrapping information coding unit 25b, and the sub-band signal of the high-frequency signal can be calculated. There is no limit to where the power of is calculated.

例えば、任意の時間セグメントt_E(l)≦i<t_E(l+1)内でB_sim,adj,HI(m) (m=0,…,M_sim,adj,HI, M_sim,adj,HI≧1) (B_sim,adj,HI(0)≧k_x, B_sim,adj,HI(M_sim,adj,HI)<k_h)で境界を表されるM_sim,adj,HI個の周波数帯域に分割し、m番目の周波数帯域に含まれる擬似高周波数信号のサブバンド信号X_sim,adj,HI(k,i) (B_sim,adj,HI(m)≦k<B_sim,adj,HI(m+1), t_E(l)≦i<t_E(l+1))の時間包絡E_sim,adj,HI(k,i)を算出する。

擬似高周波数信号のサブバンド信号の時間包絡は、擬似高周波数信号のサブバンド信号の大きさの時間方向の変動がわかるパラメータであれば良く、前記の例に限定されない。 For example, within any time segment t _E (l) ≤ i <t _E (l + 1), B _{sim, adj, HI} (m) (m = 0,…, M _{sim, adj, HI} , M _{sim, adj , HI} ≧ 1) (B _{sim, adj, HI} (0) ≧ k _x , B _{sim, adj, HI} (M _{sim, adj, HI} ) <k _h ) M _{sim, adj, HI} Subband signal of pseudo high frequency signal included in the mth frequency band X _{sim, adj, HI} (k, i) (B _{sim, adj, HI} (m) ≤ k <B _sim, Calculate the time envelope E _{sim, adj, HI} (k, i) of _{adj, HI} (m + 1), t _E (l) ≤ i <t _E (l + 1)).

例えば、時間包絡情報符号化部20gが時間包絡情報として平坦の程度を表す情報を算出する処理において、前記低周波数信号のサブバンド信号の時間包絡の代わりに当該高周波数信号のサブバンド信号の時間包絡を用い、さらに前記コア復号信号のサブバンド信号の時間包絡の代わりに当該擬似高周波数信号のサブバンド信号の時間包絡を用いることにより、時間包絡情報として平坦の程度を表す情報を算出でき、また当該時間包絡情報を符号化できる。例えば、時間包絡の平坦の程度を平坦か否かで表現すれば1ビットで符号化でき、例えば、前記任意の時間セグメント内において前記M_sim,adj,HI個の周波数帯域毎に当該情報をM_sim,adj,HIビットで符号化できる。 For example, in a process in which the time-wrapping information coding unit 20g calculates information indicating the degree of flatness as time-wrapping information, the time of the sub-band signal of the high-frequency signal is replaced with the time-wrapping of the sub-band signal of the low-frequency signal. By using the wrapping and further using the time wrapping of the subband signal of the pseudo high frequency signal instead of the time wrapping of the subband signal of the core decoding signal, information indicating the degree of flatness can be calculated as the time wrapping information. Moreover, the time-related information can be encoded. For example, if the degree of flatness of the time envelope is expressed by whether it is flat or not, it can be encoded with 1 bit. For example, the information is M _{sim, adj, HI for} each frequency band within the arbitrary time segment. _It can be encoded with _{sim, adj, and HI} bits.

さらに、例えば、時間包絡情報符号化部20gが時間包絡情報として立ち上がりの程度を表す情報を算出する処理において、前記低周波数信号のサブバンド信号の時間包絡の代わりに当該高周波数信号のサブバンド信号の時間包絡を用い、さらに前記コア復号信号のサブバンド信号の時間包絡の代わりに当該擬似高周波数信号のサブバンド信号の時間包絡を用いることにより、時間包絡情報として立ち上がりの程度を表す情報を算出でき、また当該時間包絡情報を符号化できる。例えば、時間包絡の立ち上がりの程度を立ち上がりか否かで表現すれば1ビットで符号化でき、例えば、前記任意の時間セグメント内において前記M_sim,adj,HI個の周波数帯域毎に当該情報をM_sim,adj,HIビットで符号化できる。 Further, for example, in a process in which the time-wrapping information coding unit 20g calculates information indicating the degree of rise as time-wrapping information, the sub-band signal of the high-frequency signal is replaced with the time-wrapping of the sub-band signal of the low-frequency signal. By using the time wrapping of the above and using the time wrapping of the subband signal of the pseudo high frequency signal instead of the time wrapping of the subband signal of the core decoding signal, information indicating the degree of rise is calculated as the time wrapping information. And the time wrapping information can be encoded. For example, if the degree of rise of the time envelope is expressed by whether it rises or not, it can be encoded with 1 bit. For example, the information is M _{sim, adj, HI} in the arbitrary time segment for each frequency band of M _{sim, adj, HI.} _It can be encoded with _{sim, adj, and HI} bits.

さらに、例えば、時間包絡情報符号化部20gが時間包絡情報として立ち下がりの程度を表す情報を算出する処理において、前記低周波数信号のサブバンド信号の時間包絡の代わりに当該高周波数信号のサブバンド信号の時間包絡を用い、さらに前記コア復号信号のサブバンド信号の時間包絡の代わりに当該擬似高周波数信号のサブバンド信号の時間包絡を用いることにより、時間包絡情報として立ち下がりの程度を表す情報を算出でき、また当該時間包絡情報を符号化できる。例えば、時間包絡の立ち下がりの程度を立ち下がりか否かで表現すれば1ビットで符号化でき、例えば、前記任意の時間セグメント内において前記M_sim,adj,HI個の周波数帯域毎に当該情報をM_sim,adj,HIビットで符号化できる。 Further, for example, in a process in which the time-wrapping information coding unit 20g calculates information indicating the degree of fall as time-wrapping information, the sub-band of the high-frequency signal is replaced with the time-wrapping of the sub-band signal of the low-frequency signal. By using the time wrapping of the signal and further using the time wrapping of the subband signal of the pseudo high frequency signal instead of the time wrapping of the subband signal of the core decoding signal, information indicating the degree of falling is used as the time wrapping information. Can be calculated, and the time-related information can be encoded. For example, if the degree of the fall of the time envelope is expressed by whether or not it falls, it can be encoded with 1 bit. For example, the information can be encoded for each of the M _{sim, adj, and HI} frequency bands in the arbitrary time segment. Can be encoded with M _{sim, adj, HI} bits.

［第6の実施形態の音声復号装置の第1の変形例］
図45は、第6の実施形態に係る音声復号装置の第1の変形例15Aの構成を示す図である。 [First variant of the audio decoding device of the sixth embodiment]
FIG. 45 is a diagram showing a configuration of a first modification 15A of the audio decoding device according to the sixth embodiment.

図46は、第6の実施形態に係る音声復号装置の第1の変形例15Aの動作を示すフローチャートである。 FIG. 46 is a flowchart showing the operation of the first modification 15A of the audio decoding device according to the sixth embodiment.

本変形例においては、周波数包絡調整部10iは高周波数信号を構成する成分のうち少なくとも一つ以上を分離して出力する。例えば、高周波数信号を構成する成分は、低周波数信号より生成された高周波数信号成分、ノイズ信号成分、正弦波信号成分である。 In this modification, the frequency envelope adjusting unit 10i separates and outputs at least one or more of the components constituting the high frequency signal. For example, the components constituting the high frequency signal are a high frequency signal component, a noise signal component, and a sinusoidal signal component generated from the low frequency signal.

時間包絡修正部15aAは、高周波数時間包絡形状決定部13aで決定した時間包絡形状に基づいて、周波数包絡調整部10iより分離した形で出力された高周波数信号を構成する成分のうち少なくとも一つ以上の時間包絡形状を修正し、時間包絡形状を修正された成分を含む高周波数信号の各成分から高周波数信号を合成する（ステップS15-1a）。 The time envelope correction unit 15aA is at least one of the components constituting the high frequency signal output in a form separated from the frequency envelope adjustment unit 10i based on the time envelope shape determined by the high frequency time envelope shape determination unit 13a. The above time envelope shape is corrected, and a high frequency signal is synthesized from each component of the high frequency signal including the component whose time envelope shape is corrected (step S15-1a).

例えば、周波数包絡調整部10iより分離した形で出力された高周波数信号のうち任意の成分の信号のサブバンド信号X_shp,dj,HI(k,i) (B_shp,adj,HI(m)≦k<B_shp,adj,HI(m+1), t_E(l)≦i<t_E(l+1))に対して、所定の関数F(X_shp,adj,HI(k,i))を用いて以下の式（３９）

により、前記高周波数信号のうち任意の成分の信号のサブバンド信号X_shp,dj,HI(k,i)の時間包絡形状を修正した成分のサブバンド信号X’_shp,adj,HI(k,i)を得る。そして、当該時間包絡形状を修正した成分のサブバンド信号と時間包絡形状の修正が施されない他の成分の信号とで高周波数信号を合成し、高周波数信号を出力する。 For example, the subband signal of any component of the high frequency signal output from the frequency wrapping adjustment unit 10i X _{shp, dj, HI} (k, i) (B _{shp, adj, HI} (m)) For _≤k <B _{shp, adj, HI} (m + 1), t _E (l) ≤i <t _E (l + 1)), the given function F (X _{shp, adj, HI} (k, i) )) And the following equation (39)

_Therefore, the sub-band signal _{X'shp, adj, HI} (k,) of the component whose time-envelope shape is corrected of the sub-band signal X _{shp, dj, HI} (k, i) of the signal of any component among the high frequency signals i) get. Then, a high frequency signal is synthesized with a subband signal of the component whose time envelope shape is corrected and a signal of another component whose time envelope shape is not corrected, and a high frequency signal is output.

なお、時間包絡形状が修正される成分が複数の場合、それぞれまたはそのうちの一部は異なる時間包絡形状に修正できる。さらに、時間包絡形状が修正される成分の信号は複数の成分の信号の和の信号とすることができ、例えば低周波数信号より生成された高周波数信号成分とノイズ信号成分の和とすることができる。 When there are a plurality of components whose time envelope shape is modified, each or a part of them can be modified to have different time envelope shapes. Further, the signal of the component whose time envelope shape is modified can be the sum signal of the signals of a plurality of components, for example, the sum of the high frequency signal component and the noise signal component generated from the low frequency signal. it can.

なお、本変形例に係る音声復号装置15Aの高周波数時間包絡形状決定部13aに対して、本発明の第4の実施形態の音声復号装置の第1、第2、及び第3の変形例、及び本発明第5の実施形態の音声復号装置の第1の変形例が適用できることは明白である。 It should be noted that the first, second, and third modifications of the voice decoding device according to the fourth embodiment of the present invention are provided with respect to the high frequency time wrapping shape determining unit 13a of the voice decoding device 15A according to the present modification. And it is clear that the first modification of the voice decoding apparatus of the fifth embodiment of the present invention can be applied.

［第7の実施形態］
図47は、第7の実施形態に係る音声復号装置16の構成を示す図である。音声復号装置16の通信装置は、下記音声符号化装置26から出力される多重化された符号化系列を受信し、更に、復号した音声信号を外部に出力する。音声復号装置16は、図47に示すように、機能的には、符号化系列逆多重化部10a、コア復号部10b、分析フィルタバンク部10c、符号化系列解析部13c、低周波数時間包絡形状決定部10e、低周波数時間包絡修正部10f、高周波数時間包絡形状決定部13a、時間包絡修正部13b、高周波数信号生成部10g、復号/逆量子化部10h、周波数包絡調整部10i、及び合成フィルタバンク部10jを備える。 [7th Embodiment]
FIG. 47 is a diagram showing a configuration of the audio decoding device 16 according to the seventh embodiment. The communication device of the voice decoding device 16 receives the multiplexed coding sequence output from the following voice coding device 26, and further outputs the decoded voice signal to the outside. As shown in FIG. 47, the voice decoding device 16 functionally has a coded sequence demultiplexing section 10a, a core decoding section 10b, an analysis filter bank section 10c, a coded sequence analysis section 13c, and a low frequency time wrapping shape. Determination unit 10e, low frequency time wrapping correction unit 10f, high frequency time wrapping shape determination unit 13a, time wrapping correction unit 13b, high frequency signal generation unit 10g, decoding / inverse quantization unit 10h, frequency wrapping adjustment unit 10i, and synthesis It has a filter bank section 10j.

図48は、第7の実施形態に係る音声復号装置の動作を示すフローチャートである。 FIG. 48 is a flowchart showing the operation of the voice decoding device according to the seventh embodiment.

なお、本実施形態に係る音声復号装置16の低周波数時間包絡形状決定部10eに対して、本発明の第1の実施形態の音声復号装置の第1、第2、及び第3の変形例が適用できることは明白である。 It should be noted that the first, second, and third modifications of the voice decoding device of the first embodiment of the present invention are provided with respect to the low frequency time envelope shape determining unit 10e of the voice decoding device 16 according to the present invention. It is clear that it is applicable.

さらには、本実施形態に係る音声復号装置16の高周波数時間包絡形状決定部13aに対して、本発明の第4の実施形態の音声復号装置の第1、第2、及び第3の変形例が適用できることは明白である。 Further, with respect to the high frequency time envelope shape determining unit 13a of the voice decoding device 16 according to the present embodiment, the first, second, and third modifications of the voice decoding device of the fourth embodiment of the present invention are made. Is clearly applicable.

図49は、第7の実施形態に係る音声符号化装置26の構成を示す図である。音声符号化装置26の通信装置は、符号化の対象となる音声信号を外部から受信し、更に、符号化された符号化系列を外部に出力する。音声符号化装置26は、図49に示すように、機能的には、ダウンサンプリング部20a、コア符号化部20b、分析フィルタバンク部20c及び20c1、制御パラメータ符号化部20d、包絡算出部20e、量子化/符号化部20f、コア復号信号生成部20i、サブバンド信号パワー算出部20j、時間包絡情報符号化部26a、及び符号化系列多重化部20hを備える。 FIG. 49 is a diagram showing the configuration of the voice coding device 26 according to the seventh embodiment. The communication device of the voice coding device 26 receives the voice signal to be coded from the outside, and further outputs the coded coding sequence to the outside. As shown in FIG. 49, the voice coding device 26 functionally includes a downsampling unit 20a, a core coding unit 20b, an analysis filter bank unit 20c and 20c1, a control parameter coding unit 20d, and an entrapment calculation unit 20e. It includes a quantization / coding unit 20f, a core decoding signal generation unit 20i, a subband signal power calculation unit 20j, a time wrapping information coding unit 26a, and a coding sequence multiplexing unit 20h.

図50は、第7の実施形態に係る音声符号化装置26の動作を示すフローチャートである。 FIG. 50 is a flowchart showing the operation of the voice coding device 26 according to the seventh embodiment.

時間包絡情報符号化部26aは、低周波数信号の時間包絡と高周波数信号の時間包絡のうち少なくとも一つ以上を算出し、さらに前記サブバンド信号パワー算出部20jにて算出されたコア復号信号のサブバンド信号のパワーを用いてコア復号信号の時間包絡を算出し、当該低周波数信号の時間包絡及び高周波数信号の時間包絡のうち少なくとも一つ以上とコア復号信号の時間包絡より時間包絡情報を符号化する（ステップS26-1）。 The time wrapping information coding unit 26a calculates at least one of the time wrapping of the low frequency signal and the time wrapping of the high frequency signal, and further, the core decoding signal calculated by the subband signal power calculation unit 20j. The time wrapping of the core decoding signal is calculated using the power of the subband signal, and the time wrapping information is obtained from at least one of the time wrapping of the low frequency signal and the time wrapping of the high frequency signal and the time wrapping of the core decoding signal. Encode (step S26-1).

当該時間包絡情報は、低周波数時間包絡情報と高周波数時間包絡情報を含む。 The time envelope information includes low frequency time envelope information and high frequency time envelope information.

低周波数信号の時間包絡は、包絡算出部20eにて算出した低周波数信号のサブバンド信号のパワーを用いて低周波数信号の時間包絡を算出する。高周波数信号の時間包絡は、包絡算出部20eにて算出した高周波数信号のサブバンド信号のパワーを用いて高周波数信号の時間包絡を算出する。当該処理において、低周波数信号のサブバンド信号のパワーが算出されていない場合は、時間包絡情報符号化部26aにて低周波数信号のサブバンド信号のパワーを算出でき、低周波数信号のサブバンド信号のパワーがどこで算出されるかは限定されない。さらには、高周波数信号のサブバンド信号のパワーが算出されていない場合は、時間包絡情報符号化部26aにて高周波数信号のサブバンド信号のパワーを算出でき、高周波数信号のサブバンド信号のパワーがどこで算出されるかは限定されない。 For the time envelope of the low frequency signal, the time envelope of the low frequency signal is calculated by using the power of the subband signal of the low frequency signal calculated by the envelope calculation unit 20e. For the time envelope of the high frequency signal, the time envelope of the high frequency signal is calculated by using the power of the subband signal of the high frequency signal calculated by the envelope calculation unit 20e. If the power of the subband signal of the low frequency signal is not calculated in the process, the power of the subband signal of the low frequency signal can be calculated by the time wrapping information coding unit 26a, and the subband signal of the low frequency signal is calculated. There is no limit to where the power of is calculated. Furthermore, when the power of the subband signal of the high frequency signal is not calculated, the power of the subband signal of the high frequency signal can be calculated by the time wrapping information coding unit 26a, and the subband signal of the high frequency signal can be calculated. There is no limit to where the power is calculated.

例えば、時間包絡情報符号化部20gの動作と同様に低周波数時間包絡情報を算出し符号化することができ、時間包絡情報符号化部23aの動作と同様に高周波数時間包絡情報を算出し符号化することができる。当該低周波数時間包絡情報、及び高周波数時間包絡情報の算出符号化は、前記の例に限定されない。 For example, the low frequency time envelope information can be calculated and encoded in the same manner as the operation of the time envelope information coding unit 20g, and the high frequency time envelope information can be calculated and coded in the same manner as the operation of the time envelope information coding unit 23a. Can be changed. The calculation coding of the low frequency time envelope information and the high frequency time envelope information is not limited to the above example.

当該低周波数時間包絡情報と当該高周波数時間包絡情報は別々に符号化することもでき、また一緒に符号化することもでき、本発明においては低周波数時間包絡情報及び高周波数時間包絡情報の符号化の方法は限定されない。 The low frequency time envelope information and the high frequency time envelope information can be encoded separately or together. In the present invention, the codes of the low frequency time envelope information and the high frequency time envelope information can be encoded. The method of conversion is not limited.

例えば、当該低周波数時間包絡情報と当該高周波数時間包絡情報をベクトルとして扱い、ベクトル量子化により符号化することができる。さらに、例えば、当該ベクトルをエントロピー符号化することもできる。 For example, the low frequency time envelope information and the high frequency time envelope information can be treated as a vector and encoded by vector quantization. Further, for example, the vector can be entropy-encoded.

さらには、低周波数時間包絡情報と高周波数時間包絡情報を同一の時間包絡情報とすることもでき、この場合、音声復号装置16の符号化系列解析部10dからは同一の時間包絡情報が低周波数時間包絡情報及び高周波数時間包絡情報として出力される。本発明においては、低周波数時間包絡情報及び高周波数時間包絡情報の形態は限定されない。 Furthermore, the low-frequency time-envelope information and the high-frequency time-envelope information can be the same time-envelope information. In this case, the same time-envelope information is transmitted from the coded sequence analysis unit 10d of the voice decoding device 16 at a low frequency. It is output as time envelope information and high frequency time envelope information. In the present invention, the forms of the low frequency time envelope information and the high frequency time envelope information are not limited.

［第7の実施形態の音声復号装置の第1の変形例］
図51は、第7の実施形態に係る音声復号装置の第1の変形例16Aの構成を示す図である。 [First modification of the audio decoding device of the seventh embodiment]
FIG. 51 is a diagram showing a configuration of a first modification 16A of the audio decoding device according to the seventh embodiment.

図52は、第7の実施形態に係る音声復号装置の第1の変形例16Aの動作を示すフローチャートである。 FIG. 52 is a flowchart showing the operation of the first modification 16A of the audio decoding device according to the seventh embodiment.

高周波数時間包絡形状決定部16aは、符号化系列解析部13cから高周波時間包絡形状に関する情報、コア復号部10bから低周波数信号、分析フィルタバンク部10cから低周波数信号の複数のサブバンド信号、低周波数時間包絡修正部10fから時間包絡形状を修正済みの低周波数信号の複数のサブバンド信号、のうち少なくとも一つを受け取り、高周波数信号の時間包絡形状を決定する（ステップS16-1）。例えば、高周波数信号の時間包絡形状を平坦と決定するケース、高周波数信号の時間包絡形状を立ち上がりと決定するケース、高周波数信号の時間包絡形状を立ち下がりと決定するケースが挙げられる。第4の実施形態に係る音声復号装置の第3の変形例13Cの高周波数時間包絡形状決定部13aCとの相違点は、入力として低周波数時間包絡修正部10fから時間包絡形状を修正済みの低周波数信号の複数のサブバンド信号も許容される点であり、当該低周波数信号のサブバンド信号からも、分析フィルタバンク部10cからの低周波数信号のサブバンド信号と同様の方法により、高周波数時間包絡形状を決定することができる。 The high-frequency time-enclosed shape determination unit 16a includes information on the high-frequency time-enclosed shape from the coded sequence analysis unit 13c, low-frequency signals from the core decoding unit 10b, multiple sub-band signals of low-frequency signals from the analysis filter bank unit 10c, and low. At least one of a plurality of subband signals of the low frequency signal whose time wrapping shape has been corrected is received from the frequency time wrapping correction unit 10f, and the time wrapping shape of the high frequency signal is determined (step S16-1). For example, there are cases where the time envelope shape of a high frequency signal is determined to be flat, a case where the time envelope shape of a high frequency signal is determined to be rising, and a case where the time envelope shape of a high frequency signal is determined to be falling. The difference from the high frequency time wrapping shape determining unit 13aC of the third modification 13C of the voice decoding device according to the fourth embodiment is that the low frequency time wrapping shape is corrected from the low frequency time wrapping correction unit 10f as an input. A plurality of subband signals of the frequency signal are also allowed, and the subband signal of the low frequency signal is also subjected to the high frequency time by the same method as the subband signal of the low frequency signal from the analysis filter bank unit 10c. The enveloping shape can be determined.

［第7の実施形態の音声復号装置の第2の変形例］
図153は、第7の実施形態に係る音声復号装置の第2の変形例16Bの構成を示す図である。 [Second variant of the audio decoding device of the seventh embodiment]
FIG. 153 is a diagram showing a configuration of a second modification 16B of the audio decoding device according to the seventh embodiment.

図154は、第7の実施形態に係る音声復号装置の第2の変形例16Bの動作を示すフローチャートである。 FIG. 154 is a flowchart showing the operation of the second modification 16B of the audio decoding device according to the seventh embodiment.

本変形例においては、低周波数時間包絡形状決定部16bと前記低周波数時間包絡形状決定部10eCとの相違点は、決定した低周波数包絡形状を時間包絡修正部16cへも通知する点である。低周波数時間包絡形状決定部16bにおける時間包絡形状の決定は、前記の例に加えて、例えば、前記低周波数信号の周波数パワー分布に基づくこともできる。 In this modification, the difference between the low frequency time envelope shape determination unit 16b and the low frequency time envelope shape determination unit 10eC is that the determined low frequency envelope shape is also notified to the time envelope correction unit 16c. The determination of the time envelope shape in the low frequency time envelope shape determination unit 16b can be based on, for example, the frequency power distribution of the low frequency signal, in addition to the above example.

さらには、前記低周波数時間包絡形状決定部10e、10eA、及び10eBに対しても同様の変形を加えることが可能なことは明白である。 Furthermore, it is clear that similar modifications can be applied to the low frequency time envelope shape determining portions 10e, 10eA, and 10eB.

時間包絡修正部16cと前記時間包絡修正部13bとの相違点は、高周波数時間包絡形状決定部13aC（13a、13aA、13aBでもよいことは明白）から受け取る時間包絡形状と低周波数時間包絡形状決定部16bから受け取る時間包絡形状のうち少なくとも一つ以上に基づいて、分析フィルタバンク部10cから出力され高周波数信号生成部10gにて高周波数信号の生成に用いる複数のサブバンド信号の時間包絡の形状を修正する点である(S16-2)。 The difference between the time envelope correction unit 16c and the time envelope correction unit 13b is the time envelope shape and the low frequency time envelope shape determination received from the high frequency time envelope shape determination unit 13aC (clearly, 13a, 13aA, 13aB may be used). The shape of the time envelope of a plurality of subband signals output from the analysis filter bank unit 10c and used by the high frequency signal generation unit 10g to generate a high frequency signal based on at least one of the time envelope shapes received from the unit 16b. (S16-2).

例えば、低周波数時間包絡形状決定部16bから平坦であるとの時間包絡形状の情報を受け取った場合には、高周波数時間包絡形状決定部13aCから受け取る時間包絡形状によらず、分析フィルタバンク部10cから出力される複数のサブバンド信号の時間包絡の形状を平坦に修正する。更に例えば、低周波数時間包絡形状決定部16bから平坦でないとの時間包絡形状の情報を受け取った場合には、高周波数時間包絡形状決定部13aCから受け取る時間包絡形状によらず、分析フィルタバンク部10cから出力される複数のサブバンド信号の時間包絡の形状を平坦に修正しない。立ち上がり、立ち下がりの場合も同様であり、時間包絡形状は限定されない。 For example, when information on the time envelope shape that is flat is received from the low frequency time envelope shape determination unit 16b, the analysis filter bank unit 10c is not affected by the time envelope shape received from the high frequency time envelope shape determination unit 13aC. Corrects the shape of the time envelope of multiple subband signals output from. Further, for example, when information on the time envelope shape that is not flat is received from the low frequency time envelope shape determination unit 16b, the analysis filter bank unit 10c is not affected by the time envelope shape received from the high frequency time envelope shape determination unit 13aC. Does not flatten the shape of the time envelope of multiple subband signals output from. The same applies to the case of rising and falling, and the time envelope shape is not limited.

［第7の実施形態の音声復号装置の第3の変形例］
図155は、第7の実施形態に係る音声復号装置の第3の変形例16Cの構成を示す図である。 [Third variant of the audio decoding device of the seventh embodiment]
FIG. 155 is a diagram showing a configuration of a third modification 16C of the audio decoding device according to the seventh embodiment.

図156は、第7の実施形態に係る音声復号装置の第3の変形例16Cの動作を示すフローチャートである。 FIG. 156 is a flowchart showing the operation of the third modification 16C of the audio decoding device according to the seventh embodiment.

本変形例においては、高周波数時間包絡形状決定部16dと前記高周波数時間包絡形状決定部13aCとの相違点は、決定した高周波数包絡形状を低周波数時間包絡修正部16eへも通知する点である。 In this modification, the difference between the high frequency time envelope shape determination unit 16d and the high frequency time envelope shape determination unit 13aC is that the determined high frequency envelope shape is also notified to the low frequency time envelope correction unit 16e. is there.

高周波数時間包絡形状決定部16dにおける時間包絡形状の決定は、前記の例に加えて、例えば、前記低周波数信号の周波数パワー分布に基づくこともできる。更には、例えば符号化系列解析部13cから得られる高周波数信号の生成の際のフレーム長を用いることができる。例えば、フレーム長が長い場合は平坦である、フレーム長が短い場合は立ち上がりまたは立ち下がりであると決定できる。前記高周波数信号の生成の際のフレーム長の例としては、“ISO/IEC14496-3”に規定される“time border”にて境界を決められる“time segment”の長さが挙げられる。さらには、前記高周波数時間包絡形状決定部13a、13aA、及び13aBに対しても同様の変形を加えることが可能なことは明白である。 The determination of the time envelope shape in the high frequency time envelope shape determination unit 16d can be based on, for example, the frequency power distribution of the low frequency signal, in addition to the above example. Further, for example, the frame length at the time of generating the high frequency signal obtained from the coded sequence analysis unit 13c can be used. For example, if the frame length is long, it can be determined to be flat, and if the frame length is short, it can be determined to be rising or falling. An example of the frame length at the time of generating the high frequency signal is the length of the "time segment" whose boundary is determined by the "time border" defined in "ISO / IEC14496-3". Furthermore, it is clear that similar modifications can be made to the high frequency time envelope shape determining portions 13a, 13aA, and 13aB.

低周波数時間包絡修正部16eと前記低周波数時間包絡修正部10fとの相違点は、低周波数時間包絡形状決定部10eC（10e、10eA、10eBでもよいことは明白）から受け取る時間包絡形状と高周波数時間包絡形状決定部16dから受け取る時間包絡形状のうち少なくとも一つ以上に基づいて、分析フィルタバンク部10cから出力される複数のサブバンド信号の時間包絡の形状を修正する点である(S16-3)。 The difference between the low frequency time envelope correction unit 16e and the low frequency time envelope correction unit 10f is the time envelope shape and high frequency received from the low frequency time envelope shape determination unit 10eC (clearly, 10e, 10eA, 10eB may be used). The point is to modify the time envelope shape of a plurality of subband signals output from the analysis filter bank unit 10c based on at least one of the time envelope shapes received from the time envelope shape determination unit 16d (S16-3). ).

例えば、高周波数時間包絡形状決定部16dから平坦であるとの時間包絡形状の情報を受け取った場合には、低周波数時間包絡形状決定部10eCから受け取る時間包絡形状によらず、分析フィルタバンク部10cから出力される複数のサブバンド信号の時間包絡の形状を平坦に修正する。更に例えば、高周波数時間包絡形状決定部16dから平坦ではないとの時間包絡形状の情報を受け取った場合には、低周波数時間包絡形状決定部10eCから受け取る時間包絡形状によらず、分析フィルタバンク部10cから出力される複数のサブバンド信号の時間包絡の形状を平坦に修正しない。立ち上がり、立ち下がりの場合も同様であり、時間包絡形状は限定されない。 For example, when information on the time envelope shape that is flat is received from the high frequency time envelope shape determination unit 16d, the analysis filter bank unit 10c is not affected by the time envelope shape received from the low frequency time envelope shape determination unit 10eC. Corrects the shape of the time envelope of multiple subband signals output from. Further, for example, when the information of the time envelope shape that is not flat is received from the high frequency time envelope shape determination unit 16d, the analysis filter bank unit regardless of the time envelope shape received from the low frequency time envelope shape determination unit 10eC. Do not flatten the shape of the time envelope of multiple subband signals output from 10c. The same applies to the case of rising and falling, and the time envelope shape is not limited.

［第7の実施形態の音声復号装置の第4の変形例］
図157は、第7の実施形態に係る音声復号装置の第4の変形例16Dの構成を示す図である。 [Fourth variant of the audio decoding device of the seventh embodiment]
FIG. 157 is a diagram showing a configuration of a fourth modification 16D of the audio decoding device according to the seventh embodiment.

図158は、第7の実施形態に係る音声復号装置の第4の変形例16Dの動作を示すフローチャートである。 FIG. 158 is a flowchart showing the operation of the fourth modification 16D of the audio decoding device according to the seventh embodiment.

本変形例においては、前記低周波数時間包絡形状決定部16b、前記時間包絡修正部16c、前記高周波数時間包絡形状決定部16d、及び前記低周波数時間包絡修正部16eを具備する。 In this modification, the low frequency time envelope shape determination unit 16b, the time envelope correction unit 16c, the high frequency time envelope shape determination unit 16d, and the low frequency time envelope correction unit 16e are provided.

［第7の実施形態の音声復号装置の第5の変形例］
図159は、第7の実施形態に係る音声復号装置の第5の変形例16Eの構成を示す図である。 [Fifth variant of the audio decoding device of the seventh embodiment]
FIG. 159 is a diagram showing a configuration of a fifth modification 16E of the audio decoding device according to the seventh embodiment.

図160は、第7の実施形態に係る音声復号装置の第5の変形例16Eの動作を示すフローチャートである。 FIG. 160 is a flowchart showing the operation of the fifth modification 16E of the audio decoding device according to the seventh embodiment.

本変形例と前記第7の実施形態に係る音声復号装置16との相違点は、低周波数時間包絡形状決定部10e及び高周波数時間包絡形状決定部13aにかえて時間包絡形状決定部16fを具備する点である。 The difference between this modification and the voice decoding device 16 according to the seventh embodiment is that the time envelope shape determination unit 16f is provided instead of the low frequency time envelope shape determination unit 10e and the high frequency time envelope shape determination unit 13a. It is a point to do.

時間包絡形状決定部16fは、符号化系列逆多重化部10aからの低周波数時間包絡形状に関する情報、コア復号部10bからの低周波数信号、分析フィルタバンク部10cからの低周波数信号の複数のサブバンド信号、符号化系列解析部13cからの高周波数時間包絡形状に関する情報のうち少なくとも一つ以上に基づいて時間包絡形状を決定する(S16-4)。決定した時間包絡形状は、低周波数時間包絡修正部10f、時間包絡修正部13bに通知される。 The time entrapment shape determination unit 16f contains information on the low frequency time entrapment shape from the coded sequence demultiplexing unit 10a, a low frequency signal from the core decoding unit 10b, and a plurality of subs of the low frequency signal from the analysis filter bank unit 10c. The time-wrapping shape is determined based on at least one of the information on the high-frequency time-wrapping shape from the band signal and the coded sequence analysis unit 13c (S16-4). The determined time envelope shape is notified to the low frequency time envelope correction unit 10f and the time envelope correction unit 13b.

例えば、時間包絡形状として平坦と決定する。さらに例えば、時間包絡形状として立ち上がりと決定する。さらに例えば、時間包絡形状として立下りと決定する。決定される時間包絡形状は、上記の例に限定されない。 For example, the time-envelope shape is determined to be flat. Further, for example, it is determined to rise as a time envelope shape. Further, for example, the time-envelope shape is determined to be falling. The time envelope shape determined is not limited to the above example.

時間包絡形状決定部16fでは、例えば、前記低周波数時間包絡形状決定部10e、10eA、10eB、10eC、及び16b、前記高周波数時間包絡形状決定部13a、13aA、13aB、13aC、及び16dと同様に時間包絡形状を決定できる。時間包絡形状の決定方法は上記の例に限定されない。 In the time envelope shape determination unit 16f, for example, the same as the low frequency time envelope shape determination unit 10e, 10eA, 10eB, 10eC, and 16b, and the high frequency time envelope shape determination unit 13a, 13aA, 13aB, 13aC, and 16d. The time envelope shape can be determined. The method for determining the time envelope shape is not limited to the above example.

［第7の実施形態の音声符号化装置の第1の変形例］
図53は、第7の実施形態に係る音声符号化装置の第1の変形例26Aの構成を示す図である。 [First modification of the voice coding device of the seventh embodiment]
FIG. 53 is a diagram showing a configuration of a first modification 26A of the voice coding device according to the seventh embodiment.

図54は、第7の実施形態に係る音声符号化装置の第1の変形例26Aの動作を示すフローチャートである。 FIG. 54 is a flowchart showing the operation of the first modification 26A of the voice coding device according to the seventh embodiment.

時間包絡情報符号化部26aAは、低周波数信号の時間包絡と高周波数信号の時間包絡のうち少なくとも一つ以上を算出し、当該低周波数信号及び高周波数信号の時間包絡のうち少なくとも一つ以上より時間包絡情報を算出し符号化する（ステップS26-1a）。 The time envelope information coding unit 26aA calculates at least one of the time envelope of the low frequency signal and the time envelope of the high frequency signal, and from at least one of the time envelopes of the low frequency signal and the high frequency signal. The time envelope information is calculated and encoded (step S26-1a).

当該時間包絡情報は、低周波数時間包絡情報と高周波数時間包絡情報を含む。第7の実施形態の音声符号化装置26の時間包絡情報符号化部26aの動作と同様に、当該低周波数時間包絡情報と高周波数時間包絡情報の符号化の方法は限定されない。 The time envelope information includes low frequency time envelope information and high frequency time envelope information. Similar to the operation of the time envelope information coding unit 26a of the voice coding device 26 of the seventh embodiment, the method of encoding the low frequency time envelope information and the high frequency time envelope information is not limited.

低周波数信号の時間包絡は、包絡算出部20eにて算出した低周波数信号のサブバンド信号のパワーを用いて低周波数信号の時間包絡を算出する。 For the time envelope of the low frequency signal, the time envelope of the low frequency signal is calculated by using the power of the subband signal of the low frequency signal calculated by the envelope calculation unit 20e.

高周波数信号の時間包絡は、包絡算出部20eにて算出した高周波数信号のサブバンド信号のパワーを用いて高周波数信号の時間包絡を算出する。 For the time envelope of the high frequency signal, the time envelope of the high frequency signal is calculated by using the power of the subband signal of the high frequency signal calculated by the envelope calculation unit 20e.

当該処理において、低周波数信号のサブバンド信号のパワーが算出されていない場合は、時間包絡情報符号化部26aAにて低周波数信号のサブバンド信号のパワーを算出してもよく、低周波数信号のサブバンド信号のパワーがどこで算出されるかは限定されない。 When the power of the subband signal of the low frequency signal is not calculated in the processing, the power of the subband signal of the low frequency signal may be calculated by the time wrapping information coding unit 26aA, and the power of the subband signal of the low frequency signal may be calculated. Where the power of the subband signal is calculated is not limited.

さらには、高周波数信号のサブバンド信号のパワーが算出されていない場合は、時間包絡情報符号化部26aAにて高周波数信号のサブバンド信号のパワーを算出してもよく、高周波数信号のサブバンド信号のパワーがどこで算出されるかは限定されない。 Furthermore, when the power of the sub-band signal of the high-frequency signal has not been calculated, the power of the sub-band signal of the high-frequency signal may be calculated by the time-wrapping information coding unit 26aA, and the sub-band signal of the high-frequency signal may be calculated. There is no limit to where the power of the band signal is calculated.

例えば、時間包絡情報符号化部20gAの動作と同様に低周波数時間包絡情報を算出し符号化することができ、時間包絡情報符号化部23aAの動作と同様に高周波数時間包絡情報を算出し符号化することができる。当該低周波数時間包絡情報、及び高周波数時間包絡情報の算出符号化は、前記の例に限定されない。 For example, the low frequency time envelope information can be calculated and encoded in the same manner as the operation of the time envelope information coding unit 20gA, and the high frequency time envelope information can be calculated and coded in the same manner as the operation of the time envelope information coding unit 23aA. Can be changed. The calculation coding of the low frequency time envelope information and the high frequency time envelope information is not limited to the above example.

さらには、第7の実施形態の音声符号化装置26の時間包絡情報符号化部26aの動作と同様に、低周波数時間包絡情報と高周波数時間包絡情報を同一の時間包絡情報とすることもできる。 Further, similarly to the operation of the time envelope information coding unit 26a of the voice coding device 26 of the seventh embodiment, the low frequency time envelope information and the high frequency time envelope information can be the same time envelope information. ..

［第8の実施形態］
図55は、第8の実施形態に係る音声復号装置17の構成を示す図である。音声復号装置17の通信装置は、下記音声符号化装置27から出力される多重化された符号化系列を受信し、更に、復号した音声信号を外部に出力する。音声復号装置17は、図55に示すように、機能的には、符号化系列逆多重化部10a、コア復号部10b、分析フィルタバンク部10c、符号化系列解析部13c、低周波数時間包絡形状決定部10e、低周波数時間包絡修正部10f、高周波数信号生成部10g、高周波数時間包絡形状決定部13a、時間包絡修正部14a、復号/逆量子化部10h、周波数包絡調整部10i、及び合成フィルタバンク部10jを備える。 [8th Embodiment]
FIG. 55 is a diagram showing a configuration of the audio decoding device 17 according to the eighth embodiment. The communication device of the voice decoding device 17 receives the multiplexed coding sequence output from the following voice coding device 27, and further outputs the decoded voice signal to the outside. As shown in FIG. 55, the voice decoding device 17 functionally has a coded sequence demultiplexing section 10a, a core decoding section 10b, an analysis filter bank section 10c, a coded sequence analysis section 13c, and a low frequency time envelope shape. Determination unit 10e, low frequency time envelope correction unit 10f, high frequency signal generation unit 10g, high frequency time envelope shape determination unit 13a, time envelope correction unit 14a, decoding / inverse quantization unit 10h, frequency envelope adjustment unit 10i, and synthesis. It has a filter bank section 10j.

図56は、第8の実施形態に係る音声復号装置の動作を示すフローチャートである。 FIG. 56 is a flowchart showing the operation of the voice decoding device according to the eighth embodiment.

なお、本実施形態に係る音声復号装置17の低周波数時間包絡形状決定部10eに対して、本発明の第1の実施形態の音声復号装置の第1、第2、及び第3の変形例が適用できることは明白である。 It should be noted that the first, second, and third modifications of the voice decoding device according to the first embodiment of the present invention are provided for the low frequency time envelope shape determining unit 10e of the voice decoding device 17 according to the present invention. It is clear that it is applicable.

さらには、本実施形態に係る音声復号装置17の高周波数時間包絡形状決定部13aに対して、本発明の第4の実施形態の音声復号装置の第1、第2、及び第3の変形例、本発明第5の実施形態の音声復号装置の第1の変形例、及び本発明第7の実施形態の音声復号装置の第1の変形例が適用できることは明白である。 Further, the first, second, and third modifications of the voice decoding device according to the fourth embodiment of the present invention are provided with respect to the high frequency time wrapping shape determining unit 13a of the voice decoding device 17 according to the present invention. It is clear that the first modification of the voice decoding device according to the fifth embodiment of the present invention and the first modification of the voice decoding device according to the seventh embodiment of the present invention can be applied.

図57は、第8の実施形態に係る音声符号化装置27の構成を示す図である。音声符号化装置27の通信装置は、符号化の対象となる音声信号を外部から受信し、更に、符号化された符号化系列を外部に出力する。音声符号化装置27は、図57に示すように、機能的には、ダウンサンプリング部20a、コア符号化部20b、分析フィルタバンク部20c及び20c1、制御パラメータ符号化部20d、包絡算出部20e、量子化/符号化部20f、擬似高周波数信号生成部24a、コア復号信号生成部20i、サブバンド信号パワー算出部20j及び24b、時間包絡情報符号化部27a、及び符号化系列多重化部20hを備える。 FIG. 57 is a diagram showing a configuration of the voice coding device 27 according to the eighth embodiment. The communication device of the voice coding device 27 receives the voice signal to be coded from the outside, and further outputs the coded coding sequence to the outside. As shown in FIG. 57, the voice coding device 27 functionally includes a downsampling unit 20a, a core coding unit 20b, an analysis filter bank unit 20c and 20c1, a control parameter coding unit 20d, and an entrapment calculation unit 20e. Quantization / coding unit 20f, pseudo high frequency signal generation unit 24a, core decoding signal generation unit 20i, subband signal power calculation unit 20j and 24b, time wrapping information coding unit 27a, and coding sequence multiplexing unit 20h. Be prepared.

図58は、第8の実施形態に係る音声符号化装置27の動作を示すフローチャートである。 FIG. 58 is a flowchart showing the operation of the voice coding device 27 according to the eighth embodiment.

時間包絡情報符号化部27aは、入力音声信号の低周波数信号の時間包絡、高周波数信号の時間包絡、コア復号信号の時間包絡、擬似高周波数信号の時間包絡のうち少なくとも一つ以上を算出し、算出された時間包絡より時間包絡情報を符号化する（ステップS27-1）。 The time envelope information coding unit 27a calculates at least one or more of the time envelope of the low frequency signal of the input voice signal, the time envelope of the high frequency signal, the time envelope of the core decoded signal, and the time envelope of the pseudo high frequency signal. , The time envelope information is encoded from the calculated time envelope (step S27-1).

低周波数信号の時間包絡は、包絡算出部20eにて算出した低周波数信号のサブバンド信号のパワーを用いて低周波数信号の時間包絡を算出する。高周波数信号の時間包絡は、包絡算出部20eにて算出した高周波数信号のサブバンド信号のパワーを用いて高周波数信号の時間包絡を算出する。当該処理において、低周波数信号のサブバンド信号のパワーが算出されていない場合は、時間包絡情報符号化部27aにて低周波数信号のサブバンド信号のパワーを算出でき、低周波数信号のサブバンド信号のパワーがどこで算出されるかは限定されない。さらには、高周波数信号のサブバンド信号のパワーが算出されていない場合は、時間包絡情報符号化部27aにて高周波数信号のサブバンド信号のパワーを算出でき、高周波数信号のサブバンド信号のパワーがどこで算出されるかは限定されない。 For the time envelope of the low frequency signal, the time envelope of the low frequency signal is calculated by using the power of the subband signal of the low frequency signal calculated by the envelope calculation unit 20e. For the time envelope of the high frequency signal, the time envelope of the high frequency signal is calculated by using the power of the subband signal of the high frequency signal calculated by the envelope calculation unit 20e. When the power of the subband signal of the low frequency signal is not calculated in the processing, the power of the subband signal of the low frequency signal can be calculated by the time wrapping information coding unit 27a, and the subband signal of the low frequency signal is calculated. There is no limit to where the power of is calculated. Furthermore, when the power of the subband signal of the high frequency signal is not calculated, the power of the subband signal of the high frequency signal can be calculated by the time wrapping information coding unit 27a, and the subband signal of the high frequency signal can be calculated. There is no limit to where the power is calculated.

コア復号信号の時間包絡は、前記サブバンド信号パワー算出部20jにて算出されたコア復号信号のサブバンド信号のパワーを用いて算出する。 The time envelope of the core decoded signal is calculated by using the power of the subband signal of the core decoded signal calculated by the subband signal power calculation unit 20j.

擬似高周波数信号の時間包絡は、前記サブバンド信号パワー算出部24bにて算出された擬似高周波数信号のサブバンド信号のパワーを用いて算出する。 The time envelope of the pseudo high frequency signal is calculated by using the power of the subband signal of the pseudo high frequency signal calculated by the subband signal power calculation unit 24b.

例えば、時間包絡情報符号化部20gの動作と同様に当該低周波数信号の時間包絡情報を算出し符号化することができ、時間包絡情報符号化部24cの動作と同様に当該高周波数信号の時間包絡情報を算出し符号化することができる。 For example, the time wrapping information of the low frequency signal can be calculated and encoded in the same manner as the operation of the time wrapping information coding unit 20g, and the time of the high frequency signal can be calculated and encoded in the same manner as the operation of the time wrapping information coding unit 24c. Entrainment information can be calculated and encoded.

第7の実施形態の音声符号化装置26の時間包絡情報符号化部26aの動作と同様に、当該低周波数時間包絡情報と高周波数時間包絡情報の算出及び符号化の方法は限定されない。 Similar to the operation of the time envelope information coding unit 26a of the voice coding device 26 of the seventh embodiment, the method of calculating and encoding the low frequency time envelope information and the high frequency time envelope information is not limited.

さらには、第7の実施形態の音声符号化装置26の時間包絡情報符号化部26aと同様に、低周波数時間包絡情報と高周波数時間包絡情報を同一の時間包絡情報とすることもできる。 Further, similarly to the time-envelope information coding unit 26a of the voice coding device 26 of the seventh embodiment, the low-frequency time-envelope information and the high-frequency time-envelope information can be the same time-envelope information.

なお、本実施形態に係る音声符号化装置27に対して、本発明の第7の実施形態の音声符号化装置の第1の変形例が適用できることは明白である。 It is clear that the first modification of the voice coding device of the seventh embodiment of the present invention can be applied to the voice coding device 27 according to the present embodiment.

［第8の実施形態の音声復号装置の第1の変形例］
図161は、第8の実施形態に係る音声復号装置の第1の変形例17Aの構成を示す図である。 [First modification of the audio decoding device of the eighth embodiment]
FIG. 161 is a diagram showing a configuration of a first modification 17A of the audio decoding device according to the eighth embodiment.

図162は、第8の実施形態に係る音声復号装置の第1の変形例17Aの動作を示すフローチャートである。 FIG. 162 is a flowchart showing the operation of the first modification 17A of the audio decoding device according to the eighth embodiment.

本変形例においては、時間包絡修正部17aと前記時間包絡修正部14aとの相違点は、高周波数時間包絡形状決定部13aC（13a、13aA、13aBでもよいことは明白）から受け取る時間包絡形状と低周波数時間包絡形状決定部16bから受け取る時間包絡形状のうち少なくとも一つ以上に基づいて、高周波数信号生成部10gから出力される高周波数信号の複数のサブバンド信号の時間包絡の形状を修正する点である(S17-1)。 In this modification, the difference between the time envelope correction unit 17a and the time envelope correction unit 14a is the time envelope shape received from the high frequency time envelope shape determination unit 13aC (it is clear that 13a, 13aA, 13aB may be used). Correct the time envelope shape of a plurality of subband signals of the high frequency signal output from the high frequency signal generator 10g based on at least one of the time envelope shapes received from the low frequency time envelope shape determination unit 16b. It is a point (S17-1).

例えば、低周波数時間包絡形状決定部16bから平坦であるとの時間包絡形状の情報を受け取った場合には、高周波数時間包絡形状決定部13aCから受け取る時間包絡形状によらず、高周波数信号生成部10gから出力される複数のサブバンド信号の時間包絡の形状を平坦に修正する。更に例えば、低周波数時間包絡形状決定部16bから平坦でないとの時間包絡形状の情報を受け取った場合には、高周波数時間包絡形状決定部13aCから受け取る時間包絡形状によらず、高周波数信号生成部10gから出力される複数のサブバンド信号の時間包絡の形状を平坦に修正しない。立ち上がり、立ち下がりの場合も同様であり、時間包絡形状は限定されない。 For example, when information on the time envelope shape that is flat is received from the low frequency time envelope shape determination unit 16b, the high frequency signal generation unit is not affected by the time envelope shape received from the high frequency time envelope shape determination unit 13aC. Correct the shape of the time envelope of multiple subband signals output from 10g. Further, for example, when the information of the time envelope shape that is not flat is received from the low frequency time envelope shape determination unit 16b, the high frequency signal generation unit regardless of the time envelope shape received from the high frequency time envelope shape determination unit 13aC. Do not flatten the shape of the time envelope of multiple subband signals output from 10g. The same applies to the case of rising and falling, and the time envelope shape is not limited.

［第8の実施形態の音声復号装置の第2の変形例］
図163は、第8の実施形態に係る音声復号装置の第2の変形例17Bの構成を示す図である。 [Second variant of the audio decoding device of the eighth embodiment]
FIG. 163 is a diagram showing a configuration of a second modification 17B of the audio decoding device according to the eighth embodiment.

図164は、第8の実施形態に係る音声復号装置の第2の変形例17Bの動作を示すフローチャートである。 FIG. 164 is a flowchart showing the operation of the second modification 17B of the audio decoding device according to the eighth embodiment.

本変形例と第8の実施形態に係る音声復号装置17との相違点は、高周波数時間包絡形状決定部13aC（13a、13aA、及び13aBでもよいことは明白）、低周波数時間包絡修正部10fにかえて、高周波数時間包絡形状決定部16d、低周波数時間包絡修正部16eを具備する点である。 The difference between this modification and the audio decoding device 17 according to the eighth embodiment is the high frequency time envelope shape determining unit 13aC (it is clear that 13a, 13aA, and 13aB may be used), and the low frequency time envelope correction unit 10f. Instead, it is provided with a high frequency time envelope shape determining unit 16d and a low frequency time envelope correction unit 16e.

［第8の実施形態の音声復号装置の第3の変形例］
図165は、第8の実施形態に係る音声復号装置の第3の変形例17Cの構成を示す図である。 [Third variant of the audio decoding device of the eighth embodiment]
FIG. 165 is a diagram showing a configuration of a third modification 17C of the audio decoding device according to the eighth embodiment.

図166は、第8の実施形態に係る音声復号装置の第3の変形例17Cの動作を示すフローチャートである。 FIG. 166 is a flowchart showing the operation of the third modification 17C of the audio decoding device according to the eighth embodiment.

本変形例においては、前記低周波数時間包絡形状決定部16b、前記時間包絡修正部17a、前記高周波数時間包絡形状決定部16d、及び前記低周波数時間包絡修正部16eを具備する。 In this modification, the low frequency time envelope shape determining unit 16b, the time envelope correction unit 17a, the high frequency time envelope shape determining unit 16d, and the low frequency time envelope correction unit 16e are provided.

［第8の実施形態の音声復号装置の第4の変形例］
図167は、第8の実施形態に係る音声復号装置の第4の変形例17Dの構成を示す図である。 [Fourth variant of the audio decoding device of the eighth embodiment]
FIG. 167 is a diagram showing a configuration of a fourth modification 17D of the audio decoding device according to the eighth embodiment.

図168は、第8の実施形態に係る音声復号装置の第4の変形例17Dの動作を示すフローチャートである。 FIG. 168 is a flowchart showing the operation of the fourth modification 17D of the audio decoding device according to the eighth embodiment.

本変形例と前記第8の実施形態に係る音声復号装置17との相違点は、低周波数時間包絡形状決定部10e及び高周波数時間包絡形状決定部13aにかえて時間包絡形状決定部16fを具備する点である。 The difference between this modification and the voice decoding device 17 according to the eighth embodiment is that the time envelope shape determination unit 16f is provided instead of the low frequency time envelope shape determination unit 10e and the high frequency time envelope shape determination unit 13a. It is a point to do.

［第9の実施形態］
図59は、第9の実施形態に係る音声復号装置18の構成を示す図である。音声復号装置18の通信装置は、下記音声符号化装置28から出力される多重化された符号化系列を受信し、更に、復号した音声信号を外部に出力する。音声復号装置18は、図59に示すように、機能的には、符号化系列逆多重化部10a、コア復号部10b、分析フィルタバンク部10c、符号化系列解析部13c、低周波数時間包絡形状決定部10e、低周波数時間包絡修正部10f、高周波数信号生成部10g、復号/逆量子化部10h、周波数包絡調整部10i、高周波数時間包絡形状決定部13a、時間包絡修正部14a、及び合成フィルタバンク部10jを備える。 [9th Embodiment]
FIG. 59 is a diagram showing the configuration of the audio decoding device 18 according to the ninth embodiment. The communication device of the voice decoding device 18 receives the multiplexed coding sequence output from the following voice coding device 28, and further outputs the decoded voice signal to the outside. As shown in FIG. 59, the voice decoding device 18 functionally has a coded sequence demultiplexing section 10a, a core decoding section 10b, an analysis filter bank section 10c, a coded sequence analysis section 13c, and a low frequency time envelope shape. Determination unit 10e, low frequency time envelope correction unit 10f, high frequency signal generation unit 10g, decoding / dequantization unit 10h, frequency envelope adjustment unit 10i, high frequency time envelope shape determination unit 13a, time envelope correction unit 14a, and synthesis. It has a filter bank section 10j.

図60は、第9の実施形態に係る音声復号装置の動作を示すフローチャートである。 FIG. 60 is a flowchart showing the operation of the voice decoding device according to the ninth embodiment.

なお、本実施形態に係る音声復号装置18の低周波数時間包絡形状決定部10eに対して、本発明の第1の実施形態の音声復号装置の第1、第2、及び第3の変形例が適用できることは明白である。 It should be noted that the first, second, and third modifications of the voice decoding device according to the first embodiment of the present invention are provided with respect to the low frequency time envelope shape determining unit 10e of the voice decoding device 18 according to the present invention. It is clear that it is applicable.

さらには、本実施形態に係る音声復号装置18の高周波数時間包絡形状決定部13aに対して、本発明の第4の実施形態の音声復号装置の第1、第2、及び第3の変形例、本発明第5の実施形態の音声復号装置の第1の変形例、及び本発明第7の実施形態の音声復号装置の第1の変形例が適用できることは明白である。 Further, the first, second, and third modifications of the voice decoding device according to the fourth embodiment of the present invention are provided with respect to the high frequency time wrapping shape determining unit 13a of the voice decoding device 18 according to the present invention. It is clear that the first modification of the voice decoding device according to the fifth embodiment of the present invention and the first modification of the voice decoding device according to the seventh embodiment of the present invention can be applied.

図61は、第9の実施形態に係る音声符号化装置28の構成を示す図である。音声符号化装置28の通信装置は、符号化の対象となる音声信号を外部から受信し、更に、符号化された符号化系列を外部に出力する。音声符号化装置28は、図61に示すように、機能的には、ダウンサンプリング部20a、コア符号化部20b、分析フィルタバンク部20c及び20c1、制御パラメータ符号化部20d、包絡算出部20e、量子化/符号化部20f、擬似高周波数信号生成部24a、周波数包絡調整部25a、コア復号信号生成部20i、サブバンド信号パワー算出部20j及び24b、時間包絡情報符号化部27a、及び符号化系列多重化部20hを備える。 FIG. 61 is a diagram showing the configuration of the voice coding device 28 according to the ninth embodiment. The communication device of the voice coding device 28 receives the voice signal to be coded from the outside, and further outputs the coded coding sequence to the outside. As shown in FIG. 61, the voice coding device 28 functionally includes a downsampling unit 20a, a core coding unit 20b, an analysis filter bank unit 20c and 20c1, a control parameter coding unit 20d, and an entrapment calculation unit 20e. Quantization / coding unit 20f, pseudo high frequency signal generation unit 24a, frequency wrapping adjustment unit 25a, core decoding signal generation unit 20i, subband signal power calculation unit 20j and 24b, time wrapping information coding unit 27a, and coding It is equipped with a sequence multiplexing unit 20h.

図62は、第9の実施形態に係る音声符号化装置28の動作を示すフローチャートである。 FIG. 62 is a flowchart showing the operation of the voice coding device 28 according to the ninth embodiment.

時間包絡情報符号化部28aは、入力音声信号の低周波数信号の時間包絡、高周波数信号の時間包絡、コア復号信号の時間包絡、及び周波数包絡調整された擬似高周波数信号の時間包絡のうち少なくとも一つ以上を算出し、算出された時間包絡より時間包絡情報を符号化する（ステップS28-1）。 The time-envelope information coding unit 28a uses at least one of the time-envelope of the low-frequency signal of the input voice signal, the time-envelope of the high-frequency signal, the time-envelope of the core-decoded signal, and the time-envelope of the pseudo-high-frequency signal adjusted. One or more are calculated, and the time envelope information is encoded from the calculated time envelope (step S28-1).

低周波数信号の時間包絡は、包絡算出部20eにて算出した低周波数信号のサブバンド信号のパワーを用いて低周波数信号の時間包絡を算出する。高周波数信号の時間包絡は、包絡算出部20eにて算出した高周波数信号のサブバンド信号のパワーを用いて高周波数信号の時間包絡を算出する。当該処理において、低周波数信号のサブバンド信号のパワーが算出されていない場合は、時間包絡情報符号化部28aにて低周波数信号のサブバンド信号のパワーを算出でき、低周波数信号のサブバンド信号のパワーがどこで算出されるかは限定されない。さらには、高周波数信号のサブバンド信号のパワーが算出されていない場合は、時間包絡情報符号化部28aにて高周波数信号のサブバンド信号のパワーを算出でき、高周波数信号のサブバンド信号のパワーがどこで算出されるかは限定されない。 For the time envelope of the low frequency signal, the time envelope of the low frequency signal is calculated by using the power of the subband signal of the low frequency signal calculated by the envelope calculation unit 20e. For the time envelope of the high frequency signal, the time envelope of the high frequency signal is calculated by using the power of the subband signal of the high frequency signal calculated by the envelope calculation unit 20e. If the power of the subband signal of the low frequency signal is not calculated in the processing, the power of the subband signal of the low frequency signal can be calculated by the time wrapping information coding unit 28a, and the subband signal of the low frequency signal is calculated. There is no limit to where the power of is calculated. Furthermore, when the power of the subband signal of the high frequency signal is not calculated, the power of the subband signal of the high frequency signal can be calculated by the time wrapping information coding unit 28a, and the subband signal of the high frequency signal can be calculated. There is no limit to where the power is calculated.

コア復号信号の時間包絡は、サブバンド信号パワー算出部20jにて算出されたコア復号信号のサブバンド信号のパワーを用いて算出する。 The time envelope of the core decoded signal is calculated using the power of the subband signal of the core decoded signal calculated by the subband signal power calculation unit 20j.

周波数包絡調整された擬似高周波数信号の時間包絡は、サブバンド信号パワー算出部24bにて算出された擬似高周波数信号のサブバンド信号のパワーを用いて算出する。 The time envelope of the pseudo high frequency signal adjusted for frequency envelope is calculated by using the power of the subband signal of the pseudo high frequency signal calculated by the subband signal power calculation unit 24b.

例えば、時間包絡情報符号化部20gの動作と同様に当該低周波数信号の時間包絡情報を算出し符号化することができ、時間包絡情報符号化部25bの動作と同様に当該高周波数信号の時間包絡情報を算出し符号化することができる。 For example, the time wrapping information of the low frequency signal can be calculated and encoded in the same manner as the operation of the time wrapping information coding unit 20g, and the time of the high frequency signal can be calculated and encoded in the same manner as the operation of the time wrapping information coding unit 25b. Entrainment information can be calculated and encoded.

なお、本実施形態に係る音声符号化装置28に対して、本発明の第7の実施形態の音声符号化装置の第1の変形例が適用できることは明白である。 It is clear that the first modification of the voice coding device according to the seventh embodiment of the present invention can be applied to the voice coding device 28 according to the present embodiment.

［第9の実施形態の音声復号装置の第1の変形例］
図63は、第9の実施形態に係る音声復号装置の第1の変形例18Aの構成を示す図である。 [First variant of the audio decoding device of the ninth embodiment]
FIG. 63 is a diagram showing a configuration of a first modification 18A of the audio decoding device according to the ninth embodiment.

図64は、第9の実施形態に係る音声復号装置の第1の変形例18Aの動作を示すフローチャートである。 FIG. 64 is a flowchart showing the operation of the first modification 18A of the audio decoding device according to the ninth embodiment.

なお、本変形例に係る音声復号装置18Aの低周波数時間包絡形状決定部10eに対して、本発明の第1の実施形態の音声復号装置の第1、第2、及び第3の変形例が適用できることは明白である。 It should be noted that the first, second, and third modifications of the voice decoding device according to the first embodiment of the present invention are provided with respect to the low frequency time envelope shape determining unit 10e of the voice decoding device 18A according to the present modification. It is clear that it is applicable.

さらには、本変形例に係る音声復号装置18Aの高周波数時間包絡形状決定部13aに対して、本発明の第4の実施形態の音声復号装置の第1、第2、及び第3の変形例、本発明第5の実施形態の音声復号装置の第1の変形例、及び本発明第7の実施形態の音声復号装置の第1の変形例が適用できることは明白である。 Further, with respect to the high frequency time entrainment shape determining unit 13a of the voice decoding device 18A according to the present modification, the first, second, and third modifications of the voice decoding device according to the fourth embodiment of the present invention. It is clear that the first modification of the voice decoding device according to the fifth embodiment of the present invention and the first modification of the voice decoding device according to the seventh embodiment of the present invention can be applied.

［第9の実施形態の音声復号装置の第2の変形例］
図169は、第9の実施形態に係る音声復号装置の第2の変形例18Bの構成を示す図である。 [Second variant of the audio decoding device of the ninth embodiment]
FIG. 169 is a diagram showing a configuration of a second modification 18B of the audio decoding device according to the ninth embodiment.

図170は、第9の実施形態に係る音声復号装置の第2の変形例18Bの動作を示すフローチャートである。 FIG. 170 is a flowchart showing the operation of the second modification 18B of the audio decoding device according to the ninth embodiment.

本変形例においては、時間包絡修正部18aと前記時間包絡修正部15aとの相違点は、高周波数時間包絡形状決定部13aC（13a、13aA、13aBでもよいことは明白）から受け取る時間包絡形状と低周波数時間包絡形状決定部16bから受け取る時間包絡形状のうち少なくとも一つ以上に基づいて、周波数包絡調整部10iから出力される高周波数信号の複数のサブバンド信号の時間包絡の形状を修正する点である(S18-1)。 In this modification, the difference between the time envelope correction unit 18a and the time envelope correction unit 15a is the time envelope shape received from the high frequency time envelope shape determination unit 13aC (it is clear that 13a, 13aA, 13aB may be used). The point of correcting the time envelope shape of a plurality of subband signals of the high frequency signal output from the frequency envelope adjustment unit 10i based on at least one of the time envelope shapes received from the low frequency time envelope shape determination unit 16b. (S18-1).

例えば、低周波数時間包絡形状決定部16bから平坦であるとの時間包絡形状の情報を受け取った場合には、高周波数時間包絡形状決定部13aCから受け取る時間包絡形状によらず、周波数包絡調整部10iから出力される複数のサブバンド信号の時間包絡の形状を平坦に修正する。更に例えば、低周波数時間包絡形状決定部16bから平坦でないとの時間包絡形状の情報を受け取った場合には、高周波数時間包絡形状決定部13aCから受け取る時間包絡形状によらず、周波数包絡調整部10iから出力される複数のサブバンド信号の時間包絡の形状を平坦に修正しない。立ち上がり、立ち下がりの場合も同様であり、時間包絡形状は限定されない。 For example, when information on the time envelope shape that is flat is received from the low frequency time envelope shape determination unit 16b, the frequency envelope adjustment unit 10i is not affected by the time envelope shape received from the high frequency time envelope shape determination unit 13aC. Corrects the shape of the time envelope of multiple subband signals output from. Further, for example, when the information of the time envelope shape that is not flat is received from the low frequency time envelope shape determination unit 16b, the frequency envelope adjustment unit 10i is not affected by the time envelope shape received from the high frequency time envelope shape determination unit 13aC. Does not flatten the shape of the time envelope of multiple subband signals output from. The same applies to the case of rising and falling, and the time envelope shape is not limited.

［第9の実施形態の音声復号装置の第3の変形例］
図171は、第9の実施形態に係る音声復号装置の第3の変形例18Cの構成を示す図である。 [Third variant of the audio decoding device of the ninth embodiment]
FIG. 171 is a diagram showing a configuration of a third modification 18C of the audio decoding device according to the ninth embodiment.

図172は、第9の実施形態に係る音声復号装置の第3の変形例18Cの動作を示すフローチャートである。 FIG. 172 is a flowchart showing the operation of the third modification 18C of the audio decoding device according to the ninth embodiment.

本変形例と第9の実施形態に係る音声復号装置18との相違点は、高周波数時間包絡形状決定部13aC（13a、13aA、及び13aBでもよいことは明白）、低周波数時間包絡修正部10fにかえて、高周波数時間包絡形状決定部16d、低周波数時間包絡修正部16eを具備する点である。 The difference between this modification and the audio decoding device 18 according to the ninth embodiment is the high frequency time envelope shape determining unit 13aC (it is clear that 13a, 13aA, and 13aB may be used), and the low frequency time envelope correction unit 10f. Instead, it is provided with a high frequency time envelope shape determining unit 16d and a low frequency time envelope correction unit 16e.

［第9の実施形態の音声復号装置の第4の変形例］
図173は、第9の実施形態に係る音声復号装置の第4の変形例18Dの構成を示す図である。 [Fourth variant of the audio decoding device of the ninth embodiment]
FIG. 173 is a diagram showing a configuration of a fourth modification 18D of the audio decoding device according to the ninth embodiment.

図174は、第9の実施形態に係る音声復号装置の第4の変形例18Dの動作を示すフローチャートである。 FIG. 174 is a flowchart showing the operation of the fourth modification 18D of the audio decoding device according to the ninth embodiment.

本変形例においては、前記低周波数時間包絡形状決定部16b、前記時間包絡修正部18a、前記高周波数時間包絡形状決定部16d、及び前記低周波数時間包絡修正部16eを具備する。 In this modification, the low frequency time envelope shape determination unit 16b, the time envelope correction unit 18a, the high frequency time envelope shape determination unit 16d, and the low frequency time envelope correction unit 16e are provided.

［第9の実施形態の音声復号装置の第5の変形例］
図175は、第9の実施形態に係る音声復号装置の第5の変形例18Eの構成を示す図である。 [Fifth variant of the audio decoding device of the ninth embodiment]
FIG. 175 is a diagram showing a configuration of a fifth modification 18E of the audio decoding device according to the ninth embodiment.

図176は、第9の実施形態に係る音声復号装置の第5の変形例18Eの動作を示すフローチャートである。 FIG. 176 is a flowchart showing the operation of the fifth modification 18E of the audio decoding device according to the ninth embodiment.

本変形例と前記第9の実施形態に係る音声復号装置18との相違点は、低周波数時間包絡形状決定部10e及び高周波数時間包絡形状決定部13aにかえて時間包絡形状決定部16fを具備する点である。 The difference between this modification and the voice decoding device 18 according to the ninth embodiment is that the time envelope shape determination unit 16f is provided instead of the low frequency time envelope shape determination unit 10e and the high frequency time envelope shape determination unit 13a. It is a point to do.

［第9の実施形態の音声復号装置の第6の変形例］
図177は、第9の実施形態に係る音声復号装置の第6の変形例18Fの構成を示す図である。 [Sixth variant of the audio decoding device of the ninth embodiment]
FIG. 177 is a diagram showing a configuration of a sixth modification 18F of the audio decoding device according to the ninth embodiment.

図178は、第9の実施形態に係る音声復号装置の第6の変形例18Fの動作を示すフローチャートである。 FIG. 178 is a flowchart showing the operation of the sixth modification 18F of the voice decoding device according to the ninth embodiment.

本変形例においては、時間包絡修正部18aAと前記時間包絡修正部15aAとの相違点は、高周波数時間包絡形状決定部13aC（13a、13aA、13aBでもよいことは明白）から受け取る時間包絡形状と低周波数時間包絡形状決定部16bから受け取る時間包絡形状のうち少なくとも一つ以上に基づいて、周波数包絡調整部10iより分離した形で出力された高周波数信号を構成する成分のうち少なくとも一つ以上の時間包絡形状を修正し、時間包絡形状を修正された成分を含む高周波数信号の各成分から高周波数信号を合成し出力する点である(S18-1a)。 In this modification, the difference between the time envelope correction unit 18aA and the time envelope correction unit 15aA is the time envelope shape received from the high frequency time envelope shape determination unit 13aC (it is clear that 13a, 13aA, 13aB may be used). At least one or more of the components constituting the high frequency signal output in a form separated from the frequency envelope adjusting unit 10i based on at least one of the time envelope shapes received from the low frequency time envelope shape determining unit 16b. The point is that the time envelope shape is corrected, and the high frequency signal is synthesized and output from each component of the high frequency signal including the component whose time envelope shape is corrected (S18-1a).

例えば、低周波数時間包絡形状決定部16bから平坦であるとの時間包絡形状の情報を受け取った場合には、高周波数時間包絡形状決定部13aCから受け取る時間包絡形状によらず、周波数包絡調整部10iより分離した形で出力された高周波数信号を構成する成分のうち少なくとも一つ以上の時間包絡形状を平坦に修正する。更に例えば、低周波数時間包絡形状決定部16bから平坦でないとの時間包絡形状の情報を受け取った場合には、高周波数時間包絡形状決定部13aCから受け取る時間包絡形状によらず、周波数包絡調整部10iより分離した形で出力された高周波数信号を構成する成分のうち少なくとも一つ以上の時間包絡形状を平坦に修正しない。立ち上がり、立ち下がりの場合も同様であり、時間包絡形状は限定されない。 For example, when information on the time envelope shape that is flat is received from the low frequency time envelope shape determination unit 16b, the frequency envelope adjustment unit 10i is not affected by the time envelope shape received from the high frequency time envelope shape determination unit 13aC. At least one of the components constituting the high frequency signal output in a more separated form is corrected to flatten the time envelope shape. Further, for example, when the information of the time envelope shape that is not flat is received from the low frequency time envelope shape determination unit 16b, the frequency envelope adjustment unit 10i is not affected by the time envelope shape received from the high frequency time envelope shape determination unit 13aC. At least one of the components constituting the high frequency signal output in a more separated form does not flatten the time envelope shape. The same applies to the case of rising and falling, and the time envelope shape is not limited.

［第9の実施形態の音声復号装置の第7の変形例］
図179は、第9の実施形態に係る音声復号装置の第7の変形例18Gの構成を示す図である。 [7th variant of the audio decoding device of the 9th embodiment]
FIG. 179 is a diagram showing a configuration of a seventh modification 18G of the audio decoding device according to the ninth embodiment.

図180は、第9の実施形態に係る音声復号装置の第7の変形例18Gの動作を示すフローチャートである。 FIG. 180 is a flowchart showing the operation of the seventh modification 18G of the voice decoding device according to the ninth embodiment.

本変形例と第9の実施形態の第1の変形例に係る音声復号装置18Aとの相違点は、高周波数時間包絡形状決定部13aC（13a、13aA、及び13aBでもよいことは明白）、低周波数時間包絡修正部10fにかえて、高周波数時間包絡形状決定部16d、低周波数時間包絡修正部16eを具備する点である。 The difference between this modification and the voice decoding device 18A according to the first modification of the ninth embodiment is that the high frequency time envelope shape determining unit 13aC (it is clear that 13a, 13aA, and 13aB may be used) and low. Instead of the frequency time envelope correction unit 10f, the high frequency time envelope shape determination unit 16d and the low frequency time envelope correction unit 16e are provided.

［第9の実施形態の音声復号装置の第8の変形例］
図181は、第9の実施形態に係る音声復号装置の第8の変形例18Hの構成を示す図である。 [Eighth variant of the audio decoding device of the ninth embodiment]
FIG. 181 is a diagram showing a configuration of an eighth modification 18H of the audio decoding device according to the ninth embodiment.

図182は、第9の実施形態に係る音声復号装置の第8の変形例18Hの動作を示すフローチャートである。 FIG. 182 is a flowchart showing the operation of the eighth modification 18H of the audio decoding device according to the ninth embodiment.

本変形例においては、前記低周波数時間包絡形状決定部16b、前記時間包絡修正部18aA、前記高周波数時間包絡形状決定部16d、及び前記低周波数時間包絡修正部16eを具備する。 In this modification, the low frequency time envelope shape determination unit 16b, the time envelope correction unit 18aA, the high frequency time envelope shape determination unit 16d, and the low frequency time envelope correction unit 16e are provided.

［第9の実施形態の音声復号装置の第9の変形例］
図183は、第9の実施形態に係る音声復号装置の第9の変形例18Iの構成を示す図である。 [Ninth variant of the audio decoding device of the ninth embodiment]
FIG. 183 is a diagram showing a configuration of a ninth modification 18I of the audio decoding device according to the ninth embodiment.

図184は、第9の実施形態に係る音声復号装置の第9の変形例18Iの動作を示すフローチャートである。 FIG. 184 is a flowchart showing the operation of the ninth modification 18I of the audio decoding device according to the ninth embodiment.

本変形例と前記第9の実施形態の変形例1に係る音声復号装置18Aとの相違点は、低周波数時間包絡形状決定部10e及び高周波数時間包絡形状決定部13aにかえて時間包絡形状決定部16fを具備する点である。 The difference between this modification and the voice decoding device 18A according to the first modification of the ninth embodiment is that the time envelope shape is determined instead of the low frequency time envelope shape determination unit 10e and the high frequency time envelope shape determination unit 13a. This is a point including the part 16f.

［第10の実施形態］
図65は、第10の実施形態に係る音声復号装置1の構成を示す図である。音声復号装置1の通信装置は、下記音声符号化装置2から出力される多重化された符号化系列を受信し、更に、復号した音声信号を外部に出力する。音声復号装置1は、図65に示すように、機能的には、符号化系列解析部1a、音声復号部1b、時間包絡形状決定部1c、及び時間包絡修正部1dを備える。 [10th Embodiment]
FIG. 65 is a diagram showing a configuration of the voice decoding device 1 according to the tenth embodiment. The communication device of the voice decoding device 1 receives the multiplexed coding sequence output from the following voice coding device 2, and further outputs the decoded voice signal to the outside. As shown in FIG. 65, the voice decoding device 1 functionally includes a coding sequence analysis unit 1a, a voice decoding unit 1b, a time envelope shape determination unit 1c, and a time envelope correction unit 1d.

図66は、第10の実施形態に係る音声復号装置1の動作を示すフローチャートである。 FIG. 66 is a flowchart showing the operation of the voice decoding device 1 according to the tenth embodiment.

符号化系列解析部1aは、符号化系列を解析し、音声符号化部分と時間包絡形状に関する情報に分割する（ステップS1-1）。 The coded sequence analysis unit 1a analyzes the coded sequence and divides it into information related to the voice-coded part and the time envelope shape (step S1-1).

音声復号部1bは、符号化系列の音声符号化部分を復号し、復号信号を得る（ステップS1-2）。 The voice decoding unit 1b decodes the voice-coded portion of the coding series to obtain a decoded signal (step S1-2).

時間包絡形状決定部1cは、符号化系列解析部1aで分割された時間包絡形状に関する情報、及び音声復号部1bで得られた復号信号のうち少なくとも一つ以上に基づき、復号信号の時間包絡形状を決定する（ステップS1-3）。 The time envelope shape determination unit 1c is based on at least one of the information on the time envelope shape divided by the coded sequence analysis unit 1a and the decoding signal obtained by the voice decoding unit 1b, and the time envelope shape of the decoding signal. (Step S1-3).

例えば、復号信号の時間包絡形状を平坦と決定する。例えば、復号信号のパワーまたはそれに準ずるパラメータを算出し、当該パラメータの分散またはそれに準ずるパラメータを算出する。算出したパラメータと所定の閾値とを比較して時間包絡形状が平坦か否かまたは平坦さの程度を決定する。さらに別の例では、復号信号のパワーまたはそれに準ずるパラメータの相加平均と相乗平均の比またはそれに準ずるパラメータを算出し、所定の閾値とを比較して時間包絡形状が平坦か否かまたは平坦さの程度を決定する。復号信号の時間包絡形状を平坦と決定する方法は上記の例に限定されない。 For example, the time envelope shape of the decoded signal is determined to be flat. For example, the power of the decoded signal or a parameter equivalent thereto is calculated, and the variance of the parameter or a parameter equivalent thereto is calculated. The calculated parameters are compared with a predetermined threshold value to determine whether or not the time envelope shape is flat or the degree of flatness. In yet another example, the ratio of the arithmetic mean to the geometric mean of the power of the decoded signal or its equivalent parameters or the equivalent parameters is calculated and compared with a predetermined threshold to determine whether the time envelope shape is flat or flat. Determine the degree of. The method for determining the time-envelope shape of the decoded signal as flat is not limited to the above example.

さらに、例えば、復号信号の時間包絡形状を立ち上がりと決定する。例えば、復号信号のパワーまたはそれに準ずるパラメータを算出し、当該パラメータの時間方向の差分値を算出し、当該差分値の任意の時間セグメント内の最大値を算出する。当該最大値と所定の閾値とを比較して、時間包絡形状が立ち上がりか否かまたは立ち上がりの程度を決定する。復号信号の時間包絡形状を立ち上がりと決定する方法は上記の例に限定されない。 Further, for example, the time-envelope shape of the decoded signal is determined to be the rising edge. For example, the power of the decoded signal or a parameter equivalent thereto is calculated, the difference value of the parameter in the time direction is calculated, and the maximum value of the difference value in an arbitrary time segment is calculated. The maximum value is compared with a predetermined threshold value to determine whether or not the time envelope shape rises or the degree of rise. The method for determining the time-envelope shape of the decoded signal as the rising edge is not limited to the above example.

さらに、例えば、低周波数信号の時間包絡形状を立ち下がりと決定する。例えば、復号信号のパワーまたはそれに準ずるパラメータを算出し、当該パラメータの時間方向の差分値を算出し、当該差分値の任意の時間セグメント内の最小値を算出する。当該最小値と所定の閾値とを比較して、時間包絡形状が立ち下がりか否かまたは立ち下がりの程度を決定する。復号数信号の時間包絡形状を立ち下がりと決定する方法は上記の例に限定されない。 Further, for example, the time-envelope shape of the low frequency signal is determined to be falling. For example, the power of the decoded signal or a parameter equivalent thereto is calculated, the difference value of the parameter in the time direction is calculated, and the minimum value of the difference value in an arbitrary time segment is calculated. The minimum value is compared with a predetermined threshold value to determine whether or not the time envelope shape falls or the degree of the fall. The method of determining the time-envelope shape of the decoded number signal as the falling edge is not limited to the above example.

上記の例は、音声復号部1bより、当該復号信号が時間領域の信号として出力されても適用でき、当該復号信号が複数のサブバンド信号として出力されても適用できる。 The above example can be applied even if the decoding signal is output as a time domain signal from the audio decoding unit 1b, and can be applied even if the decoding signal is output as a plurality of subband signals.

時間包絡修正部1dは、時間包絡形状決定部1cで決定した時間包絡形状に基づいて、音声復号部1bから出力される復号信号の時間包絡の形状を修正する（ステップS1-4）。 The time envelope correction unit 1d corrects the time envelope shape of the decoded signal output from the voice decoding unit 1b based on the time envelope shape determined by the time envelope shape determination unit 1c (step S1-4).

例えば、前記復号信号が複数のサブバンド信号で表される場合、時間包絡修正部1dは、任意の時間セグメント内の前記復号信号の複数のサブバンド信号X_dec(k,i) (0≦k<k_h, t(l)≦i<t(l+1))に対して、所定の関数F(X_dec(k,i))を用いて以下の式（４０）

により得られるX’_dec(k,i)を時間包絡形状が修正された復号信号のサブバンド信号として算出し，当該サブバンド信号より時間領域の信号を合成して出力する。 For example, when the decoded signal is represented by a plurality of subband signals, the time envelope correction unit 1d may use the plurality of subband signals X _dec (k, i) (0 ≦ k) of the decoded signal in an arbitrary time segment. For <k _h , t (l) ≤ i <t (l + 1)), the following equation (40) is used using the predetermined function F (X _dec (k, i)).

_X'dec (k, i) obtained by is calculated as a subband signal of the decoded signal whose time envelope shape is corrected, and the signal in the time domain is synthesized and output from the subband signal.

例えば、前記復号信号の時間包絡形状が平坦と決定された場合、以下の処理により、復号信号の時間包絡形状を修正できる。例えば、当該サブバンド信号X_dec(k,i)をB_dec(m) (m=0,…,M_dec, M_dec≧1) (B_dec(0)≧0, B_dec(M_dec)<k_h)で境界を表されるM_dec個の周波数帯域に分割し、m番目の周波数帯域に含まれるサブバンド信号X_dec(k,i) (B_dec(m)≦k<B_dec(m+1), t(l)≦i<t(l+1))に対して、所定の関数F(X_dec(k,i))を、

として、X’_dec(k,i)を時間包絡形状が修正された復号信号のサブバンド信号として算出する。
また別の例によれば、所定の関数F(X_dec(k,i))をサブバンド信号X_dec(k,i)に対して平滑化フィルタ処理を施す

(N_filt≧1)で定義して、X’_dec(k,i)を時間包絡形状が修正された復号信号のサブバンド信号として算出する。さらに、前記B_dec(m)を用いて境界が表される各周波数帯域内で、フィルタ処理前後のサブバンド信号のパワーをあわせるように処理できる。
また別の例によれば、サブバンド信号をX_dec(k,i)を前記B_dec(m)を用いて境界が表される各周波数帯域内で周波数方向に線形予測して線形予測係数α_p(m) (m=0,…,M_dec-1)を得て、所定の関数F(X_dec(k,i))をサブバンド信号X_dec(k,i)に対して線形予測逆フィルタ処理を施す

(N_pred≧1)で定義して、X’_dec(k,i)を時間包絡形状が修正された復号信号のサブバンド信号として算出する。 For example, when the time envelope shape of the decoded signal is determined to be flat, the time envelope shape of the decoded signal can be corrected by the following processing. For example, the subband signal X _dec (k, i) is set to B _dec (m) (m = 0,…, M _dec , M _dec ≧ 1) (B _dec (0) ≧ 0, B _dec (M _dec ) < The subband signal X _dec (k, i) (B _dec (m) ≤ k <B _dec (m), which is divided into M _dec frequency bands whose boundaries are represented by k _h ) and is included in the mth frequency band. For +1), t (l) ≤ i <t (l + 1)), the predetermined function F (X _dec (k, i)) is

As, _X'dec (k, i) is calculated as a subband signal of the decoded signal with the time-envelope shape corrected.
According to another example, a predetermined function F (X _dec (k, i)) is subjected to smoothing filtering on the subband signal X _dec (k, i).

Defined by (N _filt ≧ 1), _X'dec (k, i) is calculated as a subband signal of the decoded signal with the corrected time envelope shape. Further, the B _dec (m) can be used to match the powers of the subband signals before and after the filtering within each frequency band whose boundary is represented.
According to another example, the subband signal is linearly predicted in the frequency direction within each frequency band whose boundary is represented by X _dec (k, i) using the B _dec (m), and the linear prediction coefficient α. Obtaining _p (m) (m = 0,…, M _dec -1), linear prediction inverse of the given function F (X _dec (k, i)) with respect to the subband signal X _dec (k, i) Apply filtering

Defined by (N _pred ≥ 1), _X'dec (k, i) is calculated as a subband signal of the decoded signal with the corrected time envelope shape.

上記の時間包絡形状を平坦に修正する処理の例は、それぞれを組み合わせて実施できる。 The above-mentioned example of the process of correcting the time envelope shape flatly can be carried out in combination with each other.

時間包絡修正部1dは、復号信号の時間包絡の形状を平坦に修正する処理を実施し、上記の例に限定されない。 The time envelope correction unit 1d performs a process of flattening the shape of the time envelope of the decoded signal, and is not limited to the above example.

さらには、例えば、前記復号信号の時間包絡形状が立ち上がりと決定された場合、以下の処理により、復号信号の時間包絡形状を修正できる。
例えば、所定の関数F(X_dec(k,i))をiに対して単調増加する関数incr(i)を用いて

で定義して、X’_dec(k,i)を時間包絡形状が修正された復号信号のサブバンド信号として算出する。さらに、前記B_dec(m)を用いて境界が表される各周波数帯域内で、時間包絡形状の修正前後のサブバンド信号のパワーをあわせるように処理できる。 Further, for example, when the time-envelope shape of the decoded signal is determined to be rising, the time-envelope shape of the decoded signal can be corrected by the following processing.
For example, using the function incr (i) that monotonically increases a given function F (X _dec (k, i)) with respect to i.

Defined in, _X'dec (k, i) is calculated as a subband signal of the decoded signal with the corrected time envelope shape. Further, it is possible to process so as to match the powers of the subband signals before and after the correction of the time envelope shape within each frequency band whose boundary is represented by using the B _dec (m).

時間包絡修正部1dは、復号信号の複数のサブバンド信号の時間包絡の形状を立ち上がりに修正する処理を実施し、上記の例に限定されない。 The time envelope correction unit 1d performs a process of correcting the shape of the time envelope of a plurality of subband signals of the decoded signal at the rising edge, and is not limited to the above example.

さらには、例えば、前記復号信号の時間包絡形状が立ち下がりと決定された場合、以下の処理により、復号信号の時間包絡形状を修正できる。
例えば、所定の関数F(X_dec(k,i))をiに対して単調減少する関数decr(i)を用いて

で定義して、X’_dec(k,i)を時間包絡形状が修正された低周波数信号のサブバンド信号として算出する。さらに、前記B_dec(m)を用いて境界が表される各周波数帯域内で、時間包絡形状の修正前後のサブバンド信号のパワーをあわせるように処理できる。 Further, for example, when the time-envelope shape of the decoded signal is determined to be falling, the time-envelope shape of the decoded signal can be corrected by the following processing.
For example, using the function decr (i) that monotonically decreases a given function F (X _dec (k, i)) with respect to i.

Defined in, _X'dec (k, i) is calculated as a subband signal of the low frequency signal with the corrected time envelope shape. Further, it is possible to process so as to match the powers of the subband signals before and after the correction of the time envelope shape within each frequency band whose boundary is represented by using the B _dec (m).

時間包絡修正部1dは、復号信号の複数のサブバンド信号の時間包絡の形状を立ち下がりに修正する処理を実施し、上記の例に限定されない。 The time envelope correction unit 1d performs a process of correcting the shape of the time envelope of a plurality of subband signals of the decoded signal so as to fall, and is not limited to the above example.

例えば、前記復号信号が時間領域の信号で表される場合、時間包絡修正部1dは、任意の時間セグメント内の前記復号信号x_dec(i) (t(l)≦i<t(l+1))に対して、所定の関数F_t(x_dec(i))を用いて

により得られるx’_dec(i)を時間包絡形状が修正された復号信号として出力する。 For example, when the decoding signal is represented by a signal in the time domain, the time envelope correction unit 1d may use the decoding signal x _dec (i) (t (l) ≤ i <t (l + 1) in an arbitrary time segment. )) Using the given function F _t (x _dec (i))

The _x'dec (i) obtained by is output as a decoding signal with the time-envelope shape corrected.

例えば、前記復号信号の時間包絡形状が平坦と決定された場合、以下の処理により、復号信号の時間包絡形状を修正できる。
例えば、当該復号信号x_dec(i)に対して、所定の関数F_t(x_dec(i))を、

として、x’_dec(i)を時間包絡形状が修正された復号信号として出力する。 For example, when the time envelope shape of the decoded signal is determined to be flat, the time envelope shape of the decoded signal can be corrected by the following processing.
For example, for the decoding signal x _dec (i), a predetermined function F _t (x _dec (i)) is applied.

As, _x'dec (i) is output as a decoding signal with the time-envelope shape corrected.

また別の例によれば、所定の関数F_t(x_dec(i))を復号信号x_dec(i)に対して平滑化フィルタ処理を施す

(N_filt≧1)で定義して、x’_dec(i)を時間包絡形状が修正された復号信号として出力する。 According to another example, the predetermined function F _t (x _dec (i)) is subjected to smoothing filtering on the decoding signal x _dec (i).

Defined by (N _filt ≥ 1), _x'dec (i) is output as a decoded signal with the corrected time envelope shape.

さらには、例えば、前記復号信号の時間包絡形状が立ち上がりと決定された場合、以下の処理により、復号信号の時間包絡形状を修正できる。
例えば、所定の関数F_t(x_dec(i))を、iに対して単調増加する関数incr(i)を用いて

で定義して、x’_dec(i)を時間包絡形状が修正された復号信号として出力する。 Further, for example, when the time-envelope shape of the decoded signal is determined to be rising, the time-envelope shape of the decoded signal can be corrected by the following processing.
For example, using a function incr (i) that monotonically increases a given function F _t (x _dec (i)) with respect to i.

Defined in, _x'dec (i) is output as a decoded signal with the time-envelope shape corrected.

時間包絡修正部1dは、復号信号の時間包絡の形状を立ち上がりに修正する処理を実施し、上記の例に限定されない。 The time envelope correction unit 1d performs a process of correcting the shape of the time envelope of the decoded signal at the rising edge, and is not limited to the above example.

さらには、例えば、前記復号信号の時間包絡形状が立ち下がりと決定された場合、以下の処理により、復号信号の時間包絡形状を修正できる。
例えば、所定の関数F_t(x_dec(i))を、iに対して単調減少する関数decr(i)を用いて

で定義して、x’_dec(i)を時間包絡形状が修正された復号信号として出力する。時間包絡修正部1dは、復号信号の時間包絡の形状を立ち下がりに修正する処理を実施し、上記の例に限定されない。 Further, for example, when the time-envelope shape of the decoded signal is determined to be falling, the time-envelope shape of the decoded signal can be corrected by the following processing.
For example, using the function decr (i) that monotonically decreases a given function F _t (x _dec (i)) with respect to i.

Defined in, _x'dec (i) is output as a decoded signal with the time-envelope shape corrected. The time envelope correction unit 1d performs a process of correcting the shape of the time envelope of the decoded signal to a falling edge, and is not limited to the above example.

例えば、前記復号信号が離散フーリエ変換，離散コサイン変換，修正離散コサイン変換に代表される時間周波数変換による周波数領域の変換係数X_dec(k) (0≦k<k_h)で表されたときは、所定の関数F_f(X_dec(k))を用いて以下の式（５１）

により得られるX’_dec(k)を時間包絡形状が修正された復号信号の周波数領域の変換係数として算出し、所定の周波数間変換により時間領域の信号に変換して出力する。 For example, when the decoded signal is represented by the conversion coefficient X _dec (k) (0 ≤ k <k _h ) of the frequency domain by the time-frequency conversion represented by the discrete Fourier transform, the discrete cosine transform, and the modified discrete cosine transform. , Using the given function F _f (X _dec (k)), the following equation (51)

_X'dec (k) obtained in the above is calculated as a conversion coefficient in the frequency domain of the decoded signal whose time envelope shape is corrected, and is converted into a signal in the time domain by a predetermined inter-frequency conversion and output.

例えば、前記復号信号の時間包絡形状が平坦と決定された場合、以下の処理により、復号信号の時間包絡形状を修正できる。
B_dec(m) (m=0,…,M_dec, M_dec≧1) (B_dec(0)≧0, B_dec(M_dec)<k_h)で境界を表されるM_dec個の任意の周波数帯域B_dec(m)をにおいて、周波数方向に線形予測して線形予測係数α_p(m) (m=0,…,M_dec-1)を得て、所定の関数F_f(X_dec(k))を、変換係数X_dec(k)に対して線形予測逆フィルタ処理を施す

(N_pred≧1)で定義して、X’_dec(k,i)を時間包絡形状が修正された復号信号の変換係数として算出する。 For example, when the time envelope shape of the decoded signal is determined to be flat, the time envelope shape of the decoded signal can be corrected by the following processing.
_{B dec (m) (m =} 0, ..., M dec, M dec ≧ 1) (B dec (0) ≧ 0, B dec (M dec) <k h) M dec pieces of any expressed bounded by In the frequency band B _dec (m) of, linearly predict in the frequency direction to obtain the linear prediction coefficient α _p (m) (m = 0,…, M _dec -1), and obtain the predetermined function F _f (X _dec). (k)) is subjected to linear prediction inverse filtering on the conversion coefficient X _dec (k).

Defined by (N _pred ≥ 1), _X'dec (k, i) is calculated as the conversion coefficient of the decoded signal with the corrected time envelope shape.

図67は、第10の実施形態に係る音声符号化装置2の構成を示す図である。音声符号化装置2の通信装置は、符号化の対象となる音声信号を外部から受信し、更に、符号化された符号化系列を外部に出力する。音声符号化装置2は、図67に示すように、機能的には、音声符号化部2a、時間包絡情報符号化部2b、及び符号化系列多重化部2cを備える。 FIG. 67 is a diagram showing a configuration of the voice coding device 2 according to the tenth embodiment. The communication device of the voice coding device 2 receives the voice signal to be coded from the outside, and further outputs the coded coding sequence to the outside. As shown in FIG. 67, the voice coding device 2 functionally includes a voice coding unit 2a, a time-envelope information coding unit 2b, and a coding sequence multiplexing unit 2c.

図68は、第10の実施形態に係る音声符号化装置2の動作を示すフローチャートである。 FIG. 68 is a flowchart showing the operation of the voice coding device 2 according to the tenth embodiment.

音声符号化部2aは、入力音声信号を符号化する（ステップS2-1）。 The voice coding unit 2a encodes the input voice signal (step S2-1).

時間包絡情報符号化部2bは、入力音声信号、音声符号化部2aにおける入力音声信号の符号化結果を含む符号化過程で得られた情報のうち少なくとも一つ以上に基づき、時間包絡情報を算出し符号化する（ステップS2-2）。 The time wrapping information coding unit 2b calculates the time wrapping information based on at least one of the information obtained in the coding process including the input voice signal and the coding result of the input voice signal in the voice coding unit 2a. And code (step S2-2).

例えば、任意の時間セグメントt(l)≦i<t(l+1))内の時間領域の信号である前記入力音声信号x(i)の時間包絡E_t(i)を、当該時間セグメント内で正規化した復号信号のパワーとして算出できる。

さらに、例えば、音声符号化部2aにおいて前記入力音声信号が複数のサブバンドの信号X(k,i)が算出される場合、入力音声信号の時間包絡として、任意の時間セグメントt(l)≦i<t(l+1))内でB(m) (m=0,…,M, M≧1) (B(0)≧0, B(M)<k_h)で境界を表されるM個の周波数帯域に分割され、m番目の周波数帯域に含まれる当該入力音声信号のサブバンド信号X(k,i) (B(m)≦k<B(m+1), t(l)≦i<t(l+1))の時間包絡E(k,i)を、当該時間セグメント内で正規化した入力音声信号のサブバンド信号のパワーとして算出できる。

入力音声信号の時間包絡は、入力音声信号の大きさの時間方向の変動がわかるパラメータであれば良く、前記の例に限定されない。 For example, the time envelope E _t (i) of the input audio signal x (i), which is a signal in the time domain within an arbitrary time segment t (l) ≤ i <t (l + 1)), is set in the time segment. It can be calculated as the power of the decoded signal normalized by.

Further, for example, when the audio coding unit 2a calculates signals X (k, i) of a plurality of subbands of the input audio signal, an arbitrary time segment t (l) ≤ is used as the time wrapping of the input audio signal. Within i <t (l + 1)), the boundary is represented by B (m) (m = 0,…, M, M ≧ 1) (B (0) ≧ 0, B (M) <k _h ). Subband signal X (k, i) (B (m) ≤ k <B (m + 1), t (l) of the input audio signal divided into M frequency bands and included in the mth frequency band The time wrapping E (k, i) of ≤i <t (l + 1)) can be calculated as the power of the subband signal of the input audio signal normalized within the time segment.

The time envelope of the input audio signal is not limited to the above example, as long as it is a parameter that shows the fluctuation of the magnitude of the input audio signal in the time direction.

さらに、例えば、音声符号化部2aにおける前記入力音声信号の符号化結果に基づいて復号信号x_dec(i)を算出し、任意の時間セグメントt(l)≦i<t(l+1))内の当該復号信号x_dec(i)の時間包絡E_dec,t(i)を、当該時間セグメント内で正規化した復号信号のパワーとして算出できる。

さらに、例えば、音声符号化部2aにおける前記入力音声信号の符号化過程で、または符号化結果に基づいて復号信号のサブバンド信号X_dec(k,i)が算出される場合、復号信号の時間包絡として、任意の時間セグメントt(l)≦i<t(l+1))内でB(m) (m=0,…,M, M≧1) (B(0)≧0, B(M)<k_h)で境界を表されるM個の周波数帯域に分割され、m番目の周波数帯域に含まれる当該入力音声信号のサブバンド信号X_dec(k,i) (B(m)≦k<B(m+1), t(l)≦i<t(l+1))の時間包絡E_dec(k,i)を、当該時間セグメント内で正規化した入力音声信号のサブバンド信号のパワーとして算出できる。

例えば、時間包絡情報符号化部2bは時間包絡情報として平坦の程度を表す情報を算出する。例えば、入力音声信号及び復号信号の時間包絡の分散またはそれに準ずるパラメータのうち少なくとも一つ以上を算出する。さらに別の例では、入力音声信号及び復号信号の時間包絡の相加平均と相乗平均の比またはそれに準ずるパラメータのうち少なくとも一つ以上を算出する。この場合、時間包絡情報符号化部2bは、時間包絡情報として当該入力音声信号の時間包絡の平坦さを表す情報を算出すればよく、前記の例に限定されない。そして、前記パラメータを符号化する。例えば、入力音声信号と復号信号の当該パラメータの差分値またはその絶対値を符号化する。さらに、例えば、入力音声信号の当該パラメータの値または絶対値のうち少なくとも一つ以上を符号化する。例えば、時間包絡の平坦さを平坦か否かで表現すれば1ビットで符号化でき、例えば、前記時間領域の入力音声信号については前記任意の時間セグメント内において1ビットで符号化でき、さらに例えば、前記入力音声信号のサブバンド信号の前記M個の周波数帯域毎に当該情報を符号化する際にはMビットで符号化できる。時間包絡情報の符号化方法は前記の例に限定されない。 Further, for example, the decoding signal x _dec (i) is calculated based on the coding result of the input audio signal in the audio coding unit 2a, and an arbitrary time segment t (l) ≤ i <t (l + 1)). The time envelope E _{dec, t} (i) of the decoded signal x _dec (i) in can be calculated as the power of the decoded signal normalized within the time segment.

Further, for example, when the subband signal X _dec (k, i) of the decoding signal is calculated in the coding process of the input voice signal in the voice coding unit 2a or based on the coding result, the time of the decoding signal As an entrap, within any time segment t (l) ≤ i <t (l + 1)) B (m) (m = 0,…, M, M ≥ 1) (B (0) ≥ 0, B ( Subband signal X _dec (k, i) (B (m) ≤) of the input audio signal divided into M frequency bands whose boundaries are represented by M) <k _h ) and included in the mth frequency band. A subband signal of the input audio signal in which the time-enclosed E _dec (k, i) of k <B (m + 1), t (l) ≤ i <t (l + 1)) is normalized within the time segment. Can be calculated as the power of.

For example, the time-envelope information coding unit 2b calculates information indicating the degree of flatness as time-envelope information. For example, at least one or more of the variance of the time envelope of the input voice signal and the decoded signal or a parameter equivalent thereto is calculated. In yet another example, at least one of the ratio of the arithmetic mean to the geometric mean of the time envelope of the input audio signal and the decoding signal or a parameter equivalent thereto is calculated. In this case, the time-envelope information coding unit 2b may calculate the information indicating the flatness of the time-envelope of the input audio signal as the time-envelope information, and is not limited to the above example. Then, the parameter is encoded. For example, the difference value or the absolute value of the parameter of the input audio signal and the decoded signal is encoded. Further, for example, at least one or more of the values or absolute values of the parameters of the input voice signal are encoded. For example, if the flatness of the time envelope is expressed by whether it is flat or not, it can be encoded with 1 bit. For example, the input audio signal in the time domain can be encoded with 1 bit within the arbitrary time segment. When the information is encoded for each of the M frequency bands of the subband signal of the input audio signal, it can be encoded with M bits. The method for encoding the time envelope information is not limited to the above example.

さらに例えば、時間包絡情報符号化部2bは時間包絡情報として立ち上がりの程度を表す情報を算出する。例えば、任意の時間セグメントt(l)≦i<t(l+1)内において、入力音声信号の時間包絡の時間方向の差分値の最大値を算出する。

さらには、これらの式において、時間包絡に代えて当該時間包絡を時間方向に平滑化したパラメータの時間方向の差分値の最大値を算出できる。 Further, for example, the time-envelope information coding unit 2b calculates information indicating the degree of rise as time-envelope information. For example, within an arbitrary time segment t (l) ≤ i <t (l + 1), the maximum value of the difference value in the time direction of the time envelope of the input audio signal is calculated.

Furthermore, in these equations, instead of the time envelope, the maximum value of the difference value in the time direction of the parameter obtained by smoothing the time envelope in the time direction can be calculated.

この場合、時間包絡情報符号化部2bは、時間包絡情報として当該入力音声信号の時間包絡の立ち上がりの程度を表す情報を算出すればよく、前記の例に限定されない。そして、前記パラメータを符号化する。例えば、入力音声信号と復号信号の当該パラメータの差分値またはその絶対値のうち少なくとも一つ以上を符号化する。例えば、時間包絡の立ち上がりを立ち上がりか否かで表現すれば1ビットで符号化でき、例えば、前記時間領域の入力音声信号については前記任意の時間セグメント内において1ビットで符号化でき、さらに例えば、前記入力音声信号のサブバンド信号の前記M個の周波数帯域毎に当該情報を符号化する際にはMビットで符号化できる。時間包絡情報の符号化方法は前記の例に限定されない。 In this case, the time-envelope information coding unit 2b may calculate as the time-envelope information information indicating the degree of rise of the time-envelope of the input audio signal, and is not limited to the above example. Then, the parameter is encoded. For example, at least one of the difference value of the parameter of the input audio signal and the decoded signal or the absolute value thereof is encoded. For example, if the rising edge of the time envelope is expressed by whether it is rising or not, it can be encoded with 1 bit. For example, the input audio signal in the time domain can be encoded with 1 bit within the arbitrary time segment. When the information is encoded for each of the M frequency bands of the subband signal of the input audio signal, it can be encoded with M bits. The method for encoding the time envelope information is not limited to the above example.

さらに例えば、時間包絡情報符号化部2bは時間包絡情報として立ち下がりの程度を表す情報を算出する。例えば、任意の時間セグメントt(l)≦i<t(l+1)内において、入力音声信号の時間包絡の時間方向の差分値の最小値を算出する。

さらには、これらの式において、時間包絡に代えて当該時間包絡を時間方向に平滑化したパラメータの時間方向の差分値の最小値を算出できる。この場合、時間包絡情報符号化部2bは、時間包絡情報として当該入力音声信号のサブバンド信号の時間包絡の立ち下がりの程度を表す情報を算出すればよく、前記の例に限定されない。そして、前記パラメータを符号化する。例えば、入力音声信号と復号信号の当該パラメータの差分値またはその絶対値のうち少なくとも一つ以上を符号化する。例えば、時間包絡の立ち下がりを立ち下がりか否かで表現すれば1ビットで符号化でき、例えば、前記時間領域の入力音声信号については前記任意の時間セグメント内において1ビットで符号化でき、さらに例えば、前記入力音声信号のサブバンド信号の前記M個の周波数帯域毎に当該情報を符号化する際にはMビットで符号化できる。時間包絡情報の符号化方法は前記の例に限定されない。 Further, for example, the time-envelope information coding unit 2b calculates information indicating the degree of fall as time-envelope information. For example, within an arbitrary time segment t (l) ≤ i <t (l + 1), the minimum value of the difference value in the time direction of the time envelope of the input audio signal is calculated.

Furthermore, in these equations, instead of the time envelope, the minimum value of the difference value in the time direction of the parameter obtained by smoothing the time envelope in the time direction can be calculated. In this case, the time-envelope information coding unit 2b may calculate as the time-envelope information information indicating the degree of the fall of the time-envelope of the subband signal of the input audio signal, and is not limited to the above example. Then, the parameter is encoded. For example, at least one of the difference value of the parameter of the input audio signal and the decoded signal or the absolute value thereof is encoded. For example, if the falling edge of the time envelope is expressed by whether or not it falls, it can be encoded with 1 bit. For example, the input audio signal in the time domain can be encoded with 1 bit within the arbitrary time segment. For example, when encoding the information for each of the M frequency bands of the subband signal of the input audio signal, it can be encoded with M bits. The method for encoding the time envelope information is not limited to the above example.

上記の例では、入力音声信号の時間包絡の代わりに、音声符号化部2aにおいて任意の時間セグメントt(l)≦i<t(l+1)内で当該時間セグメントよりも短い時間セグメントのパワーと相関のある符号化パラメータ（例えば、CELP符号化における符号帳の利得）を用いることができる。 In the above example, instead of the time envelope of the input voice signal, the power of the time segment shorter than the time segment within an arbitrary time segment t (l) ≤ i <t (l + 1) in the voice coding unit 2a. Coding parameters that correlate with (eg, code book gain in CELP coding) can be used.

符号化系列多重化部2cは、音声符号化部2aより入力音声信号の符号化系列を受け取り、時間包絡情報符号化部2bより符号化された時間包絡形状情報を受け取り、多重化して符号化系列として出力する（ステップS2-3）。 The coded sequence multiplexing unit 2c receives the coded sequence of the input audio signal from the voice coding unit 2a, receives the time-enclosed shape information encoded from the time-enclosed information coding unit 2b, and multiplexes the coded sequence. Is output as (step S2-3).

［第11の実施形態］
図69は、第11の実施形態に係る音声復号装置100の構成を示す図である。音声復号装置100の通信装置は、下記音声符号化装置200から出力される多重化された符号化系列を受信し、更に、復号した音声信号を外部に出力する。音声復号装置100は、図69に示すように、機能的には、符号化系列逆多重化部100a、低周波数復号部100b、低周波数時間包絡形状決定部100c、低周波数時間包絡修正部100d、高周波数復号部100e、及び低周波数/高周波数信号合成部100fを備える。 [11th Embodiment]
FIG. 69 is a diagram showing a configuration of the audio decoding device 100 according to the eleventh embodiment. The communication device of the voice decoding device 100 receives the multiplexed coding sequence output from the following voice coding device 200, and further outputs the decoded voice signal to the outside. As shown in FIG. 69, the voice decoding apparatus 100 functionally includes a coding sequence demultiplexing unit 100a, a low frequency decoding unit 100b, a low frequency time envelope shape determining unit 100c, and a low frequency time envelope correction unit 100d. It includes a high frequency decoding unit 100e and a low frequency / high frequency signal synthesis unit 100f.

図70は、第11の実施形態に係る音声復号装置の動作を示すフローチャートである。 FIG. 70 is a flowchart showing the operation of the voice decoding device according to the eleventh embodiment.

符号化系列逆多重化部100aは、符号化系列を、低周波数信号を符号化した低周波数符号化部分と高周波数信号を符号化した高周波数符号化部分に分割する（ステップS100-1）。 The coded sequence demultiplexing section 100a divides the coded sequence into a low frequency coded portion that encodes a low frequency signal and a high frequency coded portion that encodes a high frequency signal (step S100-1).

低周波数復号部100bは、符号化系列逆多重化部100aにて分割された低周波数符号化部分を復号し、低周波数信号を得る（ステップS100-2）。 The low frequency decoding unit 100b decodes the low frequency coded portion divided by the coded sequence demultiplexing unit 100a to obtain a low frequency signal (step S100-2).

低周波数時間包絡形状決定部100cは、符号化系列逆多重化部100aで分割された低周波時間包絡形状に関する情報、及び低周波数復号部100bで得られた低周波数信号のうち少なくとも一つ以上に基づき、低周波数信号の時間包絡形状を決定する（ステップS100-3）。 The low frequency time envelope shape determination unit 100c is used to provide at least one of the information on the low frequency time envelope shape divided by the coded sequence demultiplexing unit 100a and the low frequency signal obtained by the low frequency decoding unit 100b. Based on this, the time envelope shape of the low frequency signal is determined (step S100-3).

例えば、低周波数信号の時間包絡形状を平坦と決定するケース、低周波数信号の時間包絡形状を立ち上がりと決定するケース、低周波数信号の時間包絡形状を立ち下がりと決定するケースが挙げられる。 For example, there are cases where the time envelope shape of a low frequency signal is determined to be flat, a case where the time envelope shape of a low frequency signal is determined to be rising, and a case where the time envelope shape of a low frequency signal is determined to be falling.

低周波数信号の時間包絡形状の決定は、例えば、時間包絡形状決定部1cにおける復号信号の時間包絡形状の決定処理において、音声復号部1bで得られる復号信号を、低周波数復号部100bで得られた低周波数信号に置き換えることにより実現できる。 In the determination of the time envelope shape of the low frequency signal, for example, in the process of determining the time envelope shape of the decoding signal in the time envelope shape determination unit 1c, the decoding signal obtained by the voice decoding unit 1b is obtained by the low frequency decoding unit 100b. It can be realized by replacing it with a low frequency signal.

低周波数時間包絡修正部100dは、低周波数時間包絡形状決定部100cで決定した時間包絡形状に基づいて、低周波数復号部100bから出力される低周波数信号の時間包絡の形状を修正する（ステップS100-4）。 The low frequency time envelope correction unit 100d corrects the time envelope shape of the low frequency signal output from the low frequency decoding unit 100b based on the time envelope shape determined by the low frequency time envelope shape determination unit 100c (step S100). -Four).

低周波数信号の時間包絡形状の修正は、例えば、時間包絡修正部1dにおける復号信号の時間包絡形状の修正処理において、音声復号部1bで得られる復号信号を、低周波数復号部100bで得られた低周波数信号に置き換えることにより実現できる。 To correct the time envelope shape of the low frequency signal, for example, in the time envelope shape correction process of the decoding signal in the time envelope correction unit 1d, the decoding signal obtained by the voice decoding unit 1b was obtained by the low frequency decoding unit 100b. This can be achieved by replacing it with a low frequency signal.

高周波数復号部100eは、符号化系列逆多重化部100aにて分割された高周波数符号化部分を復号し、高周波数信号を得る（ステップS100-5）。 The high frequency decoding unit 100e decodes the high frequency coded portion divided by the coded sequence demultiplexing unit 100a to obtain a high frequency signal (step S100-5).

高周波数復号部100eでの高周波数信号の復号は、高周波数信号を時間領域の信号、サブバンド信号、及び周波数領域の信号のうち少なくとも一つ以上の領域の信号で符号化した符号化系列を復号する方法で実現できる。 Decoding of a high frequency signal by the high frequency decoding unit 100e is performed by encoding a high frequency signal with a signal in at least one or more of a time domain signal, a subband signal, and a frequency domain signal. It can be realized by the method of decoding.

さらには、例えば前記第1〜第9の実施形態の音声復号装置のように、低周波数復号部で得られた復号結果を利用して高周波数信号を生成する帯域拡張方式で、高周波数信号を生成できる。この際には、帯域拡張方式にて高周波数信号を生成するために必要な情報が符号化系列に含まれる場合、符号化系列のうち当該情報が含まれる部分が高周波数符号化部分となる。そして、符号化系列逆多重化部100aにて分割された当該高周波数符号化部分を復号して帯域拡張方式に必要な情報を得て、高周波数信号を生成する。一方、帯域拡張方式にて高周波数信号を生成するために必要な情報が符号化系列に含まれない場合、符号化系列逆多重化部100aより高周波数復号部100eに入力は無く、所定の処理または低周波数復号部で得られた復号結果を利用した処理によって高周波数信号を生成する。 Further, for example, as in the voice decoding apparatus of the first to ninth embodiments, the high frequency signal is generated by a band expansion method that generates a high frequency signal by using the decoding result obtained by the low frequency decoding unit. Can be generated. At this time, when the coded sequence includes the information necessary for generating the high frequency signal by the band expansion method, the portion of the coded sequence containing the information becomes the high frequency coded portion. Then, the high frequency coded portion divided by the coded sequence demultiplexing unit 100a is decoded to obtain the information required for the band expansion method, and a high frequency signal is generated. On the other hand, when the coded sequence does not include the information required to generate the high frequency signal by the band expansion method, there is no input from the coded sequence demultiplexing section 100a to the high frequency decoding section 100e, and a predetermined process is performed. Alternatively, a high frequency signal is generated by processing using the decoding result obtained by the low frequency decoding unit.

低周波数/高周波数信号合成部100fは、低周波数時間包絡修正部100dで時間包絡形状を修正された低周波数信号と、高周波数復号部100eで得られた高周波数信号とを合成して低周波数成分および高周波数成分を含む音声信号を出力する（ステップS100-6）。 The low frequency / high frequency signal synthesizer 100f synthesizes the low frequency signal whose time wrapping shape is corrected by the low frequency time wrapping correction unit 100d and the high frequency signal obtained by the high frequency decoding unit 100e to generate a low frequency. Output an audio signal containing components and high frequency components (step S100-6).

図71は、第11の実施形態に係る音声符号化装置200の構成を示す図である。音声符号化装置200の通信装置は、符号化の対象となる音声信号を外部から受信し、更に、符号化された符号化系列を外部に出力する。音声符号化装置200は、図65に示すように、機能的には、低周波数符号化部200a、高周波数符号化部200b、低周波数時間包絡情報符号化部200c、及び符号化系列多重化部200dを備える。 FIG. 71 is a diagram showing a configuration of the voice coding device 200 according to the eleventh embodiment. The communication device of the voice coding device 200 receives the voice signal to be coded from the outside, and further outputs the coded coding sequence to the outside. As shown in FIG. 65, the voice coding device 200 functionally includes a low frequency coding unit 200a, a high frequency coding unit 200b, a low frequency time wrapping information coding unit 200c, and a coding sequence multiplexing unit. Equipped with 200d.

図72は、第11の実施形態に係る音声符号化装置200の動作を示すフローチャートである。 FIG. 72 is a flowchart showing the operation of the voice coding device 200 according to the eleventh embodiment.

低周波数符号化部200aは、入力音声信号の低周波数成分にあたる低周波数信号を符号化する（ステップS200-1）。 The low frequency coding unit 200a encodes a low frequency signal corresponding to a low frequency component of the input voice signal (step S200-1).

高周波数符号化部200bは、入力音声信号の高周波数成分にあたる高周波数信号を符号化する（ステップS200-2）。 The high frequency coding unit 200b encodes a high frequency signal corresponding to a high frequency component of the input voice signal (step S200-2).

低周波数時間包絡情報符号化部200cは、入力音声信号、低周波数符号化部200aにおける入力音声信号の符号化結果を含む符号化の過程で得られた情報のうち少なくとも一つ以上に基づき、低周波数時間包絡形状情報を算出し符号化する（ステップS200-3）。 The low frequency time wrapping information coding unit 200c is low based on at least one of the information obtained in the coding process including the input voice signal and the coding result of the input voice signal in the low frequency coding unit 200a. The frequency-time wrapping shape information is calculated and encoded (step S200-3).

低周波数時間包絡形状情報の算出、符号化処理は、例えば、時間包絡情報符号化部2bにおける入力音声信号の時間包絡情報の算出、符号化処理において、入力音声信号に代えて入力音声信号の低周波数信号を、復号信号に代えて低周波数符号化部200aにおける符号化結果を復号して得られる低周波数復号信号を用いることで、同様にして実現できる。 In the calculation and coding processing of the low frequency time wrapping shape information, for example, in the calculation and coding processing of the time wrapping information of the input voice signal in the time wrapping information coding unit 2b, the input voice signal is low instead of the input voice signal. The same can be achieved by using the low frequency decoding signal obtained by decoding the coding result in the low frequency coding unit 200a instead of the decoding signal for the frequency signal.

符号化系列多重化部200dは、低周波数符号化部200aより低周波数音声信号の符号化系列を受け取り、高周波数符号化部200bより高周波数音声信号の符号化系列を受け取り、低周波数時間包絡情報符号化部200cより符号化された低周波数時間包絡形状情報を受け取り、多重化して符号化系列として出力する（ステップS200-4）。 The coded sequence multiplexing unit 200d receives the coded sequence of the low frequency audio signal from the low frequency coded unit 200a, receives the coded sequence of the high frequency audio signal from the high frequency coded unit 200b, and receives the low frequency time wrapping information. It receives the coded low-frequency time-enclosed shape information from the coding unit 200c, multiplexes it, and outputs it as a coded sequence (step S200-4).

［第11の実施形態の音声復号装置の第1の変形例］
図73は、第11の実施形態に係る音声復号装置の第1の変形例100Aの構成を示す図である。 [First modification of the audio decoding device of the eleventh embodiment]
FIG. 73 is a diagram showing a configuration of a first modification 100A of the audio decoding device according to the eleventh embodiment.

図74は、第11の実施形態に係る音声復号装置の第1の変形例100Aの動作を示すフローチャートである。 FIG. 74 is a flowchart showing the operation of the first modification 100A of the audio decoding device according to the eleventh embodiment.

高周波数復号部100eAは、符号化系列逆多重化部100aにて分割された高周波数符号化部分を復号し、高周波数信号を得る（ステップS100-5A）。 The high frequency decoding unit 100eA decodes the high frequency coded portion divided by the coded sequence demultiplexing unit 100a to obtain a high frequency signal (step S100-5A).

高周波数復号部100eAでは、高周波数信号の復号において低周波数復号部で得られた低周波数復号信号を利用する際に、低周波数時間包絡修正部100dで時間包絡形状を修正された低周波数信号を利用する点が、高周波数復号部100eと異なる点である。 In the high frequency decoding unit 100eA, when the low frequency decoding signal obtained by the low frequency decoding unit is used in decoding the high frequency signal, the low frequency signal whose time wrapping shape is corrected by the low frequency time wrapping correction unit 100d is used. It is different from the high frequency decoding unit 100e in that it is used.

［第11の実施形態の音声復号装置の第2の変形例］
図75は、第11の実施形態に係る音声復号装置の第1の変形例100Aの構成を示す図である。 [Second variant of the audio decoding device of the eleventh embodiment]
FIG. 75 is a diagram showing a configuration of a first modification 100A of the audio decoding device according to the eleventh embodiment.

第11の実施形態の音声復号装置の第1の変形例との相違点は、低周波数／高周波数信号合成部100fに入力される低周波数信号が、低周波数時間包絡修正部100dからの出力ではなく、低周波数復号部100bからの出力である点である。 The difference from the first modification of the voice decoding apparatus of the eleventh embodiment is that the low frequency signal input to the low frequency / high frequency signal synthesis unit 100f is output from the low frequency time wrapping correction unit 100d. The point is that it is an output from the low frequency decoding unit 100b.

［第12の実施形態］
図76は、第12の実施形態に係る音声復号装置110の構成を示す図である。音声復号装置110の通信装置は、下記音声符号化装置210から出力される多重化された符号化系列を受信し、更に、復号した音声信号を外部に出力する。音声復号装置110は、図76に示すように、機能的には、符号化系列逆多重化部110a、低周波数復号部100b、高周波数復号部100e、高周波数時間包絡形状決定部110b、高周波数時間包絡修正部110c、及び低周波数/高周波数信号合成部100fを備える。 [12th Embodiment]
FIG. 76 is a diagram showing a configuration of the voice decoding device 110 according to the twelfth embodiment. The communication device of the voice decoding device 110 receives the multiplexed coding sequence output from the following voice coding device 210, and further outputs the decoded voice signal to the outside. As shown in FIG. 76, the voice decoding device 110 functionally includes a coding sequence demultiplexing unit 110a, a low frequency decoding unit 100b, a high frequency decoding unit 100e, a high frequency time envelope shape determining unit 110b, and a high frequency. It is provided with a time envelope correction unit 110c and a low frequency / high frequency signal synthesis unit 100f.

図77は、第12の実施形態に係る音声復号装置の動作を示すフローチャートである。 FIG. 77 is a flowchart showing the operation of the voice decoding device according to the twelfth embodiment.

符号化系列逆多重化部110aは、符号化系列を、低周波数符号化部分、高周波数符号化部分、高周波数時間包絡形状に関する情報に分割する（ステップS110-1）。 The coded sequence demultiplexing section 110a divides the coded sequence into information about the low frequency coded portion, the high frequency coded portion, and the high frequency time envelope shape (step S110-1).

高周波数時間包絡形状決定部110bは、符号化系列逆多重化部110aで分割された高周波時間包絡形状に関する情報、高周波数復号部100eで得られた高周波数信号、及び低周波数復号部100bで得られた低周波数信号のうち少なくとも一つ以上に基づき、高周波数信号の時間包絡形状を決定する（ステップS110-2）。 The high frequency time entrainment shape determination unit 110b obtains information on the high frequency time encapsulation shape divided by the coded sequence demultiplexing unit 110a, the high frequency signal obtained by the high frequency decoding unit 100e, and the low frequency decoding unit 100b. The time wrapping shape of the high frequency signal is determined based on at least one of the obtained low frequency signals (step S110-2).

例えば、高周波数信号の時間包絡形状を平坦と決定するケース、高周波数信号の時間包絡形状を立ち上がりと決定するケース、高周波数信号の時間包絡形状を立ち下がりと決定するケースが挙げられる。 For example, there are cases where the time envelope shape of a high frequency signal is determined to be flat, a case where the time envelope shape of a high frequency signal is determined to be rising, and a case where the time envelope shape of a high frequency signal is determined to be falling.

高周波数信号の時間包絡形状の決定は、例えば、時間包絡形状決定部1cにおける復号信号の時間包絡形状の決定処理において、音声復号部1bで得られる復号信号を、高周波数復号部100eで得られた高周波数信号に置き換えることにより実現できる。また、同様に、音声復号部1bで得られる復号信号を、低周波数復号部100bで得られた低周波数信号に置き換えることにより実現できる。 In the determination of the time envelope shape of the high frequency signal, for example, in the process of determining the time envelope shape of the decoding signal in the time envelope shape determination unit 1c, the decoding signal obtained by the voice decoding unit 1b is obtained by the high frequency decoding unit 100e. It can be realized by replacing it with a high frequency signal. Similarly, it can be realized by replacing the decoding signal obtained by the voice decoding unit 1b with the low frequency signal obtained by the low frequency decoding unit 100b.

高周波数時間包絡修正部110cは、高周波数時間包絡形状決定部110bで決定した時間包絡形状に基づいて、高周波数復号部110eから出力される高周波数信号の時間包絡の形状を修正する（ステップS110-3）。例えば、前記高周波数信号の時間包絡形状が平坦と決定された場合、以下の処理により、高周波数信号の時間包絡形状を修正できる。 The high frequency time envelope correction unit 110c corrects the time envelope shape of the high frequency signal output from the high frequency decoding unit 110e based on the time envelope shape determined by the high frequency time envelope shape determination unit 110b (step S110). -3). For example, when the time envelope shape of the high frequency signal is determined to be flat, the time envelope shape of the high frequency signal can be corrected by the following processing.

高周波数信号の時間包絡形状の修正は、例えば、時間包絡修正部1dにおける復号信号の時間包絡形状の修正処理において、音声復号部1bで得られる復号信号を、高周波数復号部100eで得られた高周波数信号に置き換えることにより実現できる。 To correct the time-envelope shape of the high-frequency signal, for example, in the time-envelope shape correction process of the decoding signal in the time-envelope correction unit 1d, the decoding signal obtained by the voice decoding unit 1b was obtained by the high-frequency decoding unit 100e. This can be achieved by replacing it with a high frequency signal.

図78は、第12の実施形態に係る音声符号化装置210の構成を示す図である。音声符号化装置210の通信装置は、符号化の対象となる音声信号を外部から受信し、更に、符号化された符号化系列を外部に出力する。音声符号化装置210は、図78に示すように、機能的には、低周波数符号化部200a、高周波数符号化部200b、高周波数時間包絡情報符号化部210a、及び符号化系列多重化部210bを備える。 FIG. 78 is a diagram showing a configuration of the voice coding device 210 according to the twelfth embodiment. The communication device of the voice coding device 210 receives the voice signal to be coded from the outside, and further outputs the coded coding sequence to the outside. As shown in FIG. 78, the voice coding device 210 functionally has a low frequency coding unit 200a, a high frequency coding unit 200b, a high frequency time envelope information coding unit 210a, and a coding sequence multiplexing unit. Equipped with 210b.

図79は、第12の実施形態に係る音声符号化装置210の動作を示すフローチャートである。 FIG. 79 is a flowchart showing the operation of the voice coding device 210 according to the twelfth embodiment.

高周波数時間包絡情報符号化部210aは、入力音声信号、低周波数符号化部200aにおける入力音声信号の符号化結果を含む符号化の過程で得られた情報、高周波数符号化部200bにおける入力音声信号の符号化結果を含む符号化の過程で得られた情報のうち少なくとも一つ以上に基づき、高周波数時間包絡形状情報を算出し符号化する（ステップS210-1）。 The high frequency time wrapping information coding unit 210a is the input voice signal, the information obtained in the coding process including the coding result of the input voice signal in the low frequency coding unit 200a, and the input voice in the high frequency coding unit 200b. High frequency time entrapment shape information is calculated and encoded based on at least one or more of the information obtained in the coding process including the signal coding result (step S210-1).

高周波数時間包絡形状情報の算出、符号化処理は、例えば、時間包絡情報符号化部2bにおける入力音声信号の時間包絡情報の算出、符号化処理において、入力音声信号に代えて入力音声信号の高周波数信号を、復号信号に代えて高周波数符号化部200bにおける符号化結果を復号して得られる高周波数復号信号を用いることで、同様にして実現できる。 In the calculation and coding processing of the high frequency time wrapping shape information, for example, in the calculation and coding processing of the time wrapping information of the input voice signal in the time wrapping information coding unit 2b, the height of the input voice signal is replaced with the input voice signal. The same can be achieved by using the high frequency decoding signal obtained by decoding the coding result in the high frequency coding unit 200b instead of the decoding signal for the frequency signal.

符号化系列多重化部210bは、低周波数符号化部200aより低周波数音声信号の符号化系列を受け取り、高周波数符号化部200bより高周波数音声信号の符号化系列を受け取り、高周波数時間包絡情報符号化部210aより符号化された高周波数時間包絡形状情報を受け取り、多重化して符号化系列として出力する（ステップS210-2）。 The coding sequence multiplexing section 210b receives the coding sequence of the low frequency audio signal from the low frequency coding section 200a, receives the coding sequence of the high frequency voice signal from the high frequency coding section 200b, and receives the high frequency time wrapping information. The high-frequency time-enclosed shape information encoded by the coding unit 210a is received, multiplexed, and output as a coded sequence (step S210-2).

［第13の実施形態］
図80は、第13の実施形態に係る音声復号装置120の構成を示す図である。音声復号装置120の通信装置は、下記音声符号化装置220から出力される多重化された符号化系列を受信し、更に、復号した音声信号を外部に出力する。音声復号装置120は、図80に示すように、機能的には、符号化系列逆多重化部120a、低周波数復号部100b、低周波数時間包絡形状決定部100c、低周波数時間包絡修正部100d、高周波数復号部100e、高周波数時間包絡形状決定部120b、高周波数時間包絡修正部110c、及び低周波数/高周波数信号合成部100fを備える。 [13th Embodiment]
FIG. 80 is a diagram showing a configuration of the audio decoding device 120 according to the thirteenth embodiment. The communication device of the voice decoding device 120 receives the multiplexed coding sequence output from the following voice coding device 220, and further outputs the decoded voice signal to the outside. As shown in FIG. 80, the voice decoding device 120 functionally includes a coding sequence demultiplexing unit 120a, a low frequency decoding unit 100b, a low frequency time envelope shape determining unit 100c, and a low frequency time envelope correcting unit 100d. It includes a high frequency decoding unit 100e, a high frequency time envelope shape determination unit 120b, a high frequency time envelope correction unit 110c, and a low frequency / high frequency signal synthesis unit 100f.

図81は、第13の実施形態に係る音声復号装置120の動作を示すフローチャートである。 FIG. 81 is a flowchart showing the operation of the voice decoding device 120 according to the thirteenth embodiment.

符号化系列逆多重化部120aは、符号化系列を、低周波数符号化部分、高周波数符号化部分、低周波数時間包絡形状に関する情報、高周波数時間包絡形状に関する情報に分割する（ステップS120-1）。 The coded sequence demultiplexing unit 120a divides the coded sequence into a low frequency coded portion, a high frequency coded portion, information on a low frequency time envelope shape, and information on a high frequency time envelope shape (step S120-1). ).

この際、低周波数時間包絡形状に関する情報、及び高周波数時間包絡形状に関する情報の分割に関して、例えば、別々に符号化された低周波数時間包絡形状に関する情報、及び高周波数時間包絡形状に関する情報を含む符号化系列から分割することもでき、また組み合わせて符号化された周波数時間包絡形状に関する情報、及び高周波数時間包絡形状に関する情報を含む符号化系列から分割することもできる。さらには、例えば、当該低周波数時間包絡形状に関する情報、及び当該高周波数時間包絡形状に関する情報が単一の情報により表され符号化された当該情報を含む符号化系列から分割することもできる。 At this time, regarding the division of the information regarding the low frequency time inclusion shape and the information regarding the high frequency time inclusion shape, for example, a code including information regarding the separately encoded low frequency time inclusion shape and information regarding the high frequency time inclusion shape. It can be split from the compound sequence, or it can be split from a coded sequence that contains information about the frequency time entrapment shape encoded in combination and information about the high frequency time entrapment shape. Further, for example, the information about the low frequency time envelope shape and the information about the high frequency time envelope shape can be divided from a coding series including the coded information represented by a single piece of information.

高周波数時間包絡形状決定部120bは、符号化系列逆多重化部120aで分割された高周波時間包絡形状に関する情報、低周波数復号部100bで得られた低周波数信号、及び低周波数時間包絡修正部100dで時間包絡形状を修正された低周波数信号のうち少なくとも一つ以上に基づき、高周波数信号の時間包絡形状を決定する（ステップS120-2）。 The high frequency time wrapping shape determination unit 120b contains information on the high frequency time wrapping shape divided by the coded sequence demultiplexing unit 120a, the low frequency signal obtained by the low frequency decoding unit 100b, and the low frequency time wrapping correction unit 100d. The time-wrapping shape of the high-frequency signal is determined based on at least one of the low-frequency signals whose time-wrapping shape has been corrected in (Step S120-2).

高周波数時間包絡形状決定部120bにおける高周波数時間包絡形状の決定処理において、低周波数時間包絡修正部100dで時間包絡形状を修正された低周波数信号に基づく場合は、時間包絡形状決定部1cにおける復号信号の時間包絡形状の決定処理において、音声復号部1bで得られる復号信号を、低周波数時間包絡修正部100dで時間包絡形状を修正された低周波数信号に置き換えることにより実現できる。 In the high frequency time entrainment shape determination process in the high frequency time entrainment shape determination unit 120b, if it is based on the low frequency signal whose time entrapment shape has been corrected by the low frequency time entrapment correction unit 100d, decoding in the time entrapment shape determination unit 1c. In the process of determining the time wrapping shape of the signal, it can be realized by replacing the decoding signal obtained by the voice decoding unit 1b with the low frequency signal whose time wrapping shape is corrected by the low frequency time wrapping correction unit 100d.

図82は、第13の実施形態に係る音声符号化装置220の構成を示す図である。音声符号化装置220の通信装置は、符号化の対象となる音声信号を外部から受信し、更に、符号化された符号化系列を外部に出力する。音声符号化装置220は、図82に示すように、機能的には、低周波数符号化部200a、高周波数符号化部200b、低周波数時間包絡情報符号化部200c、高周波数時間包絡情報符号化部220a、及び符号化系列多重化部220bを備える。 FIG. 82 is a diagram showing a configuration of the voice coding device 220 according to the thirteenth embodiment. The communication device of the voice coding device 220 receives the voice signal to be coded from the outside, and further outputs the coded coding sequence to the outside. As shown in FIG. 82, the voice coding device 220 functionally has a low frequency coding unit 200a, a high frequency coding unit 200b, a low frequency time wrapping information coding unit 200c, and a high frequency time wrapping information coding unit. A unit 220a and a coded sequence multiplexing unit 220b are provided.

図83は、第13の実施形態に係る音声符号化装置220の動作を示すフローチャートである。 FIG. 83 is a flowchart showing the operation of the voice coding device 220 according to the thirteenth embodiment.

高周波数時間包絡情報符号化部220aは、入力音声信号、低周波数符号化部200aにおける入力音声信号の符号化結果を含む符号化の過程で得られた情報、高周波数符号化部200bにおける入力音声信号の符号化結果を含む符号化の過程で得られた情報、低周波数時間包絡情報符号化部200cにおける低周波数時間包絡情報の符号化結果を含む符号化の過程で得られた情報のうち少なくとも一つ以上に基づき、高周波数時間包絡形状情報を算出し符号化する（ステップS220-1）。 The high frequency time wrapping information coding unit 220a is the input voice signal, the information obtained in the coding process including the coding result of the input voice signal in the low frequency coding unit 200a, and the input voice in the high frequency coding unit 200b. At least of the information obtained in the coding process including the signal coding result and the information obtained in the coding process including the coding result of the low frequency time wrapping information in the low frequency time wrapping information coding unit 200c. High frequency time entrapment shape information is calculated and encoded based on one or more (step S220-1).

高周波数時間包絡形状情報の算出、符号化処理は、例えば、高周波数時間包絡情報符号化部210aにおける高周波数信号の時間包絡情報の算出、符号化処理と同様にして実現できる。更には、例えば、低周波数時間包絡情報の符号化結果に基づいてもよい。例えば、低周波数時間包絡情報の符号化結果として低周波数時間包絡が平坦であるという結果が得られた場合にのみ、高周波数時間包絡情報として高周波数時間包絡が平坦であるか否かを符号化することができる。 The calculation and coding process of the high frequency time envelope shape information can be realized in the same manner as the calculation and coding process of the time envelope information of the high frequency signal in the high frequency time envelope information coding unit 210a, for example. Further, for example, it may be based on the coding result of the low frequency time envelope information. For example, it is encoded whether or not the high frequency time envelope is flat as the high frequency time envelope information only when the result that the low frequency time envelope is flat is obtained as the result of encoding the low frequency time envelope information. can do.

符号化系列多重化部220bは、低周波数符号化部200aより低周波数音声信号の符号化系列を受け取り、高周波数符号化部200bより高周波数音声信号の符号化系列を受け取り、低周波数時間包絡情報符号化部200cより符号化された低周波数時間包絡形状情報を受け取り、高周波数時間包絡情報符号化部210aより符号化された高周波数時間包絡形状情報を受け取り、多重化して符号化系列として出力する（ステップS220-2）。 The coding sequence multiplexing section 220b receives the coding sequence of the low frequency audio signal from the low frequency coding section 200a, receives the coding sequence of the high frequency voice signal from the high frequency coding section 200b, and receives the low frequency time wrapping information. Receives the low-frequency time-enclosed shape information encoded from the coding unit 200c, receives the high-frequency time-enclosed shape information encoded from the high-frequency time-enclosed information coding unit 210a, multiplexes it, and outputs it as a coded sequence. (Step S220-2).

この際、低周波数時間包絡形状に関する情報、及び高周波数時間包絡形状に関する情報の符号化に関して、例えば、別々に符号化された低周波数時間包絡形状に関する情報、及び高周波数時間包絡形状に関する情報を受け取ることもでき、また組み合わせて符号化された周波数時間包絡形状に関する情報、及び高周波数時間包絡形状に関する情報を受け取ることもできる。さらには、例えば、単一の情報により表され符号化された当該低周波数時間包絡形状に関する情報、及び当該高周波数時間包絡形状に関する情報を受け取ることもできる。 At this time, regarding the coding of the information regarding the low frequency time inclusion shape and the information regarding the high frequency time inclusion shape, for example, the information regarding the separately encoded low frequency time inclusion shape and the information regarding the high frequency time inclusion shape are received. It is also possible to receive information about the frequency time entrainment shape encoded in combination and information about the high frequency time entrainment shape. Further, for example, it is possible to receive information on the low frequency time envelope shape represented and encoded by a single piece of information, and information on the high frequency time envelope shape.

［第13の実施形態の音声復号装置の第1の変形例］
図84は、第13の実施形態に係る音声復号装置の第1の変形例120Aの構成を示す図である。第13の実施形態の音声復号装置120との相違点は、高周波数復号部100eAにて、高周波数信号の復号に低周波数時間包絡修正部100dで時間包絡形状を修正された低周波数信号を利用する点である。 [First variant of the audio decoding device of the thirteenth embodiment]
FIG. 84 is a diagram showing a configuration of a first modification 120A of the audio decoding device according to the thirteenth embodiment. The difference from the voice decoding device 120 of the thirteenth embodiment is that the high frequency decoding unit 100eA uses the low frequency signal whose time envelope shape is corrected by the low frequency time envelope correction unit 100d for decoding the high frequency signal. It is a point to do.

図85は、第13の実施形態に係る音声復号装置の第1の変形例120Aの動作を示すフローチャートである。図85のステップ100-5Aでは、高周波数信号の復号において低周波数復号部100bで得られた低周波数復号信号を利用する際に、低周波数時間包絡修正部100dで時間包絡形状を修正された低周波数信号を利用する。 FIG. 85 is a flowchart showing the operation of the first modification 120A of the audio decoding device according to the thirteenth embodiment. In step 100-5A of FIG. 85, when the low frequency decoding signal obtained by the low frequency decoding unit 100b is used in decoding the high frequency signal, the low frequency time wrapping shape is corrected by the low frequency time wrapping correction unit 100d. Use frequency signals.

［第13の実施形態の音声復号装置の第2の変形例］
図86は、第13の実施形態に係る音声復号装置の第2の変形例120Bの構成を示す図である。第13の実施形態の音声復号装置の第1の変形例との相違点は、低周波数／高周波数信号合成部100fに入力される低周波数信号が、低周波数時間包絡修正部100dからの出力ではなく、低周波数復号部100bからの出力である点である。 [Second variant of the audio decoding device of the thirteenth embodiment]
FIG. 86 is a diagram showing a configuration of a second modification 120B of the audio decoding device according to the thirteenth embodiment. The difference from the first modification of the voice decoding apparatus of the thirteenth embodiment is that the low frequency signal input to the low frequency / high frequency signal synthesis unit 100f is output from the low frequency time entrainment correction unit 100d. The point is that it is an output from the low frequency decoding unit 100b.

図87は、第13の実施形態に係る音声復号装置の第2の変形例120Bの動作を示すフローチャートである。図87のステップS100-6では、低周波数復号部100bからの低周波数信号と高周波数時間包絡修正部110cからの高周波数信号とが合成される。 FIG. 87 is a flowchart showing the operation of the second modification 120B of the audio decoding device according to the thirteenth embodiment. In step S100-6 of FIG. 87, the low frequency signal from the low frequency decoding unit 100b and the high frequency signal from the high frequency time envelope correction unit 110c are combined.

［第13の実施形態の音声復号装置の第3の変形例］
図185は、第13の実施形態に係る音声復号装置の第3の変形例120Cの構成を示す図である。 [Third variant of the audio decoding device of the thirteenth embodiment]
FIG. 185 is a diagram showing a configuration of a third modification 120C of the audio decoding device according to the thirteenth embodiment.

図186は、第13の実施形態に係る音声復号装置の第3の変形例120Cの動作を示すフローチャートである。 FIG. 186 is a flowchart showing the operation of the third modification 120C of the audio decoding device according to the thirteenth embodiment.

本変形例と前記第13の実施形態に係る音声復号装置120との相違点は、低周波数時間包絡形状決定部100c、高周波数時間包絡修正部110cにかえて、低周波数時間包絡形状決定部120c、高周波数時間包絡修正部120dを具備する点である。 The difference between this modification and the voice decoding device 120 according to the thirteenth embodiment is that the low frequency time envelope shape determination unit 120c is replaced with the low frequency time envelope shape determination unit 100c and the high frequency time envelope correction unit 110c. The point is that the high frequency time envelope correction unit 120d is provided.

本変形例においては、低周波数時間包絡形状決定部120cと前記低周波数時間包絡形状決定部100cとの相違点は、決定した時間包絡形状を高周波数時間包絡修正部120dへも通知する点である。 In this modification, the difference between the low frequency time envelope shape determining unit 120c and the low frequency time envelope shape determining unit 100c is that the determined time envelope shape is also notified to the high frequency time envelope correction unit 120d. ..

高周波数時間包絡修正部120dと前記高周波数時間包絡修正部110cとの相違点は、高周波数時間包絡形状決定部120bにて決定された時間包絡形状と低周波数時間包絡形状決定部120cで決定された時間包絡形状のうち少なくとも一つ以上に基づいて、高周波数復号部100eから出力される高周波数信号の時間包絡の形状を修正する点である(S120-3)。 The difference between the high frequency time envelope correction unit 120d and the high frequency time envelope correction unit 110c is determined by the time envelope shape determined by the high frequency time envelope shape determination unit 120b and the low frequency time envelope shape determination unit 120c. The point is to modify the shape of the time envelope of the high frequency signal output from the high frequency decoding unit 100e based on at least one of the time envelope shapes (S120-3).

例えば、低周波数時間包絡形状決定部120cにて時間包絡形状が平坦であると決定された場合には、高周波数時間包絡形状決定部120bにて決定される時間包絡形状によらず、高周波数復号部100eから出力される高周波数信号の時間包絡の形状を平坦に修正する。更に例えば、低周波数時間包絡形状決定部120cにて時間包絡形状が平坦でないと決定された場合には、高周波数時間包絡形状決定部120bにて決定される時間包絡形状によらず、高周波数復号部100eから出力される高周波数信号の時間包絡の形状を平坦に修正しない。立ち上がり、立ち下がりの場合も同様であり、時間包絡形状は限定されない。 For example, when the low frequency time envelope shape determining unit 120c determines that the time envelope shape is flat, the high frequency decoding is performed regardless of the time envelope shape determined by the high frequency time envelope shape determining unit 120b. The shape of the time envelope of the high frequency signal output from the unit 100e is corrected to be flat. Further, for example, when the low frequency time envelope shape determining unit 120c determines that the time envelope shape is not flat, the high frequency decoding is performed regardless of the time envelope shape determined by the high frequency time envelope shape determining unit 120b. The shape of the time envelope of the high frequency signal output from the unit 100e is not corrected flatly. The same applies to the case of rising and falling, and the time envelope shape is not limited.

［第13の実施形態の音声復号装置の第4の変形例］
図187は、第13の実施形態に係る音声復号装置の第4の変形例120Dの構成を示す図である。 [Fourth variant of the audio decoding device of the thirteenth embodiment]
FIG. 187 is a diagram showing a configuration of a fourth modification 120D of the audio decoding device according to the thirteenth embodiment.

図188は、第13の実施形態に係る音声復号装置の第4の変形例120Dの動作を示すフローチャートである。 FIG. 188 is a flowchart showing the operation of the fourth modification 120D of the audio decoding device according to the thirteenth embodiment.

本変形例と前記第13の実施形態に係る音声復号装置120との相違点は、高周波数時間包絡形状決定部120b、低周波数時間包絡修正部100dにかえて、高周波数時間包絡形状決定部120bA、低周波数時間包絡修正部120eを具備する点である。 The difference between this modification and the voice decoding device 120 according to the thirteenth embodiment is that the high frequency time envelope shape determination unit 120bA is replaced with the high frequency time envelope shape determination unit 120b and the low frequency time envelope correction unit 100d. The point is that the low frequency time envelope correction unit 120e is provided.

本変形例においては、高周波数時間包絡形状決定部120bAと前記高周波数時間包絡形状決定部120bとの相違点は、決定した時間包絡形状を低周波数時間包絡修正部120eへも通知する点である。 In this modification, the difference between the high frequency time envelope shape determination unit 120bA and the high frequency time envelope shape determination unit 120b is that the determined time envelope shape is also notified to the low frequency time envelope correction unit 120e. ..

高周波数時間包絡形状決定部120bAにおける時間包絡形状の決定は、前記の例に加えて、例えば、前記低周波数信号の周波数パワー分布に基づくこともできる。更には、例えば符号化系列逆多重化部120aから得られる高周波数信号の復号の際のフレーム長を用いることができる。例えば、フレーム長が長い場合は平坦である、フレーム長が短い場合は立ち上がりまたは立ち下がりであると決定でき、前記高周波数時間包絡形状決定部120bでも同様に決定できる。 The determination of the time envelope shape in the high frequency time envelope shape determination unit 120bA can be based on, for example, the frequency power distribution of the low frequency signal, in addition to the above example. Further, for example, the frame length at the time of decoding the high frequency signal obtained from the coded sequence demultiplexing unit 120a can be used. For example, if the frame length is long, it can be determined to be flat, if the frame length is short, it can be determined to be rising or falling, and the same can be determined by the high frequency time envelope shape determining unit 120b.

低周波数時間包絡修正部120eと前記低周波数時間包絡修正部100dとの相違点は、低周波数時間包絡形状決定部100cにて決定された時間包絡形状と高周波数時間包絡形状決定部120bAで決定された時間包絡形状のうち少なくとも一つ以上に基づいて、低周波数復号部100bから出力される低周波数信号の時間包絡の形状を修正する点である(S120-4)。 The difference between the low frequency time envelope correction unit 120e and the low frequency time envelope correction unit 100d is determined by the time envelope shape determined by the low frequency time envelope shape determination unit 100c and the high frequency time envelope shape determination unit 120bA. The point is to modify the shape of the time envelope of the low frequency signal output from the low frequency decoding unit 100b based on at least one of the time envelope shapes (S120-4).

例えば、高周波数時間包絡形状決定部120bAにて時間包絡形状が平坦であると決定された場合には、低周波数時間包絡形状決定部100cにて決定される時間包絡形状によらず、低周波数復号部100bから出力される低周波数信号の時間包絡の形状を平坦に修正する。更に例えば、高周波数時間包絡形状決定部120bAにて時間包絡形状が平坦であると決定された場合には、低周波数時間包絡形状決定部100cにて決定される時間包絡形状によらず、低周波数復号部100bから出力される低周波数信号の時間包絡の形状を平坦に修正しない。立ち上がり、立ち下がりの場合も同様であり、時間包絡形状は限定されない。 For example, when the high frequency time envelope shape determination unit 120bA determines that the time envelope shape is flat, the low frequency decoding is performed regardless of the time envelope shape determined by the low frequency time envelope shape determination unit 100c. The shape of the time envelope of the low frequency signal output from part 100b is corrected flat. Further, for example, when the high frequency time envelope shape determining unit 120bA determines that the time envelope shape is flat, the low frequency does not depend on the time envelope shape determined by the low frequency time envelope shape determining unit 100c. The shape of the time envelope of the low frequency signal output from the decoding unit 100b is not corrected flatly. The same applies to the case of rising and falling, and the time envelope shape is not limited.

［第13の実施形態の音声復号装置の第5の変形例］
図189は、第13の実施形態に係る音声復号装置の第5の変形例120Eの構成を示す図である。 [Fifth variant of the audio decoding device of the thirteenth embodiment]
FIG. 189 is a diagram showing a configuration of a fifth modification 120E of the audio decoding device according to the thirteenth embodiment.

図190は、第13の実施形態に係る音声復号装置の第5の変形例120Eの動作を示すフローチャートである。 FIG. 190 is a flowchart showing the operation of the fifth modification 120E of the audio decoding device according to the thirteenth embodiment.

本変形例においては、前記低周波数時間包絡形状決定部120c、前記高周波数時間包絡修正部120d、前記高周波数時間包絡形状決定部120bA、及び前記低周波数時間包絡修正部120eを具備する。 In this modification, the low frequency time envelope shape determining unit 120c, the high frequency time envelope correction unit 120d, the high frequency time envelope shape determining unit 120bA, and the low frequency time envelope correction unit 120e are provided.

［第13の実施形態の音声復号装置の第6の変形例］
図191は、第13の実施形態に係る音声復号装置の第6の変形例120Fの構成を示す図である。 [Sixth variant of the audio decoding device of the thirteenth embodiment]
FIG. 191 is a diagram showing a configuration of a sixth modification 120F of the audio decoding device according to the thirteenth embodiment.

図192は、第13の実施形態に係る音声復号装置の第6の変形例120Fの動作を示すフローチャートである。 FIG. 192 is a flowchart showing the operation of the sixth modification 120F of the audio decoding device according to the thirteenth embodiment.

本変形例と前記第13の実施形態に係る音声復号装置120との相違点は、低周波数時間包絡形状決定部100c及び高周波数時間包絡形状決定部120bにかえて時間包絡形状決定部120fを具備する点である。 The difference between this modification and the voice decoding device 120 according to the thirteenth embodiment is that the time envelope shape determination unit 120f is provided instead of the low frequency time envelope shape determination unit 100c and the high frequency time envelope shape determination unit 120b. It is a point to do.

時間包絡形状決定部120fは、符号化系列逆多重化部120aからの低周波数時間包絡形状に関する情報、高周波数時間包絡形状に関する情報、低周波数復号部100bからの低周波数信号、高周波数復号部100eからの高周波数信号のうち少なくとも一つ以上に基づいて時間包絡形状を決定する(S120-5)。決定した時間包絡形状は、低周波数時間包絡修正部100d、高周波数時間包絡修正部110cに通知される。 The time entrapment shape determination unit 120f is provided with information on the low frequency time entrapment shape from the coded sequence demultiplexing unit 120a, information on the high frequency time encapsulation shape, a low frequency signal from the low frequency decoding unit 100b, and a high frequency decoding unit 100e. Determine the time entrapment shape based on at least one of the high frequency signals from (S120-5). The determined time envelope shape is notified to the low frequency time envelope correction unit 100d and the high frequency time envelope correction unit 110c.

時間包絡形状決定部120fでは、例えば、前記低周波数時間包絡形状決定部100c、及び120c、前記高周波数時間包絡形状決定部120b、及び120bAと同様に時間包絡形状を決定できる。時間包絡形状の決定方法は上記の例に限定されない。 The time envelope shape determining unit 120f can determine the time envelope shape in the same manner as, for example, the low frequency time envelope shape determining units 100c and 120c, the high frequency time envelope shape determining unit 120b, and 120bA. The method for determining the time envelope shape is not limited to the above example.

［第13の実施形態の音声復号装置の第7の変形例］
図193は、第13の実施形態に係る音声復号装置の第7の変形例120Gの構成を示す図である。 [7th variant of the audio decoding device of the 13th embodiment]
FIG. 193 is a diagram showing a configuration of a seventh modification 120G of the audio decoding device according to the thirteenth embodiment.

図194は、第13の実施形態に係る音声復号装置の第7の変形例120Gの動作を示すフローチャートである。 FIG. 194 is a flowchart showing the operation of the seventh modification 120G of the voice decoding device according to the thirteenth embodiment.

本変形例と前記第13の実施形態に係る音声復号装置の第1の変形例120Aとの相違点は、低周波数時間包絡形状決定部100c、高周波数時間包絡修正部110cにかえて、低周波数時間包絡形状決定部120c、高周波数時間包絡修正部120dを具備する点である。 The difference between this modification and the first modification 120A of the voice decoding apparatus according to the thirteenth embodiment is that the low frequency is replaced with the low frequency time envelope shape determination unit 100c and the high frequency time envelope correction unit 110c. The point is that it includes a time envelope shape determining unit 120c and a high frequency time envelope correction unit 120d.

［第13の実施形態の音声復号装置の第8の変形例］
図195は、第13の実施形態に係る音声復号装置の第8の変形例120Hの構成を示す図である。 [Eighth variant of the audio decoding device of the thirteenth embodiment]
FIG. 195 is a diagram showing a configuration of an eighth modification 120H of the audio decoding device according to the thirteenth embodiment.

図196は、第13の実施形態に係る音声復号装置の第8の変形例120Hの動作を示すフローチャートである。 FIG. 196 is a flowchart showing the operation of the eighth modification 120H of the audio decoding device according to the thirteenth embodiment.

本変形例と前記第13の実施形態に係る音声復号装置の第1の変形例120Aとの相違点は、高周波数時間包絡形状決定部120b、低周波数時間包絡修正部100dにかえて、高周波数時間包絡形状決定部120bA、低周波数時間包絡修正部120eを具備する点である。 The difference between this modification and the first modification 120A of the voice decoding apparatus according to the thirteenth embodiment is that the high frequency is replaced with the high frequency time envelope shape determination unit 120b and the low frequency time envelope correction unit 100d. It is provided with a time envelope shape determining unit 120bA and a low frequency time envelope correction unit 120e.

［第13の実施形態の音声復号装置の第9の変形例］
図197は、第13の実施形態に係る音声復号装置の第9の変形例120Iの構成を示す図である。 [Ninth variant of the audio decoding device of the thirteenth embodiment]
FIG. 197 is a diagram showing a configuration of a ninth modification 120I of the audio decoding device according to the thirteenth embodiment.

図198は、第13の実施形態に係る音声復号装置の第9の変形例120Iの動作を示すフローチャートである。 FIG. 198 is a flowchart showing the operation of the ninth modification 120I of the audio decoding device according to the thirteenth embodiment.

［第13の実施形態の音声復号装置の第10の変形例］
図199は、第13の実施形態に係る音声復号装置の第10の変形例120Jの構成を示す図である。 [10th variant of the audio decoding device of the 13th embodiment]
FIG. 199 is a diagram showing a configuration of a tenth modification 120J of the audio decoding device according to the thirteenth embodiment.

図200は、第13の実施形態に係る音声復号装置の第10の変形例120Jの動作を示すフローチャートである。 FIG. 200 is a flowchart showing the operation of the tenth modification 120J of the audio decoding device according to the thirteenth embodiment.

本変形例と前記第13の実施形態に係る音声復号装置の第1の変形例120Aとの相違点は、低周波数時間包絡形状決定部100c及び高周波数時間包絡形状決定部120bにかえて時間包絡形状決定部120fを具備する点である。 The difference between this modification and the first modification 120A of the audio decoding device according to the thirteenth embodiment is that the time envelopment is replaced with the low frequency time envelope shape determination unit 100c and the high frequency time envelope shape determination unit 120b. The point is that the shape determining portion 120f is provided.

［第13の実施形態の音声復号装置の第11の変形例］
図201は、第13の実施形態に係る音声復号装置の第11の変形例120Kの構成を示す図である。 [11th modification of the audio decoding device of the 13th embodiment]
FIG. 201 is a diagram showing a configuration of an eleventh modification 120K of the audio decoding device according to the thirteenth embodiment.

図202は、第13の実施形態に係る音声復号装置の第11の変形例120Kの動作を示すフローチャートである。 FIG. 202 is a flowchart showing the operation of the eleventh modification 120K of the audio decoding device according to the thirteenth embodiment.

本変形例と前記第13の実施形態に係る音声復号装置の第2の変形例120Bとの相違点は、低周波数時間包絡形状決定部100c、高周波数時間包絡修正部110cにかえて、低周波数時間包絡形状決定部120c、高周波数時間包絡修正部120dを具備する点である。 The difference between this modification and the second modification 120B of the voice decoding apparatus according to the thirteenth embodiment is that the low frequency is replaced with the low frequency time envelope shape determination unit 100c and the high frequency time envelope correction unit 110c. The point is that it includes a time envelope shape determining unit 120c and a high frequency time envelope correction unit 120d.

［第13の実施形態の音声復号装置の第12の変形例］
図203は、第13の実施形態に係る音声復号装置の第12の変形例120Lの構成を示す図である。 [12th variant of the audio decoding device of the 13th embodiment]
FIG. 203 is a diagram showing a configuration of a twelfth modification 120L of the audio decoding device according to the thirteenth embodiment.

図204は、第13の実施形態に係る音声復号装置の第12の変形例120Lの動作を示すフローチャートである。 FIG. 204 is a flowchart showing the operation of the twelfth modification 120L of the audio decoding device according to the thirteenth embodiment.

本変形例と前記第13の実施形態に係る音声復号装置の第2の変形例120Bとの相違点は、高周波数時間包絡形状決定部120b、低周波数時間包絡修正部100dにかえて、高周波数時間包絡形状決定部120bA、低周波数時間包絡修正部120eを具備する点である。 The difference between this modification and the second modification 120B of the voice decoding device according to the thirteenth embodiment is that the high frequency is replaced with the high frequency time envelope shape determination unit 120b and the low frequency time envelope correction unit 100d. It is provided with a time envelope shape determining unit 120bA and a low frequency time envelope correction unit 120e.

［第13の実施形態の音声復号装置の第13の変形例］
図205は、第13の実施形態に係る音声復号装置の第13の変形例120Mの構成を示す図である。 [13th modification of the audio decoding device of the 13th embodiment]
FIG. 205 is a diagram showing a configuration of a thirteenth modification 120M of the voice decoding device according to the thirteenth embodiment.

図206は、第13の実施形態に係る音声復号装置の第13の変形例120Mの動作を示すフローチャートである。 FIG. 206 is a flowchart showing the operation of the thirteenth modification 120M of the voice decoding device according to the thirteenth embodiment.

［第13の実施形態の音声復号装置の第14の変形例］
図207は、第13の実施形態に係る音声復号装置の第14の変形例120Nの構成を示す図である。 [14th variant of the audio decoding device of the 13th embodiment]
FIG. 207 is a diagram showing a configuration of a 14th modification 120N of the audio decoding device according to the 13th embodiment.

図208は、第13の実施形態に係る音声復号装置の第14の変形例120Nの動作を示すフローチャートである。 FIG. 208 is a flowchart showing the operation of the 14th modification 120N of the audio decoding device according to the 13th embodiment.

本変形例と前記第13の実施形態に係る音声復号装置の第2の変形例120Bとの相違点は、低周波数時間包絡形状決定部100c及び高周波数時間包絡形状決定部120bにかえて時間包絡形状決定部120fを具備する点である。 The difference between this modification and the second modification 120B of the audio decoding device according to the thirteenth embodiment is that the time envelopment is replaced with the low frequency time envelope shape determination unit 100c and the high frequency time envelope shape determination unit 120b. The point is that the shape determining portion 120f is provided.

［第14の実施形態］
図88は、第14の実施形態に係る音声復号装置130の構成を示す図である。音声復号装置130の通信装置は、下記音声符号化装置230から出力される多重化された符号化系列を受信し、更に、復号した音声信号を外部に出力する。音声復号装置130は、図88に示すように、機能的には、符号化系列逆多重化部110a、低周波数復号部100b、高周波数時間包絡形状決定部110b、高周波数時間包絡修正部130a、高周波数復号部130b、及び低周波数/高周波数信号合成部100fを備える。 [14th Embodiment]
FIG. 88 is a diagram showing a configuration of the audio decoding device 130 according to the 14th embodiment. The communication device of the voice decoding device 130 receives the multiplexed coding sequence output from the following voice coding device 230, and further outputs the decoded voice signal to the outside. As shown in FIG. 88, the voice decoding apparatus 130 functionally includes a coding sequence demultiplexing unit 110a, a low frequency decoding unit 100b, a high frequency time envelope shape determining unit 110b, and a high frequency time envelope correcting unit 130a. It includes a high frequency decoding unit 130b and a low frequency / high frequency signal synthesis unit 100f.

図89は、第13の実施形態に係る音声復号装置の動作を示すフローチャートである。 FIG. 89 is a flowchart showing the operation of the voice decoding device according to the thirteenth embodiment.

高周波数時間包絡修正部130aは、高周波数時間包絡形状決定部110bで決定した時間包絡形状に基づいて、高周波数復号部130bに入力される低周波数信号の時間包絡の形状を修正する（ステップS130-1）。高周波数時間包絡修正部130aにおける時間包絡形状の修正は、例えば、時間包絡修正部1dにおける復号信号の時間包絡形状の修正処理において、音声復号部1bで得られる復号信号を、低周波数復号部100bで得られた低周波数信号に置き換えることにより実現できる。 The high frequency time envelope correction unit 130a corrects the shape of the time envelope of the low frequency signal input to the high frequency decoding unit 130b based on the time envelope shape determined by the high frequency time envelope shape determination unit 110b (step S130). -1). In the correction of the time envelope shape in the high frequency time envelope correction unit 130a, for example, in the time envelope shape correction process of the decoding signal in the time envelope correction unit 1d, the decoding signal obtained by the voice decoding unit 1b is obtained by the low frequency decoding unit 100b. It can be realized by replacing it with the low frequency signal obtained in.

高周波数復号部130bは、符号化系列逆多重化部100aにて分割された高周波数符号化部分を復号し、高周波数信号を得る（ステップS130-2）。 The high frequency decoding unit 130b decodes the high frequency coded portion divided by the coded sequence demultiplexing unit 100a to obtain a high frequency signal (step S130-2).

高周波数復号部130bでは、高周波数信号の復号において低周波数復号部で得られた低周波数復号信号を利用する際に、高周波数時間包絡修正部130aで時間包絡形状を修正された低周波数信号を利用する点が高周波数復号部100eと異なる点である。 In the high frequency decoding unit 130b, when the low frequency decoding signal obtained by the low frequency decoding unit is used in decoding the high frequency signal, the high frequency time wrapping correction unit 130a corrects the time wrapping shape of the low frequency signal. The point of use is different from the high frequency decoding unit 100e.

図90は、第14の実施形態に係る音声符号化装置230の構成を示す図である。音声符号化装置230の通信装置は、符号化の対象となる音声信号を外部から受信し、更に、符号化された符号化系列を外部に出力する。音声符号化装置230は、図90に示すように、機能的には、低周波数符号化部200a、高周波数符号化部200b、高周波数時間包絡情報符号化部230a、及び符号化系列多重化部210bを備える。 FIG. 90 is a diagram showing the configuration of the voice coding device 230 according to the 14th embodiment. The communication device of the voice coding device 230 receives the voice signal to be coded from the outside, and further outputs the coded coding sequence to the outside. As shown in FIG. 90, the voice coding device 230 functionally includes a low frequency coding unit 200a, a high frequency coding unit 200b, a high frequency time envelope information coding unit 230a, and a coding sequence multiplexing unit. Equipped with 210b.

図91は、第14の実施形態に係る音声符号化装置230の動作を示すフローチャートである。 FIG. 91 is a flowchart showing the operation of the voice coding device 230 according to the fourteenth embodiment.

高周波数時間包絡情報符号化部230aは、入力音声信号、低周波数符号化部200aにおける入力音声信号の符号化結果を含む符号化の過程で得られた情報、高周波数符号化部200bにおける入力音声信号の符号化結果を含む符号化の過程で得られた情報のうち少なくとも一つ以上に基づき、高周波数時間包絡形状情報を算出し符号化する（ステップS230-1）。 The high frequency time wrapping information coding unit 230a is the input voice signal, the information obtained in the coding process including the coding result of the input voice signal in the low frequency coding unit 200a, and the input voice in the high frequency coding unit 200b. High frequency time entrapment shape information is calculated and encoded based on at least one or more of the information obtained in the coding process including the signal coding result (step S230-1).

高周波数時間包絡形状情報の算出、符号化処理は、例えば、低周波数時間包絡情報符号化部200cにおける低周波数信号の時間包絡情報の算出、符号化処理と同様にして実現できる。ただし、高周波数時間包絡形状情報の算出、符号化処理は、高周波数符号化部200bにおける入力音声信号の符号化結果を含む符号化の過程で得られた情報をも用いることができる点で、入力音声信号の低周波数復号信号を用いる低周波数信号の時間包絡情報の算出、符号化処理とは異なる。 The calculation and coding process of the high frequency time envelope shape information can be realized in the same manner as the calculation and coding process of the time envelope information of the low frequency signal in the low frequency time envelope information coding unit 200c, for example. However, in the calculation and coding process of the high frequency time wrapping shape information, the information obtained in the coding process including the coding result of the input voice signal in the high frequency coding unit 200b can also be used. It is different from the calculation and coding processing of the time wrapping information of the low frequency signal using the low frequency decoding signal of the input voice signal.

［第15の実施形態］
図92は、第15の実施形態に係る音声復号装置140の構成を示す図である。音声復号装置140の通信装置は、下記音声符号化装置240から出力される多重化された符号化系列を受信し、更に、復号した音声信号を外部に出力する。音声復号装置140は、図92に示すように、機能的には、符号化系列逆多重化部120a、低周波数復号部100b、低周波数時間包絡形状決定部100c、低周波数時間包絡修正部100d、高周波数時間包絡形状決定部120b、高周波数時間包絡修正部130a、高周波数復号部130b、及び低周波数/高周波数信号合成部100fを備える。 [15th Embodiment]
FIG. 92 is a diagram showing a configuration of the audio decoding device 140 according to the fifteenth embodiment. The communication device of the voice decoding device 140 receives the multiplexed coding sequence output from the following voice coding device 240, and further outputs the decoded voice signal to the outside. As shown in FIG. 92, the voice decoding apparatus 140 functionally includes a coding sequence demultiplexing unit 120a, a low frequency decoding unit 100b, a low frequency time envelope shape determining unit 100c, and a low frequency time envelope correction unit 100d. It includes a high frequency time envelope shape determination unit 120b, a high frequency time envelope correction unit 130a, a high frequency decoding unit 130b, and a low frequency / high frequency signal synthesis unit 100f.

図93は、第15の実施形態に係る音声復号装置の動作を示すフローチャートである。符号化系列逆多重化部120a及び高周波数時間包絡形状決定部120bは、第13の実施形態における符号化系列逆多重化部120a及び高周波数時間包絡形状決定部120bと同様の動作を行う（ステップS120-1、S120-2）。高周波数時間包絡修正部130a及び高周波数復号部130bは、第14の実施形態における高周波数時間包絡修正部130a及び高周波数復号部130bと同様の動作を行う（ステップS130-1、S130-2）。 FIG. 93 is a flowchart showing the operation of the voice decoding device according to the fifteenth embodiment. The coded sequence demultiplexing section 120a and the high frequency time envelope shape determining section 120b perform the same operations as the coded sequence demultiplexing section 120a and the high frequency time envelope shape determining section 120b in the thirteenth embodiment (step). S120-1, S120-2). The high frequency time envelope correction unit 130a and the high frequency decoding unit 130b perform the same operations as the high frequency time envelope correction unit 130a and the high frequency decoding unit 130b in the 14th embodiment (steps S130-1, S130-2). ..

図94は、第15の実施形態に係る音声符号化装置240の構成を示す図である。音声符号化装置240の通信装置は、符号化の対象となる音声信号を外部から受信し、更に、符号化された符号化系列を外部に出力する。音声符号化装置240は、図94に示すように、機能的には、低周波数符号化部200a、高周波数符号化部200b、低周波数時間包絡情報符号化部200c、高周波数時間包絡情報符号化部220a、及び符号化系列多重化部220bを備える。 FIG. 94 is a diagram showing the configuration of the voice coding device 240 according to the fifteenth embodiment. The communication device of the voice coding device 240 receives the voice signal to be coded from the outside, and further outputs the coded coding sequence to the outside. As shown in FIG. 94, the voice coding device 240 functionally has a low frequency coding unit 200a, a high frequency coding unit 200b, a low frequency time wrapping information coding unit 200c, and a high frequency time wrapping information coding unit. A unit 220a and a coded sequence multiplexing unit 220b are provided.

図95は、第15の実施形態に係る音声符号化装置240の動作を示すフローチャートである。 FIG. 95 is a flowchart showing the operation of the voice coding device 240 according to the fifteenth embodiment.

［第15の実施形態の音声復号装置の第1の変形例］
図96は、第15の実施形態に係る音声復号装置の第1の変形例140Aの構成を示す図である。 [First Modified Example of Audio Decoding Device of Fifteenth Embodiment]
FIG. 96 is a diagram showing a configuration of a first modification 140A of the audio decoding device according to the fifteenth embodiment.

図97は、第15の実施形態に係る音声復号装置の第1の変形例140Aの動作を示すフローチャートである。 FIG. 97 is a flowchart showing the operation of the first modification 140A of the audio decoding device according to the fifteenth embodiment.

高周波数時間包絡修正部140aは、高周波数時間包絡形状決定部120bで決定した時間包絡形状に基づいて、低周波数時間包絡修正部100dにて時間包絡形状を修正された低周波数信号の時間包絡の形状を修正する（ステップS140-1）。高周波数時間包絡修正部130aとの相違点は、入力信号が低周波数時間包絡修正部100dにて時間包絡形状を修正された低周波数信号である点である。 The high frequency time envelope correction unit 140a is a low frequency signal time envelope whose time envelope shape is corrected by the low frequency time envelope correction unit 100d based on the time envelope shape determined by the high frequency time envelope shape determination unit 120b. Correct the shape (step S140-1). The difference from the high frequency time envelope correction unit 130a is that the input signal is a low frequency signal whose time envelope shape is corrected by the low frequency time envelope correction unit 100d.

［第15の実施形態の音声復号装置の第2の変形例］
図98は、第15の実施形態に係る音声復号装置の第2の変形例140Bの構成を示す図である。 [Second variant of the audio decoding device of the fifteenth embodiment]
FIG. 98 is a diagram showing a configuration of a second modification 140B of the audio decoding device according to the fifteenth embodiment.

当該実施形態の音声復号装置の第1の変形例との相違点は、低周波数/高周波数信号合成部100fでの合成処理に用いられる低周波数信号が、低周波数時間包絡修正部100dで時間包絡形状を修正された低周波数信号に代えて、低周波数復号部100bで復号された低周波数信号である点である。 The difference from the first modification of the voice decoding apparatus of the embodiment is that the low frequency signal used for the synthesis processing in the low frequency / high frequency signal synthesis unit 100f is time wrapped in the low frequency time wrapping correction unit 100d. This is a low-frequency signal decoded by the low-frequency decoding unit 100b instead of the low-frequency signal whose shape has been modified.

［第15の実施形態の音声復号装置の第3の変形例］
図209は、第15の実施形態に係る音声復号装置の第3の変形例140Cの構成を示す図である。 [Third variant of the audio decoding device of the fifteenth embodiment]
FIG. 209 is a diagram showing a configuration of a third modification 140C of the audio decoding device according to the fifteenth embodiment.

図210は、第15の実施形態に係る音声復号装置の第3の変形例140Cの動作を示すフローチャートである。 FIG. 210 is a flowchart showing the operation of the third modification 140C of the audio decoding device according to the fifteenth embodiment.

本変形例と前記第15の実施形態に係る音声復号装置140との相違点は、低周波数時間包絡形状決定部100c、高周波数時間包絡修正部130aにかえて、低周波数時間包絡形状決定部120c、高周波数時間包絡修正部140bを具備する点である。 The difference between this modification and the voice decoding device 140 according to the fifteenth embodiment is that the low frequency time envelope shape determination unit 120c is replaced with the low frequency time envelope shape determination unit 100c and the high frequency time envelope correction unit 130a. The point is that the high frequency time envelope correction unit 140b is provided.

高周波数時間包絡修正部140bと前記高周波数時間包絡修正部130aとの相違点は、高周波数時間包絡形状決定部120bにて決定された時間包絡形状と低周波数時間包絡形状決定部120cで決定された時間包絡形状のうち少なくとも一つ以上に基づいて、高周波数復号部130bへ入力される低周波数信号の時間包絡の形状を修正する点である(S140-2)。 The difference between the high frequency time envelope correction unit 140b and the high frequency time envelope correction unit 130a is determined by the time envelope shape determined by the high frequency time envelope shape determination unit 120b and the low frequency time envelope shape determination unit 120c. The point is to modify the shape of the time envelope of the low frequency signal input to the high frequency decoding unit 130b based on at least one of the time envelope shapes (S140-2).

例えば、低周波数時間包絡形状決定部120cにて時間包絡形状が平坦であると決定された場合には、高周波数時間包絡形状決定部120bにて決定される時間包絡形状によらず、高周波数復号部130bへ入力される低周波数信号の時間包絡の形状を平坦に修正する。更に例えば、低周波数時間包絡形状決定部120cにて時間包絡形状が平坦でないと決定された場合には、高周波数時間包絡形状決定部120bにて決定される時間包絡形状によらず、高周波数復号部130bへ入力される低周波数信号の時間包絡の形状を平坦に修正しない。立ち上がり、立ち下がりの場合も同様であり、時間包絡形状は限定されない。 For example, when the low frequency time envelope shape determining unit 120c determines that the time envelope shape is flat, the high frequency decoding is performed regardless of the time envelope shape determined by the high frequency time envelope shape determining unit 120b. The shape of the time envelope of the low frequency signal input to part 130b is corrected flat. Further, for example, when the low frequency time envelope shape determining unit 120c determines that the time envelope shape is not flat, the high frequency decoding is performed regardless of the time envelope shape determined by the high frequency time envelope shape determining unit 120b. Do not flatten the shape of the time envelope of the low frequency signal input to part 130b. The same applies to the case of rising and falling, and the time envelope shape is not limited.

［第15の実施形態の音声復号装置の第4の変形例］
図211は、第15の実施形態に係る音声復号装置の第4の変形例140Dの構成を示す図である。 [Fourth variant of the audio decoding device of the fifteenth embodiment]
FIG. 211 is a diagram showing a configuration of a fourth modification 140D of the audio decoding device according to the fifteenth embodiment.

図212は、第15の実施形態に係る音声復号装置の第4の変形例140Dの動作を示すフローチャートである。 FIG. 212 is a flowchart showing the operation of the fourth modification 140D of the audio decoding device according to the fifteenth embodiment.

本変形例と前記第15の実施形態に係る音声復号装置140との相違点は、高周波数時間包絡形状決定部120b、低周波数時間包絡修正部100dにかえて、高周波数時間包絡形状決定部120bA、低周波数時間包絡修正部120eを具備する点である。 The difference between this modification and the voice decoding device 140 according to the fifteenth embodiment is that the high frequency time envelope shape determination unit 120bA is replaced with the high frequency time envelope shape determination unit 120b and the low frequency time envelope correction unit 100d. The point is that the low frequency time envelope correction unit 120e is provided.

［第15の実施形態の音声復号装置の第5の変形例］
図213は、第15の実施形態に係る音声復号装置の第5の変形例140Eの構成を示す図である。 [Fifth variant of the audio decoding device of the fifteenth embodiment]
FIG. 213 is a diagram showing a configuration of a fifth modification 140E of the audio decoding device according to the fifteenth embodiment.

図214は、第15の実施形態に係る音声復号装置の第5の変形例140Eの動作を示すフローチャートである。 FIG. 214 is a flowchart showing the operation of the fifth modification 140E of the audio decoding device according to the fifteenth embodiment.

本変形例においては、前記低周波数時間包絡形状決定部120c、前記高周波数時間包絡修正部140b、前記高周波数時間包絡形状決定部120bA、及び前記低周波数時間包絡修正部120eを具備する。 In this modification, the low frequency time envelope shape determining unit 120c, the high frequency time envelope correction unit 140b, the high frequency time envelope shape determining unit 120bA, and the low frequency time envelope correction unit 120e are provided.

［第15の実施形態の音声復号装置の第6の変形例］
図215は、第15の実施形態に係る音声復号装置の第6の変形例140Fの構成を示す図である。 [Sixth variant of the audio decoding device of the fifteenth embodiment]
FIG. 215 is a diagram showing a configuration of a sixth modification 140F of the audio decoding device according to the fifteenth embodiment.

図216は、第15の実施形態に係る音声復号装置の第6の変形例140Fの動作を示すフローチャートである。 FIG. 216 is a flowchart showing the operation of the sixth modification 140F of the audio decoding device according to the fifteenth embodiment.

本変形例と前記第15の実施形態に係る音声復号装置140との相違点は、低周波数時間包絡形状決定部100c及び高周波数時間包絡形状決定部120bにかえて時間包絡形状決定部120fを具備する点である。 The difference between this modification and the voice decoding device 140 according to the fifteenth embodiment is that the time envelope shape determination unit 120f is provided instead of the low frequency time envelope shape determination unit 100c and the high frequency time envelope shape determination unit 120b. It is a point to do.

［第15の実施形態の音声復号装置の第7の変形例］
図217は、第15の実施形態に係る音声復号装置の第7の変形例140Gの構成を示す図である。 [7th variant of the audio decoding device of the 15th embodiment]
FIG. 217 is a diagram showing a configuration of a seventh modification 140G of the audio decoding device according to the fifteenth embodiment.

図218は、第15の実施形態に係る音声復号装置の第7の変形例140Gの動作を示すフローチャートである。 FIG. 218 is a flowchart showing the operation of the seventh modification 140G of the voice decoding device according to the fifteenth embodiment.

本変形例と前記第15の実施形態に係る音声復号装置の第1の変形例140Aとの相違点は、低周波数時間包絡形状決定部100c、高周波数時間包絡修正部140aにかえて、低周波数時間包絡形状決定部120c、高周波数時間包絡修正部140bを具備する点である。 The difference between this modification and the first modification 140A of the voice decoding apparatus according to the fifteenth embodiment is that the low frequency is replaced with the low frequency time envelope shape determination unit 100c and the high frequency time envelope correction unit 140a. The point is that it includes a time envelope shape determining unit 120c and a high frequency time envelope correction unit 140b.

本変形例においては、高周波数時間包絡修正部140bは、高周波数時間包絡形状決定部120bにて決定された時間包絡形状と低周波数時間包絡形状決定部120cで決定された時間包絡形状のうち少なくとも一つ以上に基づいて、高周波数復号部130bへ入力される時間包絡形状を修正された低周波数信号の時間包絡の形状を修正する(S140-2)。 In this modification, the high frequency time envelope correction unit 140b is at least one of the time envelope shape determined by the high frequency time envelope shape determination unit 120b and the time envelope shape determined by the low frequency time envelope shape determination unit 120c. Based on one or more, the shape of the time envelope of the low frequency signal whose time envelope shape is corrected to be input to the high frequency decoding unit 130b is corrected (S140-2).

［第15の実施形態の音声復号装置の第8の変形例］
図219は、第15の実施形態に係る音声復号装置の第8の変形例140Hの構成を示す図である。 [Eighth variant of the audio decoding device of the fifteenth embodiment]
FIG. 219 is a diagram showing a configuration of an eighth modification 140H of the audio decoding device according to the fifteenth embodiment.

図220は、第15の実施形態に係る音声復号装置の第8の変形例140Hの動作を示すフローチャートである。 FIG. 220 is a flowchart showing the operation of the eighth modification 140H of the audio decoding device according to the fifteenth embodiment.

本変形例と前記第15の実施形態に係る音声復号装置の第1の変形例140Aとの相違点は、高周波数時間包絡形状決定部120b、低周波数時間包絡修正部100dにかえて、高周波数時間包絡形状決定部120bA、低周波数時間包絡修正部120eを具備する点である。 The difference between this modification and the first modification 140A of the voice decoding apparatus according to the fifteenth embodiment is that the high frequency is replaced with the high frequency time envelope shape determination unit 120b and the low frequency time envelope correction unit 100d. It is provided with a time envelope shape determining unit 120bA and a low frequency time envelope correction unit 120e.

［第15の実施形態の音声復号装置の第9の変形例］
図221は、第15の実施形態に係る音声復号装置の第9の変形例140Iの構成を示す図である。 [Ninth modification of the audio decoding device of the fifteenth embodiment]
FIG. 221 is a diagram showing a configuration of a ninth modification 140I of the audio decoding device according to the fifteenth embodiment.

図222は、第15の実施形態に係る音声復号装置の第9の変形例140Iの動作を示すフローチャートである。 FIG. 222 is a flowchart showing the operation of the ninth modification 140I of the audio decoding device according to the fifteenth embodiment.

［第15の実施形態の音声復号装置の第10の変形例］
図223は、第15の実施形態に係る音声復号装置の第10の変形例140Jの構成を示す図である。 [10th variant of the audio decoding device of the 15th embodiment]
FIG. 223 is a diagram showing a configuration of a tenth modification 140J of the audio decoding device according to the fifteenth embodiment.

図224は、第15の実施形態に係る音声復号装置の第10の変形例140Jの動作を示すフローチャートである。 FIG. 224 is a flowchart showing the operation of the tenth modification 140J of the audio decoding device according to the fifteenth embodiment.

本変形例と前記第15の実施形態に係る音声復号装置の第1の変形例140Aとの相違点は、低周波数時間包絡形状決定部100c及び高周波数時間包絡形状決定部120bにかえて時間包絡形状決定部120fを具備する点である。 The difference between this modification and the first modification 140A of the voice decoding apparatus according to the fifteenth embodiment is that the time envelopment is replaced with the low frequency time envelope shape determination unit 100c and the high frequency time envelope shape determination unit 120b. The point is that the shape determining portion 120f is provided.

［第15の実施形態の音声復号装置の第11の変形例］
図225は、第15の実施形態に係る音声復号装置の第11の変形例140Kの構成を示す図である。 [11th modification of the audio decoding device of the 15th embodiment]
FIG. 225 is a diagram showing a configuration of an eleventh modification 140K of the audio decoding device according to the fifteenth embodiment.

図226は、第15の実施形態に係る音声復号装置の第11の変形例140Kの動作を示すフローチャートである。 FIG. 226 is a flowchart showing the operation of the eleventh modification 140K of the audio decoding device according to the fifteenth embodiment.

本変形例と前記第15の実施形態に係る音声復号装置の第2の変形例140Bとの相違点は、低周波数時間包絡形状決定部100c、高周波数時間包絡修正部140aにかえて、低周波数時間包絡形状決定部120c、高周波数時間包絡修正部140bを具備する点である。 The difference between this modification and the second modification 140B of the voice decoding apparatus according to the fifteenth embodiment is that the low frequency is replaced with the low frequency time envelope shape determination unit 100c and the high frequency time envelope correction unit 140a. The point is that it includes a time envelope shape determining unit 120c and a high frequency time envelope correction unit 140b.

［第15の実施形態の音声復号装置の第12の変形例］
図227は、第15の実施形態に係る音声復号装置の第12の変形例140Lの構成を示す図である。 [12th variant of the audio decoding device of the 15th embodiment]
FIG. 227 is a diagram showing a configuration of a twelfth modification 140L of the audio decoding device according to the fifteenth embodiment.

図228は、第15の実施形態に係る音声復号装置の第12の変形例140Lの動作を示すフローチャートである。 FIG. 228 is a flowchart showing the operation of the twelfth modification 140L of the audio decoding device according to the fifteenth embodiment.

本変形例と前記第15の実施形態に係る音声復号装置の第2の変形例140Bとの相違点は、高周波数時間包絡形状決定部120b、低周波数時間包絡修正部100dにかえて、高周波数時間包絡形状決定部120bA、低周波数時間包絡修正部120eを具備する点である。 The difference between this modification and the second modification 140B of the voice decoding apparatus according to the fifteenth embodiment is that the high frequency is replaced with the high frequency time envelope shape determination unit 120b and the low frequency time envelope correction unit 100d. It is provided with a time envelope shape determining unit 120bA and a low frequency time envelope correction unit 120e.

［第15の実施形態の音声復号装置の第13の変形例］
図229は、第15の実施形態に係る音声復号装置の第13の変形例140Mの構成を示す図である。 [13th variant of the audio decoding device of the 15th embodiment]
FIG. 229 is a diagram showing a configuration of a thirteenth modification 140M of the audio decoding device according to the fifteenth embodiment.

図230は、第15の実施形態に係る音声復号装置の第13の変形例140Mの動作を示すフローチャートである。 FIG. 230 is a flowchart showing the operation of the thirteenth modification 140M of the audio decoding device according to the fifteenth embodiment.

［第15の実施形態の音声復号装置の第14の変形例］
図231は、第15の実施形態に係る音声復号装置の第14の変形例140Nの構成を示す図である。 [14th modification of the audio decoding device of the 15th embodiment]
FIG. 231 is a diagram showing a configuration of a 14th modification 140N of the audio decoding device according to the 15th embodiment.

図232は、第15の実施形態に係る音声復号装置の第14の変形例140Nの動作を示すフローチャートである。 FIG. 232 is a flowchart showing the operation of the 14th modification 140N of the audio decoding device according to the 15th embodiment.

本変形例と前記第15の実施形態に係る音声復号装置の第2の変形例140Bとの相違点は、低周波数時間包絡形状決定部100c及び高周波数時間包絡形状決定部120bにかえて時間包絡形状決定部120fを具備する点である。 The difference between this modification and the second modification 140B of the audio decoding device according to the fifteenth embodiment is that the time envelopment is replaced with the low frequency time envelope shape determination unit 100c and the high frequency time envelope shape determination unit 120b. The point is that the shape determining portion 120f is provided.

［第16の実施形態］
図99は、第16の実施形態に係る音声復号装置150の構成を示す図である。音声復号装置150の通信装置は、下記音声符号化装置250から出力される多重化された符号化系列を受信し、更に、復号した音声信号を外部に出力する。音声復号装置150は、図99に示すように、機能的には、符号化系列逆多重化部150a、スイッチ群150b，低周波数復号部100b、低周波数時間包絡形状決定部100c、低周波数時間包絡修正部100d、高周波数復号部100e、高周波数時間包絡形状決定部120b、高周波数時間包絡修正部110c、及び低周波数/高周波数信号合成部150cを備える。 [16th Embodiment]
FIG. 99 is a diagram showing the configuration of the audio decoding device 150 according to the 16th embodiment. The communication device of the voice decoding device 150 receives the multiplexed coding sequence output from the following voice coding device 250, and further outputs the decoded voice signal to the outside. As shown in FIG. 99, the voice decoding device 150 functionally includes a coding sequence demultiplexing unit 150a, a switch group 150b, a low frequency decoding unit 100b, a low frequency time envelope shape determining unit 100c, and a low frequency time envelope. It includes a correction unit 100d, a high frequency decoding unit 100e, a high frequency time envelope shape determination unit 120b, a high frequency time envelope correction unit 110c, and a low frequency / high frequency signal synthesis unit 150c.

図100は、第16の実施形態に係る音声復号装置の動作を示すフローチャートである。 FIG. 100 is a flowchart showing the operation of the voice decoding device according to the sixteenth embodiment.

符号化系列逆多重化部150aは、符号化系列を、高周波数信号生成制御情報、低周波数符号化部分、時間包絡形状に関する情報に分割する（ステップS150-1）。 The coded sequence demultiplexing unit 150a divides the coded sequence into high frequency signal generation control information, a low frequency coded portion, and information on the time envelope shape (step S150-1).

符号化系列逆多重化部150aで得られた高周波数信号生成制御情報に基づき、高周波数信号を生成するか否かを判断する（ステップS150-2）。 Based on the high frequency signal generation control information obtained by the coded sequence demultiplexing unit 150a, it is determined whether or not to generate a high frequency signal (step S150-2).

高周波数信号を生成する場合、符号化系列逆多重化部150aは、符号化系列から高周波数符号化部分を抽出する(ステップS150-3)。そして、当該符号化系列の高周波数符号化部分を用いて高周波数信号を生成し、さらに高周波数信号の時間包絡形状を決定して、高周波数信号の時間包絡形状を修正する。 When generating a high frequency signal, the coded sequence demultiplexing unit 150a extracts a high frequency coded portion from the coded sequence (step S150-3). Then, a high frequency signal is generated using the high frequency coding portion of the coding series, and the time envelope shape of the high frequency signal is further determined to correct the time envelope shape of the high frequency signal.

なお、ステップS150-2およびS150-3の処理を行う順番については、高周波数時間包絡形状の決定及び高周波数符号化部分を復号の処理の前であればよく、図100のフローチャートの順番に制限されない。 The order of processing steps S150-2 and S150-3 may be limited to the order of the flowchart of FIG. 100 as long as the high frequency time envelope shape is determined and the high frequency coded portion is before the decoding process. Not done.

低周波数/高周波数信号合成部150cは、前記高周波数信号生成情報に基づき高周波数信号を生成すると判断された場合、時間包絡形状を修正された低周波数信号と時間包絡形状を修正された高周波数信号から出力音声信号を合成し、前記高周波数信号生成情報に基づき高周波数信号を生成しないと判断された場合、時間包絡形状を修正された低周波数信号から出力音声信号を合成する(ステップS150-4)。ただし、高周波数信号を生成しないと判断された場合で、時間包絡形状を修正された低周波数信号が出力できる状態で低周波数/高周波数信号合成部150cに入力された場合、入力された低周波数信号をそのまま出力することもできる。 When the low-frequency / high-frequency signal synthesizer 150c determines that a high-frequency signal is generated based on the high-frequency signal generation information, the low-frequency signal having the corrected time-wrapping shape and the high-frequency signal having the corrected time-wrapping shape have been corrected. When the output audio signal is synthesized from the signal and it is determined that the high frequency signal is not generated based on the high frequency signal generation information, the output audio signal is synthesized from the low frequency signal whose time-wrapping shape is corrected (step S150-). Four). However, if it is determined that a high frequency signal is not generated and the low frequency signal with the corrected time wrapping shape can be output and is input to the low frequency / high frequency signal synthesizer 150c, the input low frequency The signal can be output as it is.

図101は、第16の実施形態に係る音声符号化装置250の構成を示す図である。音声符号化装置250の通信装置は、符号化の対象となる音声信号を外部から受信し、更に、符号化された符号化系列を外部に出力する。音声符号化装置250は、図101に示すように、機能的には、高周波数信号生成制御情報符号化部250a、低周波数符号化部200a、高周波数符号化部200b、低周波数時間包絡情報符号化部200c、高周波数時間包絡情報符号化部220a、及び符号化系列多重化部250bを備える。 FIG. 101 is a diagram showing a configuration of a voice coding device 250 according to a sixteenth embodiment. The communication device of the voice coding device 250 receives the voice signal to be coded from the outside, and further outputs the coded coding sequence to the outside. As shown in FIG. 101, the voice coding device 250 functionally has a high frequency signal generation control information coding unit 250a, a low frequency coding unit 200a, a high frequency coding unit 200b, and a low frequency time wrapping information code. It includes a conversion unit 200c, a high frequency time wrapping information coding unit 220a, and a coding sequence multiplexing unit 250b.

図102は、第16の実施形態に係る音声符号化装置250の動作を示すフローチャートである。 FIG. 102 is a flowchart showing the operation of the voice coding device 250 according to the sixteenth embodiment.

高周波数信号生成制御情報符号化部250aは、入力音声信号、高周波数信号生成制御指示信号のうち少なくとも一つに基づいて高周波数信号を生成するか否かを決定し、高周波数信号生成制御情報を符号化する(ステップS250-1)。例えば、入力音声信号が高周波数符号化部200bにて符号化する周波数帯域の信号を含む場合は、高周波数信号を生成すると決定することができる。さらに例えば、高周波数信号生成制御指示信号により高周波数信号を生成することを指示された場合は、高周波数信号を生成すると決定することができる。さらに例えば、前記2つの方法を組み合わせることもでき、例えば前記2つの方法のうち少なくとも一つの方法にて高周波数信号を生成すると判断した場合には、高周波数信号を生成すると決定できる。 The high frequency signal generation control information coding unit 250a determines whether or not to generate a high frequency signal based on at least one of an input voice signal and a high frequency signal generation control instruction signal, and high frequency signal generation control information. Is encoded (step S250-1). For example, when the input audio signal includes a signal in the frequency band encoded by the high frequency coding unit 200b, it can be determined to generate the high frequency signal. Further, for example, when it is instructed to generate a high frequency signal by a high frequency signal generation control instruction signal, it can be determined to generate a high frequency signal. Further, for example, the two methods can be combined, and for example, when it is determined that the high frequency signal is generated by at least one of the two methods, it can be determined that the high frequency signal is generated.

高周波数信号生成制御情報は、例えば高周波数信号を生成するか否かを1ビットで表すことで符号化できる。 The high frequency signal generation control information can be encoded, for example, by expressing with one bit whether or not to generate a high frequency signal.

ただし、高周波数信号を生成するか否かの決定、及び高周波数信号生成制御情報の符号化方法は限定されない。 However, the method of determining whether or not to generate a high frequency signal and the method of encoding the high frequency signal generation control information are not limited.

高周波数信号生成制御情報符号化部250aにて高周波数信号を生成すると決定した場合は、高周波数符号化部200bにて入力音声信号の高周波数成分にあたる高周波数信号を符号化し、高周波数時間包絡情報符号化部220aにて高周波数時間包絡形状情報を算出し符号化する。一方、高周波数信号生成制御情報符号化部250aにて高周波数信号を生成しないと判断した場合、前記高周波数信号の符号化及び高周波数時間包絡形状情報の算出、符号化は実施されない(ステップS250-2)。 When the high frequency signal generation control information coding unit 250a determines to generate a high frequency signal, the high frequency coding unit 200b encodes the high frequency signal corresponding to the high frequency component of the input audio signal, and the high frequency time wrapping The information coding unit 220a calculates and encodes the high frequency time wrapping shape information. On the other hand, when the high frequency signal generation control information coding unit 250a determines that the high frequency signal is not generated, the high frequency signal is not coded and the high frequency time wrapping shape information is not calculated or coded (step S250). -2).

符号化系列多重化部250cは、高周波数信号生成制御情報符号化部250aより符号化された高周波数信号生成制御情報を受け取り、低周波数符号化部200aより低周波数音声信号の符号化系列を受け取り、低周波数時間包絡情報符号化部200cより符号化された低周波数時間包絡形状情報を受け取り、これらに加えて高周波数信号生成制御情報符号化部250aにて高周波数信号を生成すると決定した場合には、高周波数符号化部200bより高周波数音声信号の符号化系列を、高周波数時間包絡情報符号化部210aより符号化された高周波数時間包絡形状情報を受け取り、多重化して符号化系列として出力する（ステップS250-3）。 The coded sequence multiplexing unit 250c receives the high frequency signal generation control information encoded from the high frequency signal generation control information coding unit 250a, and receives the coded sequence of the low frequency audio signal from the low frequency coding unit 200a. , When the low frequency time wrapping shape information encoded from the low frequency time wrapping information coding unit 200c is received, and in addition to these, the high frequency signal generation control information coding unit 250a determines to generate a high frequency signal. Receives the coded sequence of the high frequency audio signal from the high frequency coding unit 200b, receives the high frequency time wrapping shape information encoded from the high frequency time wrapping information coding unit 210a, multiplexes it, and outputs it as a coded sequence. (Step S250-3).

高周波数信号生成制御情報符号化部250aにて高周波数信号を生成すると決定した場合には、低周波数時間包絡形状に関する情報、及び高周波数時間包絡形状に関する情報の符号化に関して、例えば、別々に符号化された低周波数時間包絡形状に関する情報、及び高周波数時間包絡形状に関する情報を受け取ることもでき、また低周波数時間包絡形状に関する情報、及び高周波数時間包絡形状に関する情報を組み合わせて符号化された形式で受け取ることもできる。さらには、例えば、単一の情報により表され符号化された当該低周波数時間包絡形状に関する情報、及び当該高周波数時間包絡形状に関する情報を受け取ることもできる。 When the high-frequency signal generation control information coding unit 250a determines to generate a high-frequency signal, for example, the information regarding the low-frequency time-enclosed shape and the information regarding the high-frequency time-enclosed shape are coded separately. It is also possible to receive information on the low-frequency time-enclosed shape and the information on the high-frequency time-enclosed shape, and a format encoded by combining information on the low-frequency time-enclosed shape and information on the high-frequency time-enclosed shape. You can also receive it at. Further, for example, it is possible to receive information on the low frequency time envelope shape represented and encoded by a single piece of information, and information on the high frequency time envelope shape.

［第16の実施形態の音声復号装置の第1の変形例］
図103は、第16の実施形態に係る音声復号装置の第1の変形例150Aの構成を示す図である。 [First modification of the audio decoding device of the 16th embodiment]
FIG. 103 is a diagram showing a configuration of a first modification 150A of the audio decoding device according to the sixteenth embodiment.

図104は、第16の実施形態に係る音声復号装置の第1の変形例150Aの動作を示すフローチャートである。第16の実施形態の音声復号装置150との相違点は、高周波数復号部100eAにて、高周波数信号の復号に低周波数時間包絡修正部100dで時間包絡形状を修正された低周波数信号を利用する点である。図104のステップ100-5Aでは、高周波数信号の復号において低周波数復号部100bで得られた低周波数復号信号を利用する際に、低周波数時間包絡修正部100dで時間包絡形状を修正された低周波数信号を利用する。 FIG. 104 is a flowchart showing the operation of the first modification 150A of the audio decoding device according to the sixteenth embodiment. The difference from the voice decoding device 150 of the 16th embodiment is that the high frequency decoding unit 100eA uses the low frequency signal whose time envelope shape is corrected by the low frequency time envelope correction unit 100d for decoding the high frequency signal. It is a point to do. In step 100-5A of FIG. 104, when the low frequency decoding signal obtained by the low frequency decoding unit 100b is used in decoding the high frequency signal, the low frequency time wrapping correction unit 100d corrects the time wrapping shape. Use frequency signals.

なお、ステップS150-2およびS150-3の処理を行う順番については、高周波数時間包絡形状の決定及び高周波数符号化部分を復号の処理の前であればよく、図104のフローチャートの順番に制限されない。 The order in which the processes of steps S150-2 and S150-3 are performed may be limited to the order of the flowchart of FIG. 104 as long as the high frequency time envelope shape is determined and the high frequency coded portion is before the decoding process. Not done.

［第16の実施形態の音声復号装置の第2の変形例］
図105は、第16の実施形態に係る音声復号装置の第2の変形例150Bの構成を示す図である。第16の実施形態の音声復号装置の第1の変形例との相違点は、低周波数／高周波数信号合成部150cに入力される低周波数信号が、低周波数時間包絡修正部100dからの出力ではなく、低周波数復号部100bからの出力である点である。 [Second variant of the audio decoding device of the 16th embodiment]
FIG. 105 is a diagram showing a configuration of a second modification 150B of the audio decoding device according to the sixteenth embodiment. The difference from the first modification of the voice decoding apparatus of the 16th embodiment is that the low frequency signal input to the low frequency / high frequency signal synthesis unit 150c is output from the low frequency time entrainment correction unit 100d. The point is that it is an output from the low frequency decoding unit 100b.

［第16の実施形態の音声復号装置の第3の変形例］
図233は、第16の実施形態に係る音声復号装置の第3の変形例150Cの構成を示す図である。 [Third variant of the audio decoding device of the 16th embodiment]
FIG. 233 is a diagram showing a configuration of a third modification 150C of the audio decoding device according to the sixteenth embodiment.

図234は、第16の実施形態に係る音声復号装置の第3の変形例150Cの動作を示すフローチャートである。 FIG. 234 is a flowchart showing the operation of the third modification 150C of the audio decoding device according to the sixteenth embodiment.

本変形例と前記第16の実施形態に係る音声復号装置150との相違点は、低周波数時間包絡形状決定部100c、高周波数時間包絡修正部110cにかえて、低周波数時間包絡形状決定部120c、高周波数時間包絡修正部120dを具備する点である。 The difference between this modification and the voice decoding device 150 according to the 16th embodiment is that the low frequency time envelope shape determination unit 120c is replaced with the low frequency time envelope shape determination unit 100c and the high frequency time envelope correction unit 110c. The point is that the high frequency time envelope correction unit 120d is provided.

［第16の実施形態の音声復号装置の第4の変形例］
図235は、第16の実施形態に係る音声復号装置の第4の変形例150Dの構成を示す図である。 [Fourth variant of the audio decoding device of the sixteenth embodiment]
FIG. 235 is a diagram showing a configuration of a fourth modification 150D of the audio decoding device according to the sixteenth embodiment.

図236は、第16の実施形態に係る音声復号装置の第4の変形例150Dの動作を示すフローチャートである。 FIG. 236 is a flowchart showing the operation of the fourth modification 150D of the audio decoding device according to the sixteenth embodiment.

本変形例と前記第16の実施形態に係る音声復号装置150との相違点は、高周波数時間包絡形状決定部120b、低周波数時間包絡修正部100dにかえて、高周波数時間包絡形状決定部120bA、低周波数時間包絡修正部120eを具備する点である。 The difference between this modification and the voice decoding device 150 according to the 16th embodiment is that the high frequency time envelope shape determination unit 120bA is replaced with the high frequency time envelope shape determination unit 120b and the low frequency time envelope correction unit 100d. The point is that the low frequency time envelope correction unit 120e is provided.

［第16の実施形態の音声復号装置の第5の変形例］
図237は、第16の実施形態に係る音声復号装置の第5の変形例150Eの構成を示す図である。 [Fifth Modification Example of Audio Decoding Device of the Sixteenth Embodiment]
FIG. 237 is a diagram showing a configuration of a fifth modification 150E of the audio decoding device according to the sixteenth embodiment.

図238は、第16の実施形態に係る音声復号装置の第5の変形例150Eの動作を示すフローチャートである。 FIG. 238 is a flowchart showing the operation of the fifth modification 150E of the audio decoding device according to the sixteenth embodiment.

［第16の実施形態の音声復号装置の第6の変形例］
図239は、第16の実施形態に係る音声復号装置の第6の変形例150Fの構成を示す図である。 [Sixth variant of the audio decoding device of the sixteenth embodiment]
FIG. 239 is a diagram showing a configuration of a sixth modification 150F of the audio decoding device according to the sixteenth embodiment.

図240は、第16の実施形態に係る音声復号装置の第6の変形例150Fの動作を示すフローチャートである。 FIG. 240 is a flowchart showing the operation of the sixth modification 150F of the audio decoding device according to the sixteenth embodiment.

本変形例と前記第16の実施形態に係る音声復号装置150との相違点は、低周波数時間包絡形状決定部100c及び高周波数時間包絡形状決定部120bにかえて時間包絡形状決定部120fを具備する点である。 The difference between this modification and the voice decoding device 150 according to the 16th embodiment is that the time envelope shape determination unit 120f is provided instead of the low frequency time envelope shape determination unit 100c and the high frequency time envelope shape determination unit 120b. It is a point to do.

［第16の実施形態の音声復号装置の第7の変形例］
図241は、第16の実施形態に係る音声復号装置の第7の変形例150Gの構成を示す図である。 [7th variant of the audio decoding device of the 16th embodiment]
FIG. 241 is a diagram showing a configuration of a seventh modification 150G of the audio decoding device according to the sixteenth embodiment.

図242は、第16の実施形態に係る音声復号装置の第7の変形例150Gの動作を示すフローチャートである。 FIG. 242 is a flowchart showing the operation of the seventh modification 150G of the voice decoding device according to the sixteenth embodiment.

本変形例と前記第16の実施形態に係る音声復号装置の第1の変形例150Aとの相違点は、低周波数時間包絡形状決定部100c、高周波数時間包絡修正部110cにかえて、低周波数時間包絡形状決定部120c、高周波数時間包絡修正部120dを具備する点である。 The difference between this modification and the first modification 150A of the voice decoding apparatus according to the 16th embodiment is that the low frequency is replaced with the low frequency time envelope shape determination unit 100c and the high frequency time envelope correction unit 110c. The point is that it includes a time envelope shape determining unit 120c and a high frequency time envelope correction unit 120d.

［第16の実施形態の音声復号装置の第8の変形例］
図243は、第16の実施形態に係る音声復号装置の第8の変形例150Hの構成を示す図である。 [Eighth variant of the audio decoding device of the sixteenth embodiment]
FIG. 243 is a diagram showing a configuration of an eighth modification 150H of the audio decoding device according to the sixteenth embodiment.

図244は、第16の実施形態に係る音声復号装置の第8の変形例150Hの動作を示すフローチャートである。 FIG. 244 is a flowchart showing the operation of the eighth modification 150H of the audio decoding device according to the sixteenth embodiment.

本変形例と前記第16の実施形態に係る音声復号装置の第1の変形例150Aとの相違点は、高周波数時間包絡形状決定部120b、低周波数時間包絡修正部100dにかえて、高周波数時間包絡形状決定部120bA、低周波数時間包絡修正部120eを具備する点である。 The difference between this modification and the first modification 150A of the voice decoding apparatus according to the 16th embodiment is that the high frequency is replaced with the high frequency time envelope shape determination unit 120b and the low frequency time envelope correction unit 100d. It is provided with a time envelope shape determining unit 120bA and a low frequency time envelope correction unit 120e.

［第16の実施形態の音声復号装置の第9の変形例］
図245は、第16の実施形態に係る音声復号装置の第9の変形例150Iの構成を示す図である。 [9th modification of the audio decoding device of the 16th embodiment]
FIG. 245 is a diagram showing a configuration of a ninth modification 150I of the audio decoding device according to the sixteenth embodiment.

図246は、第16の実施形態に係る音声復号装置の第9の変形例150Iの動作を示すフローチャートである。 FIG. 246 is a flowchart showing the operation of the ninth modification 150I of the audio decoding device according to the sixteenth embodiment.

［第16の実施形態の音声復号装置の第10の変形例］
図247は、第16の実施形態に係る音声復号装置の第10の変形例150Jの構成を示す図である。 [10th variant of the audio decoding device of the 16th embodiment]
FIG. 247 is a diagram showing a configuration of a tenth modification 150J of the audio decoding device according to the sixteenth embodiment.

図248は、第16の実施形態に係る音声復号装置の第10の変形例150Jの動作を示すフローチャートである。 FIG. 248 is a flowchart showing the operation of the tenth modification 150J of the audio decoding device according to the sixteenth embodiment.

本変形例と前記第16の実施形態に係る音声復号装置の第1の変形例150Aとの相違点は、低周波数時間包絡形状決定部100c及び高周波数時間包絡形状決定部120bにかえて時間包絡形状決定部120fを具備する点である。 The difference between this modification and the first modification 150A of the audio decoding device according to the 16th embodiment is that the time envelopment is replaced with the low frequency time envelope shape determination unit 100c and the high frequency time envelope shape determination unit 120b. The point is that the shape determining portion 120f is provided.

［第16の実施形態の音声復号装置の第11の変形例］
図249は、第16の実施形態に係る音声復号装置の第11の変形例150Kの構成を示す図である。 [11th modification of the audio decoding device of the 16th embodiment]
FIG. 249 is a diagram showing a configuration of an eleventh modification 150K of the audio decoding device according to the sixteenth embodiment.

図250は、第16の実施形態に係る音声復号装置の第11の変形例150Kの動作を示すフローチャートである。 FIG. 250 is a flowchart showing the operation of the eleventh modification 150K of the audio decoding device according to the sixteenth embodiment.

本変形例と前記第16の実施形態に係る音声復号装置の第2の変形例150Bとの相違点は、低周波数時間包絡形状決定部100c、高周波数時間包絡修正部110cにかえて、低周波数時間包絡形状決定部120c、高周波数時間包絡修正部120dを具備する点である。 The difference between this modification and the second modification 150B of the voice decoding apparatus according to the 16th embodiment is that the low frequency is replaced with the low frequency time envelope shape determination unit 100c and the high frequency time envelope correction unit 110c. The point is that it includes a time envelope shape determining unit 120c and a high frequency time envelope correction unit 120d.

［第16の実施形態の音声復号装置の第12の変形例］
図251は、第16の実施形態に係る音声復号装置の第12の変形例150Lの構成を示す図である。 [12th variant of the audio decoding device of the 16th embodiment]
FIG. 251 is a diagram showing a configuration of a twelfth modification 150L of the audio decoding device according to the sixteenth embodiment.

図252は、第16の実施形態に係る音声復号装置の第12の変形例150Lの動作を示すフローチャートである。 FIG. 252 is a flowchart showing the operation of the twelfth modification 150L of the audio decoding device according to the sixteenth embodiment.

本変形例と前記第16の実施形態に係る音声復号装置の第2の変形例150Bとの相違点は、高周波数時間包絡形状決定部120b、低周波数時間包絡修正部100dにかえて、高周波数時間包絡形状決定部120bA、低周波数時間包絡修正部120eを具備する点である。 The difference between this modification and the second modification 150B of the voice decoding apparatus according to the 16th embodiment is that the high frequency is replaced with the high frequency time envelope shape determination unit 120b and the low frequency time envelope correction unit 100d. It is provided with a time envelope shape determining unit 120bA and a low frequency time envelope correction unit 120e.

［第16の実施形態の音声復号装置の第13の変形例］
図253は、第16の実施形態に係る音声復号装置の第13の変形例150Mの構成を示す図である。 [13th modification of the audio decoding device of the 16th embodiment]
FIG. 253 is a diagram showing a configuration of a thirteenth modification 150M of the audio decoding device according to the sixteenth embodiment.

図254は、第16の実施形態に係る音声復号装置の第13の変形例150Mの動作を示すフローチャートである。 FIG. 254 is a flowchart showing the operation of the thirteenth modification 150M of the audio decoding device according to the sixteenth embodiment.

［第16の実施形態の音声復号装置の第14の変形例］
図255は、第16の実施形態に係る音声復号装置の第14の変形例150Nの構成を示す図である。 [14th modification of the audio decoding device of the 16th embodiment]
FIG. 255 is a diagram showing a configuration of a 14th modification 150N of the audio decoding device according to the 16th embodiment.

図256は、第16の実施形態に係る音声復号装置の第14の変形例150Nの動作を示すフローチャートである。 FIG. 256 is a flowchart showing the operation of the 14th modification 150N of the audio decoding device according to the 16th embodiment.

本変形例と前記第16の実施形態に係る音声復号装置の第2の変形例150Bとの相違点は、低周波数時間包絡形状決定部100c及び高周波数時間包絡形状決定部120bにかえて時間包絡形状決定部120fを具備する点である。 The difference between this modification and the second modification 150B of the audio decoding device according to the 16th embodiment is that the time envelopment is replaced with the low frequency time envelope shape determination unit 100c and the high frequency time envelope shape determination unit 120b. The point is that the shape determining portion 120f is provided.

［第17の実施形態］
図106は、第17の実施形態に係る音声復号装置160の構成を示す図である。音声復号装置160の通信装置は、下記音声符号化装置260から出力される多重化された符号化系列を受信し、更に、復号した音声信号を外部に出力する。音声復号装置160は、図106に示すように、機能的には、符号化系列逆多重化部150a、スイッチ群150b、低周波数復号部100b、低周波数時間包絡形状決定部100c、低周波数時間包絡修正部100d、高周波数時間包絡形状決定部120b、高周波数時間包絡修正部130a、高周波数復号部130b、及び低周波数/高周波数信号合成部150cを備える。 [17th Embodiment]
FIG. 106 is a diagram showing the configuration of the audio decoding device 160 according to the 17th embodiment. The communication device of the voice decoding device 160 receives the multiplexed coding sequence output from the following voice coding device 260, and further outputs the decoded voice signal to the outside. As shown in FIG. 106, the voice decoding device 160 functionally includes a coding sequence demultiplexing unit 150a, a switch group 150b, a low frequency decoding unit 100b, a low frequency time envelope shape determining unit 100c, and a low frequency time envelope. It includes a correction unit 100d, a high frequency time envelope shape determination unit 120b, a high frequency time envelope correction unit 130a, a high frequency decoding unit 130b, and a low frequency / high frequency signal synthesis unit 150c.

図107は、第17の実施形態に係る音声復号装置の動作を示すフローチャートである。なお、ステップS150-2およびS150-3の処理を行う順番については、高周波数時間包絡形状の決定及び高周波数符号化部分を復号の処理の前であればよく、図107のフローチャートの順番に制限されない。 FIG. 107 is a flowchart showing the operation of the voice decoding device according to the 17th embodiment. The order of processing steps S150-2 and S150-3 may be limited to the order of the flowchart of FIG. 107, as long as the high frequency time envelope shape is determined and the high frequency coded portion is before the decoding process. Not done.

図108は、第17の実施形態に係る音声符号化装置260の構成を示す図である。音声符号化装置260の通信装置は、符号化の対象となる音声信号を外部から受信し、更に、符号化された符号化系列を外部に出力する。音声符号化装置260は、図108に示すように、機能的には、高周波数信号生成制御情報符号化部250a、低周波数符号化部200a、高周波数符号化部200b、低周波数時間包絡情報符号化部200c、高周波数時間包絡情報符号化部220a、及び符号化系列多重化部250bを備える。 FIG. 108 is a diagram showing the configuration of the voice coding device 260 according to the 17th embodiment. The communication device of the voice coding device 260 receives the voice signal to be coded from the outside, and further outputs the coded coding sequence to the outside. As shown in FIG. 108, the voice coding device 260 functionally has a high frequency signal generation control information coding unit 250a, a low frequency coding unit 200a, a high frequency coding unit 200b, and a low frequency time wrapping information code. It includes a conversion unit 200c, a high frequency time wrapping information coding unit 220a, and a coding sequence multiplexing unit 250b.

図109は、第17の実施形態に係る音声符号化装置260の動作を示すフローチャートである。 FIG. 109 is a flowchart showing the operation of the voice coding device 260 according to the seventeenth embodiment.

［第17の実施形態の音声復号装置の第1の変形例］
図110は、第17の実施形態に係る音声復号装置の第1の変形例160Aの構成を示す図である。 [First modification of the audio decoding device of the 17th embodiment]
FIG. 110 is a diagram showing a configuration of a first modification 160A of the audio decoding device according to the 17th embodiment.

図111は、第17の実施形態に係る音声復号装置の第1の変形例160Aの動作を示すフローチャートである。 FIG. 111 is a flowchart showing the operation of the first modification 160A of the audio decoding device according to the 17th embodiment.

当該実施形態の音声復号装置160との相違点は、高周波数時間包絡修正部130aに代えて、第15の実施形態の音声復号装置の第1の変形例で説明した高周波数時間包絡修正部140aを用いている点である。 The difference from the voice decoding device 160 of the embodiment is that the high frequency time envelope correction unit 140a described in the first modification of the voice decoding device of the fifteenth embodiment is replaced with the high frequency time envelope correction unit 130a. Is used.

なお、ステップS150-2およびS150-3の処理を行う順番については、高周波数時間包絡形状の決定及び高周波数符号化部分を復号の処理の前であればよく、図111のフローチャートの順番に制限されない。 The order in which the processes of steps S150-2 and S150-3 are performed may be limited to the order of the flowchart of FIG. 111 as long as the high frequency time envelope shape is determined and the high frequency coded portion is before the decoding process. Not done.

［第17の実施形態の音声復号装置の第2の変形例］
図112は、第17の実施形態に係る音声復号装置の第2の変形例170Bの構成を示す図である。 [Second variant of the audio decoding device of the 17th embodiment]
FIG. 112 is a diagram showing a configuration of a second modification 170B of the audio decoding device according to the 17th embodiment.

当該実施形態の音声復号装置の第1の変形例160Aとの相違点は、第15の実施形態の音声復号装置の第2の変形例と同様に、低周波数/高周波数信号合成部150cでの合成処理に用いられる低周波数信号が、低周波数時間包絡修正部100dで時間包絡形状を修正された低周波数信号に代えて、低周波数復号部100bで復号された低周波数信号である点である。 The difference from the first modification 160A of the voice decoding device of the embodiment is that the low frequency / high frequency signal synthesizer 150c is the same as the second modification of the voice decoding device of the fifteenth embodiment. The point is that the low frequency signal used in the synthesis process is a low frequency signal decoded by the low frequency decoding unit 100b instead of the low frequency signal whose time wrapping shape is corrected by the low frequency time wrapping correction unit 100d.

［第17の実施形態の音声復号装置の第3の変形例］
図257は、第17の実施形態に係る音声復号装置の第3の変形例160Cの構成を示す図である。 [Third variant of the audio decoding device of the 17th embodiment]
FIG. 257 is a diagram showing a configuration of a third modification 160C of the audio decoding device according to the 17th embodiment.

図258は、第17の実施形態に係る音声復号装置の第3の変形例160Cの動作を示すフローチャートである。 FIG. 258 is a flowchart showing the operation of the third modification 160C of the audio decoding device according to the 17th embodiment.

本変形例と前記第17の実施形態に係る音声復号装置160との相違点は、低周波数時間包絡形状決定部100c、高周波数時間包絡修正部130aにかえて、低周波数時間包絡形状決定部120c、高周波数時間包絡修正部140bを具備する点である。 The difference between this modification and the voice decoding device 160 according to the 17th embodiment is the low frequency time envelope shape determination unit 120c instead of the low frequency time envelope shape determination unit 100c and the high frequency time envelope correction unit 130a. The point is that the high frequency time envelope correction unit 140b is provided.

［第17の実施形態の音声復号装置の第4の変形例］
図259は、第17の実施形態に係る音声復号装置の第4の変形例160Dの構成を示す図である。 [Fourth variant of the audio decoding device of the seventeenth embodiment]
FIG. 259 is a diagram showing a configuration of a fourth modification 160D of the audio decoding device according to the 17th embodiment.

図260は、第17の実施形態に係る音声復号装置の第4の変形例160Dの動作を示すフローチャートである。 FIG. 260 is a flowchart showing the operation of the fourth modification 160D of the audio decoding device according to the seventeenth embodiment.

本変形例と前記第17の実施形態に係る音声復号装置160との相違点は、高周波数時間包絡形状決定部120b、低周波数時間包絡修正部100dにかえて、高周波数時間包絡形状決定部120bA、低周波数時間包絡修正部120eを具備する点である。 The difference between this modification and the voice decoding device 160 according to the 17th embodiment is that the high frequency time envelope shape determination unit 120bA is replaced with the high frequency time envelope shape determination unit 120b and the low frequency time envelope correction unit 100d. The point is that the low frequency time envelope correction unit 120e is provided.

［第17の実施形態の音声復号装置の第5の変形例］
図261は、第17の実施形態に係る音声復号装置の第5の変形例160Eの構成を示す図である。 [Fifth variant of the audio decoding device of the seventeenth embodiment]
FIG. 261 is a diagram showing a configuration of a fifth modification 160E of the audio decoding device according to the seventeenth embodiment.

図262は、第17の実施形態に係る音声復号装置の第5の変形例160Eの動作を示すフローチャートである。 FIG. 262 is a flowchart showing the operation of the fifth modification 160E of the audio decoding device according to the seventeenth embodiment.

［第17の実施形態の音声復号装置の第6の変形例］
図263は、第17の実施形態に係る音声復号装置の第6の変形例160Fの構成を示す図である。 [Sixth modification of the audio decoding device of the seventeenth embodiment]
FIG. 263 is a diagram showing a configuration of a sixth modification 160F of the audio decoding device according to the seventeenth embodiment.

図264は、第17の実施形態に係る音声復号装置の第6の変形例160Fの動作を示すフローチャートである。 FIG. 264 is a flowchart showing the operation of the sixth modification 160F of the audio decoding device according to the seventeenth embodiment.

本変形例と前記第17の実施形態に係る音声復号装置160との相違点は、低周波数時間包絡形状決定部100c及び高周波数時間包絡形状決定部120bにかえて時間包絡形状決定部120fを具備する点である。 The difference between this modification and the audio decoding device 160 according to the 17th embodiment is that the time envelope shape determination unit 120f is provided instead of the low frequency time envelope shape determination unit 100c and the high frequency time envelope shape determination unit 120b. It is a point to do.

［第17の実施形態の音声復号装置の第7の変形例］
図265は、第17の実施形態に係る音声復号装置の第7の変形例160Gの構成を示す図である。 [7th variant of the audio decoding device of the 17th embodiment]
FIG. 265 is a diagram showing a configuration of a seventh modification 160G of the audio decoding device according to the seventeenth embodiment.

図266は、第17の実施形態に係る音声復号装置の第7の変形例160Gの動作を示すフローチャートである。 FIG. 266 is a flowchart showing the operation of the seventh modification 160G of the audio decoding device according to the seventeenth embodiment.

本変形例と前記第17の実施形態に係る音声復号装置の第1の変形例160Aとの相違点は、低周波数時間包絡形状決定部100c、高周波数時間包絡修正部140aにかえて、低周波数時間包絡形状決定部120c、高周波数時間包絡修正部140bを具備する点である。 The difference between this modification and the first modification 160A of the voice decoding apparatus according to the 17th embodiment is that the low frequency is replaced with the low frequency time envelope shape determination unit 100c and the high frequency time envelope correction unit 140a. The point is that it includes a time envelope shape determining unit 120c and a high frequency time envelope correction unit 140b.

［第17の実施形態の音声復号装置の第8の変形例］
図267は、第17の実施形態に係る音声復号装置の第8の変形例160Hの構成を示す図である。 [Eighth variant of the audio decoding device of the seventeenth embodiment]
FIG. 267 is a diagram showing a configuration of an eighth modification 160H of the audio decoding device according to the seventeenth embodiment.

図268は、第17の実施形態に係る音声復号装置の第8の変形例160Hの動作を示すフローチャートである。 FIG. 268 is a flowchart showing the operation of the eighth modification 160H of the audio decoding device according to the seventeenth embodiment.

本変形例と前記第17の実施形態に係る音声復号装置の第1の変形例160Aとの相違点は、高周波数時間包絡形状決定部120b、低周波数時間包絡修正部100dにかえて、高周波数時間包絡形状決定部120bA、低周波数時間包絡修正部120eを具備する点である。 The difference between this modification and the first modification 160A of the voice decoding apparatus according to the 17th embodiment is that the high frequency is replaced with the high frequency time envelope shape determination unit 120b and the low frequency time envelope correction unit 100d. It is provided with a time envelope shape determining unit 120bA and a low frequency time envelope correction unit 120e.

［第17の実施形態の音声復号装置の第9の変形例］
図269は、第17の実施形態に係る音声復号装置の第9の変形例160Iの構成を示す図である。 [9th modification of the audio decoding device of the 17th embodiment]
FIG. 269 is a diagram showing a configuration of a ninth modification 160I of the audio decoding device according to the seventeenth embodiment.

図270は、第17の実施形態に係る音声復号装置の第9の変形例160Iの動作を示すフローチャートである。 FIG. 270 is a flowchart showing the operation of the ninth modification 160I of the audio decoding device according to the seventeenth embodiment.

［第17の実施形態の音声復号装置の第10の変形例］
図271は、第17の実施形態に係る音声復号装置の第10の変形例160Jの構成を示す図である。 [10th variant of the audio decoding device of the 17th embodiment]
FIG. 271 is a diagram showing a configuration of a tenth modification 160J of the audio decoding device according to the seventeenth embodiment.

図272は、第17の実施形態に係る音声復号装置の第10の変形例160Jの動作を示すフローチャートである。 FIG. 272 is a flowchart showing the operation of the tenth modification 160J of the audio decoding device according to the seventeenth embodiment.

本変形例と前記第17の実施形態に係る音声復号装置の第1の変形例160Aとの相違点は、低周波数時間包絡形状決定部100c及び高周波数時間包絡形状決定部120bにかえて時間包絡形状決定部120fを具備する点である。 The difference between this modification and the first modification 160A of the audio decoding device according to the 17th embodiment is that the time envelopment is replaced with the low frequency time envelope shape determination unit 100c and the high frequency time envelope shape determination unit 120b. The point is that the shape determining portion 120f is provided.

［第17の実施形態の音声復号装置の第11の変形例］
図273は、第17の実施形態に係る音声復号装置の第11の変形例160Kの構成を示す図である。 [11th modification of the audio decoding device of the 17th embodiment]
FIG. 273 is a diagram showing the configuration of the eleventh modification 160K of the audio decoding device according to the seventeenth embodiment.

図274は、第17の実施形態に係る音声復号装置の第11の変形例160Kの動作を示すフローチャートである。 FIG. 274 is a flowchart showing the operation of the eleventh modification 160K of the audio decoding device according to the seventeenth embodiment.

本変形例と前記第17の実施形態に係る音声復号装置の第2の変形例160Bとの相違点は、低周波数時間包絡形状決定部100c、高周波数時間包絡修正部140aにかえて、低周波数時間包絡形状決定部120c、高周波数時間包絡修正部140bを具備する点である。 The difference between this modification and the second modification 160B of the voice decoding apparatus according to the 17th embodiment is that the low frequency is replaced with the low frequency time envelope shape determination unit 100c and the high frequency time envelope correction unit 140a. The point is that it includes a time envelope shape determining unit 120c and a high frequency time envelope correction unit 140b.

［第17の実施形態の音声復号装置の第12の変形例］
図275は、第17の実施形態に係る音声復号装置の第12の変形例160Lの構成を示す図である。 [12th variant of the audio decoding device of the 17th embodiment]
FIG. 275 is a diagram showing a configuration of a twelfth modification 160L of the audio decoding device according to the seventeenth embodiment.

図276は、第17の実施形態に係る音声復号装置の第12の変形例160Lの動作を示すフローチャートである。 FIG. 276 is a flowchart showing the operation of the twelfth modification 160L of the audio decoding device according to the seventeenth embodiment.

本変形例と前記第17の実施形態に係る音声復号装置の第2の変形例160Bとの相違点は、高周波数時間包絡形状決定部120b、低周波数時間包絡修正部100dにかえて、高周波数時間包絡形状決定部120bA、低周波数時間包絡修正部120eを具備する点である。 The difference between this modification and the second modification 160B of the voice decoding apparatus according to the 17th embodiment is that the high frequency is replaced with the high frequency time envelope shape determination unit 120b and the low frequency time envelope correction unit 100d. It is provided with a time envelope shape determining unit 120bA and a low frequency time envelope correction unit 120e.

［第17の実施形態の音声復号装置の第13の変形例］
図277は、第17の実施形態に係る音声復号装置の第13の変形例160Mの構成を示す図である。 [13th variant of the audio decoding device of the 17th embodiment]
FIG. 277 is a diagram showing a configuration of a thirteenth modification 160M of the audio decoding device according to the seventeenth embodiment.

図278は、第17の実施形態に係る音声復号装置の第13の変形例160Mの動作を示すフローチャートである。 FIG. 278 is a flowchart showing the operation of the thirteenth modification 160M of the voice decoding device according to the seventeenth embodiment.

［第17の実施形態の音声復号装置の第14の変形例］
図279は、第17の実施形態に係る音声復号装置の第14の変形例160Nの構成を示す図である。 [14th modification of the audio decoding device of the 17th embodiment]
FIG. 279 is a diagram showing the configuration of the 14th modification 160N of the audio decoding device according to the 17th embodiment.

図280は、第17の実施形態に係る音声復号装置の第14の変形例160Nの動作を示すフローチャートである。 FIG. 280 is a flowchart showing the operation of the 14th modification 160N of the audio decoding device according to the 17th embodiment.

本変形例と前記第17の実施形態に係る音声復号装置の第2の変形例160Bとの相違点は、低周波数時間包絡形状決定部100c及び高周波数時間包絡形状決定部120bにかえて時間包絡形状決定部120fを具備する点である。 The difference between this modification and the second modification 160B of the audio decoding device according to the 17th embodiment is that the time envelopment is replaced with the low frequency time envelope shape determination unit 100c and the high frequency time envelope shape determination unit 120b. The point is that the shape determining portion 120f is provided.

［第18の実施形態］
図113は、第18の実施形態に係る音声復号装置170の構成を示す図である。音声復号装置170の通信装置は、下記音声符号化装置270から出力される多重化された符号化系列を受信し、更に、復号した音声信号を外部に出力する。音声復号装置170は、図113に示すように、機能的には、符号化系列逆多重化部170a、スイッチ群170b、コア復号部10b、分析フィルタバンク部10c、符号化系列解析部13c、低周波数時間包絡形状決定部10e、低周波数時間包絡修正部10f、高周波数時間包絡形状決定部13a、時間包絡修正部13b、高周波数信号生成部10g、復号/逆量子化部10h、周波数包絡調整部10i、及び合成フィルタバンク部170cを備える。 [18th Embodiment]
FIG. 113 is a diagram showing a configuration of the audio decoding device 170 according to the eighteenth embodiment. The communication device of the voice decoding device 170 receives the multiplexed coding sequence output from the following voice coding device 270, and further outputs the decoded voice signal to the outside. As shown in FIG. 113, the voice decoding apparatus 170 functionally has a coded sequence demultiplexing section 170a, a switch group 170b, a core decoding section 10b, an analysis filter bank section 10c, a coded sequence analysis section 13c, and a low frequency. Frequency time envelope shape determination unit 10e, low frequency time envelope correction unit 10f, high frequency time envelope shape determination unit 13a, time envelope correction unit 13b, high frequency signal generation unit 10g, decoding / inverse quantization unit 10h, frequency inclusion adjustment unit It is provided with 10i and a synthetic filter bank unit 170c.

図114は、第18の実施形態に係る音声復号装置の動作を示すフローチャートである。 FIG. 114 is a flowchart showing the operation of the voice decoding device according to the eighteenth embodiment.

符号化系列逆多重化部170aは、符号化系列を、高周波数信号生成制御情報、低周波数信号を符号化したコア符号化部分、低周波数時間包絡形状決定部10eで必要な時間包絡形状に関する情報に分割する（ステップS170-1）。 The coded sequence demultiplexing unit 170a converts the coded sequence into high-frequency signal generation control information, a core coding part that encodes a low-frequency signal, and information on the time-enclosed shape required by the low-frequency time-enclosed shape determining unit 10e. Divide into (step S170-1).

符号化系列逆多重化部170aで得られた高周波数信号生成制御情報に基づき，高周波数信号を生成するか否かを判断する（ステップS170-2）。 Based on the high frequency signal generation control information obtained by the coded sequence demultiplexing unit 170a, it is determined whether or not to generate a high frequency signal (step S170-2).

高周波数信号を生成する場合、符号化系列逆多重化部170aは、符号化系列から低周波数信号から高周波数信号を生成するための帯域拡張部分を抽出し、符号化系列解析部13cは、符号化系列逆多重化部170aで抽出された符号化系列の帯域拡張部分を解析し、高周波数信号生成部10g、及び復号/逆量子化部10hで必要な情報、高周波数時間包絡形状決定部13aで必要な時間包絡形状に関する情報に分割する（ステップS170-3）。そして、当該符号化系列の高周波数符号化部分を用いて高周波数信号を生成し、さらに高周波数信号の時間包絡形状を決定して、高周波数信号の時間包絡形状を修正する。 When generating a high frequency signal, the coded sequence demultiplexing unit 170a extracts a band extension portion for generating a high frequency signal from a low frequency signal from the coded sequence, and the coded sequence analysis unit 13c extracts a code. The band expansion part of the coding sequence extracted by the demultiplexing unit 170a is analyzed, and the information required by the high frequency signal generation unit 10g and the decoding / dequantization unit 10h, the high frequency time entrainment shape determination unit 13a The time required in is divided into information on the entrapment shape (step S170-3). Then, a high frequency signal is generated using the high frequency coding portion of the coding series, and the time envelope shape of the high frequency signal is further determined to correct the time envelope shape of the high frequency signal.

なお、ステップS170-2およびS170-3の処理を行う順番については、高周波数信号の時間包絡形状の決定および帯域拡張部分の復号・逆量子化の処理の前であればよく、図114のフローチャートの順番に制限されない。 The order in which the processes of steps S170-2 and S170-3 are performed may be before the determination of the time envelope shape of the high frequency signal and the decoding / dequantization processing of the band extension portion, and the flowchart of FIG. 114 The order of is not limited.

合成フィルタバンク部170cは、前記高周波数信号生成情報に基づき高周波数信号を生成すると判断された場合、時間包絡形状を修正された低周波数サブバンド信号と時間包絡形状を修正された高周波数サブバンド信号から出力音声信号を合成し、前記高周波数信号生成情報に基づき高周波数信号を生成しないと判断された場合、時間包絡形状を修正された低周波数サブバンド信号から出力音声信号を合成する(ステップS170-4)。 When the synthetic filter bank unit 170c is determined to generate a high frequency signal based on the high frequency signal generation information, the low frequency subband signal having the corrected time wrapping shape and the high frequency subband having the corrected time wrapping shape have been generated. When the output audio signal is synthesized from the signal and it is determined that the high frequency signal is not generated based on the high frequency signal generation information, the output audio signal is synthesized from the low frequency subband signal whose time wrapping shape is corrected (step). S170-4).

なお、本実施形態に係る音声復号装置170の低周波数時間包絡形状決定部10eに対して、本発明の第1の実施形態の音声復号装置の第1、第2、及び第3の変形例が適用できることは明白である。 It should be noted that the first, second, and third modifications of the voice decoding device according to the first embodiment of the present invention are provided with respect to the low frequency time envelope shape determining unit 10e of the voice decoding device 170 according to the present invention. It is clear that it is applicable.

さらには、本実施形態に係る音声復号装置170の高周波数時間包絡形状決定部13aに対して、本発明の第4の実施形態の音声復号装置の第1、第2、及び第3の変形例、及び本発明第7の実施形態の音声復号装置の第1の変形例が適用できることは明白である。 Further, the first, second, and third modifications of the voice decoding device according to the fourth embodiment of the present invention are made with respect to the high frequency time-enclosed shape determining unit 13a of the voice decoding device 170 according to the present invention. , And the first modification of the audio decoding device of the seventh embodiment of the present invention is clearly applicable.

図115は、第18の実施形態に係る音声符号化装置270の構成を示す図である。音声符号化装置270の通信装置は、符号化の対象となる音声信号を外部から受信し、更に、符号化された符号化系列を外部に出力する。音声符号化装置270は、図115に示すように、機能的には、高周波数信号生成制御情報符号化部270a、ダウンサンプリング部20a、コア符号化部20b、分析フィルタバンク部20c及び20c1、制御パラメータ符号化部20d、包絡算出部20e、量子化/符号化部20f、コア復号信号生成部20i、サブバンド信号パワー算出部20j、時間包絡情報符号化部270b、及び符号化系列多重化部270cを備える。 FIG. 115 is a diagram showing the configuration of the voice coding device 270 according to the eighteenth embodiment. The communication device of the voice coding device 270 receives the voice signal to be coded from the outside, and further outputs the coded coding sequence to the outside. As shown in FIG. 115, the voice coding device 270 functionally includes a high frequency signal generation control information coding unit 270a, a downsampling unit 20a, a core coding unit 20b, an analysis filter bank unit 20c and 20c1, and control. Parameter coding unit 20d, inclusion calculation unit 20e, quantization / coding unit 20f, core decoding signal generation unit 20i, subband signal power calculation unit 20j, time inclusion information coding unit 270b, and coding sequence multiplexing unit 270c. To be equipped.

図116は、第18の実施形態に係る音声符号化装置270の動作を示すフローチャートである。 FIG. 116 is a flowchart showing the operation of the voice coding device 270 according to the eighteenth embodiment.

高周波数信号生成制御情報符号化部270aは、入力音声信号、高周波数信号生成制御指示信号のうち少なくとも一つに基づいて高周波数信号を生成するか否かを決定し、高周波数信号生成制御情報を符号化する(ステップS270-1)。例えば、入力音声信号が量子化/符号化部20fにて量子化・符号化する帯域拡張にて生成される周波数帯域の信号を含む場合は、高周波数信号を生成すると決定することができる。さらに例えば、高周波数信号生成制御指示信号により高周波数信号を生成することを指示された場合は、高周波数信号を生成すると決定することができる。さらに例えば、前記2つの方法を組み合わせることもでき、例えば前記2つの方法のうち少なくとも一つの方法にて高周波数信号を生成すると判断した場合には、高周波数信号を生成すると決定できる。 The high frequency signal generation control information coding unit 270a determines whether or not to generate a high frequency signal based on at least one of an input voice signal and a high frequency signal generation control instruction signal, and high frequency signal generation control information. Is encoded (step S270-1). For example, when the input audio signal includes a signal in a frequency band generated by band expansion that is quantized and encoded by the quantization / coding unit 20f, it can be determined that a high frequency signal is generated. Further, for example, when it is instructed to generate a high frequency signal by a high frequency signal generation control instruction signal, it can be determined to generate a high frequency signal. Further, for example, the two methods can be combined, and for example, when it is determined that the high frequency signal is generated by at least one of the two methods, it can be determined that the high frequency signal is generated.

高周波数信号生成制御情報符号化部270aにて高周波数信号を生成すると決定した場合は、帯域拡張にて高周波数信号を生成するのに必要な情報を算出・符号化する。一方、高周波数信号生成制御情報符号化部270aにて高周波数信号を生成しないと判断した場合、前記高周波数信号を生成するのに必要な情報の算出・符号化は実施されない(ステップS270-2)。 When the high frequency signal generation control information coding unit 270a determines to generate a high frequency signal, the information necessary for generating the high frequency signal by band expansion is calculated and encoded. On the other hand, when the high-frequency signal generation control information coding unit 270a determines that the high-frequency signal is not generated, the information required for generating the high-frequency signal is not calculated / coded (step S270-2). ).

時間包絡情報符号化部270bは、高周波数信号生成制御情報符号化部270aにて高周波数信号を生成すると決定した場合は、低周波数信号の時間包絡と高周波数信号の時間包絡のうち少なくとも一つ以上を算出し、さらにサブバンド信号パワー算出部20jにて算出されたコア復号信号のサブバンド信号のパワーを用いてコア復号信号の時間包絡を算出し、当該低周波数信号の時間包絡及び高周波数信号の時間包絡のうち少なくとも一つ以上とコア復号信号の時間包絡より時間包絡情報を符号化する。当該時間包絡情報は、低周波数時間包絡情報と高周波数時間包絡情報を含む。第7の実施形態の音声符号化装置26の時間包絡情報符号化部26aの動作と同様に、当該低周波数時間包絡情報と高周波数時間包絡情報の符号化の方法は限定されない。一方、高周波数信号生成制御情報符号化部270aにて高周波数信号を生成しないと判断した場合は、低周波数信号の時間包絡を算出し、さらにサブバンド信号パワー算出部20jにて算出されたコア復号信号のサブバンド信号のパワーを用いてコア復号信号の時間包絡を算出し、当該低周波数信号の時間包絡とコア復号信号の時間包絡より、低周波数信号に関する時間包絡情報を符号化する（ステップS270-3）。ここで高周波数信号生成制御情報符号化部270aにて高周波数信号を生成しないと判断した場合は、包絡算出部270dは、低周波数信号のサブバンド信号のパワーのみを算出することができ、さらには低周波数信号のサブバンド信号のパワーを算出せずに低周波数信号のサブバンド信号を時間包絡情報符号化部270bに送ることもできる。低周波数信号のサブバンド信号のパワーが算出されていない場合は、時間包絡情報符号化部270bにて低周波数信号のサブバンド信号のパワーを算出してもよく、低周波数信号のサブバンド信号のパワーがどこで算出されるかは限定されない。 When the time wrapping information coding unit 270b determines that the high frequency signal generation control information coding unit 270a generates a high frequency signal, at least one of the time wrapping of the low frequency signal and the time wrapping of the high frequency signal. After calculating the above, the time wrapping of the core decoding signal is calculated using the power of the subband signal of the core decoding signal calculated by the subband signal power calculation unit 20j, and the time wrapping and high frequency of the low frequency signal are calculated. The time wrapping information is encoded from at least one of the time wrappings of the signal and the time wrapping of the core decoded signal. The time envelope information includes low frequency time envelope information and high frequency time envelope information. Similar to the operation of the time envelope information coding unit 26a of the voice coding device 26 of the seventh embodiment, the method of encoding the low frequency time envelope information and the high frequency time envelope information is not limited. On the other hand, when the high frequency signal generation control information coding unit 270a determines that the high frequency signal is not generated, the time entrainment of the low frequency signal is calculated, and the core calculated by the subband signal power calculation unit 20j. The time wrapping of the core decoding signal is calculated using the power of the subband signal of the decoding signal, and the time wrapping information related to the low frequency signal is encoded from the time wrapping of the low frequency signal and the time wrapping of the core decoding signal (step). S270-3). If the high-frequency signal generation control information coding unit 270a determines that the high-frequency signal is not generated, the wraparound calculation unit 270d can calculate only the power of the sub-band signal of the low-frequency signal, and further. Can also send the subband signal of the low frequency signal to the time wrapping information coding unit 270b without calculating the power of the subband signal of the low frequency signal. When the power of the subband signal of the low frequency signal is not calculated, the power of the subband signal of the low frequency signal may be calculated by the time wrapping information coding unit 270b, and the power of the subband signal of the low frequency signal may be calculated. There is no limit to where the power is calculated.

符号化系列多重化部270cは、高周波数信号生成制御情報符号化部270aより符号化された高周波数信号生成制御情報を受け取り、コア符号化部20bより低周波数信号の符号化系列を受け取り、時間包絡情報符号化部20gより符号化された時間包絡情報を受け取り、高周波数信号生成制御情報符号化部270aにて高周波数信号を生成すると決定した場合は、制御パラメータ符号化部20dより符号化された制御パラメータをさらに受け取り、量子化/符号化部20fより符号化された高周波数信号に対するゲインおよびノイズ信号の大きさをさらに受け取り、これらを多重化して符号化系列として出力する（ステップS270-4）。 The coded sequence multiplexing unit 270c receives the high frequency signal generation control information encoded from the high frequency signal generation control information coding unit 270a, receives the coded sequence of the low frequency signal from the core coding unit 20b, and receives the time. When the time inclusion information encoded from the inclusion information coding unit 20g is received and the high frequency signal generation control information coding unit 270a determines to generate a high frequency signal, it is encoded by the control parameter coding unit 20d. It further receives the control parameters, and further receives the magnitude of the gain and noise signals for the high frequency signal encoded by the quantization / coding unit 20f, multiplexes them, and outputs them as a coding series (step S270-4). ).

［第18の実施形態の音声復号装置の第1の変形例］
図281は、第18の実施形態に係る音声復号装置の第1の変形例170Aの構成を示す図である。 [First Modified Example of Audio Decoding Device of 18th Embodiment]
FIG. 281 is a diagram showing a configuration of a first modification 170A of the audio decoding device according to the eighteenth embodiment.

図282は、第18の実施形態に係る音声復号装置の第1の変形例170Aの動作を示すフローチャートである。 FIG. 282 is a flowchart showing the operation of the first modification 170A of the audio decoding device according to the eighteenth embodiment.

本変形例と第18の実施形態に係る音声復号装置170との相違点は、低周波数時間包絡形状決定部10eC（10e、10eA、及び10eBでもよいことは明白）、時間包絡修正部13bにかえて、低周波数時間包絡形状決定部16b、時間包絡修正部16cを具備する点である。 The difference between this modification and the audio decoding device 170 according to the 18th embodiment is that the low frequency time envelope shape determination unit 10eC (clearly, 10e, 10eA, and 10eB may be used) and the time envelope correction unit 13b are replaced. Therefore, it is provided with a low frequency time envelope shape determining unit 16b and a time envelope correction unit 16c.

［第18の実施形態の音声復号装置の第2の変形例］
図283は、第18の実施形態に係る音声復号装置の第2の変形例170Bの構成を示す図である。 [Second variant of the audio decoding device of the 18th embodiment]
FIG. 283 is a diagram showing a configuration of a second modification 170B of the audio decoding device according to the eighteenth embodiment.

図284は、第18の実施形態に係る音声復号装置の第2の変形例170Bの動作を示すフローチャートである。 FIG. 284 is a flowchart showing the operation of the second modification 170B of the audio decoding device according to the eighteenth embodiment.

本変形例と第18の実施形態に係る音声復号装置170との相違点は、高周波数時間包絡形状決定部13aC（13a、13aA、及び13aBでもよいことは明白）、低周波数時間包絡修正部10fにかえて、高周波数時間包絡形状決定部16d、低周波数時間包絡修正部16eを具備する点である。 The difference between this modification and the audio decoding device 170 according to the eighteenth embodiment is the high frequency time envelope shape determination unit 13aC (it is clear that 13a, 13aA, and 13aB may be used), and the low frequency time envelope correction unit 10f. Instead, it is provided with a high frequency time envelope shape determining unit 16d and a low frequency time envelope correction unit 16e.

［第18の実施形態の音声復号装置の第3の変形例］
図285は、第18の実施形態に係る音声復号装置の第3の変形例170Cの構成を示す図である。 [Third variant of the audio decoding device of the eighteenth embodiment]
FIG. 285 is a diagram showing a configuration of a third modification 170C of the audio decoding device according to the eighteenth embodiment.

図286は、第18の実施形態に係る音声復号装置の第3の変形例170Cの動作を示すフローチャートである。 FIG. 286 is a flowchart showing the operation of the third modification 170C of the audio decoding device according to the eighteenth embodiment.

［第18の実施形態の音声復号装置の第4の変形例］
図287は、第18の実施形態に係る音声復号装置の第4の変形例170Dの構成を示す図である。 [Fourth variant of the audio decoding device of the eighteenth embodiment]
FIG. 287 is a diagram showing a configuration of a fourth modification 170D of the audio decoding device according to the eighteenth embodiment.

図288は、第18の実施形態に係る音声復号装置の第4の変形例170Dの動作を示すフローチャートである。 FIG. 288 is a flowchart showing the operation of the fourth modification 170D of the audio decoding device according to the eighteenth embodiment.

本変形例と前記第18の実施形態に係る音声復号装置170との相違点は、低周波数時間包絡形状決定部10e及び高周波数時間包絡形状決定部13aにかえて時間包絡形状決定部16fを具備する点である。 The difference between this modification and the voice decoding device 170 according to the eighteenth embodiment is that the time envelope shape determination unit 16f is provided instead of the low frequency time envelope shape determination unit 10e and the high frequency time envelope shape determination unit 13a. It is a point to do.

［第19の実施形態］
図117は、第19の実施形態に係る音声復号装置180の構成を示す図である。音声復号装置180の通信装置は、下記音声符号化装置280から出力される多重化された符号化系列を受信し、更に、復号した音声信号を外部に出力する。音声復号装置180は、図117に示すように、機能的には、符号化系列逆多重化部170a、スイッチ群170b、コア復号部10b、分析フィルタバンク部10c、符号化系列解析部13c、低周波数時間包絡形状決定部10e、低周波数時間包絡修正部10f、高周波数時間包絡形状決定部13a、高周波数信号生成部10g、時間包絡修正部14a、復号/逆量子化部10h、周波数包絡調整部10i、及び合成フィルタバンク部170cを備える。 [19th Embodiment]
FIG. 117 is a diagram showing a configuration of the audio decoding device 180 according to the nineteenth embodiment. The communication device of the voice decoding device 180 receives the multiplexed coding sequence output from the following voice coding device 280, and further outputs the decoded voice signal to the outside. As shown in FIG. 117, the voice decoding apparatus 180 functionally has a coded sequence demultiplexing section 170a, a switch group 170b, a core decoding section 10b, an analysis filter bank section 10c, a coded sequence analysis section 13c, and a low frequency. Frequency time envelope shape determination unit 10e, low frequency time envelope correction unit 10f, high frequency time envelope shape determination unit 13a, high frequency signal generation unit 10g, time envelope correction unit 14a, decoding / inverse quantization unit 10h, frequency envelope adjustment unit It is provided with 10i and a synthetic filter bank unit 170c.

図118は、第19の実施形態に係る音声復号装置の動作を示すフローチャートである。なお、ステップS170-2およびS170-3の処理を行う順番については、高周波数信号の時間包絡形状の決定および帯域拡張部分の復号・逆量子化の処理の前であればよく、図118のフローチャートの順番に制限されない。 FIG. 118 is a flowchart showing the operation of the voice decoding device according to the nineteenth embodiment. The order in which the processes of steps S170-2 and S170-3 are performed may be before the determination of the time envelope shape of the high frequency signal and the decoding / dequantization process of the band extension portion, and the flowchart of FIG. 118 The order of is not limited.

なお、本実施形態に係る音声復号装置180の低周波数時間包絡形状決定部10eに対して、本発明の第1の実施形態の音声復号装置の第1、第2、及び第3の変形例が適用できることは明白である。 It should be noted that the first, second, and third modifications of the voice decoding device of the first embodiment of the present invention are provided with respect to the low frequency time envelope shape determining unit 10e of the voice decoding device 180 according to the present invention. It is clear that it is applicable.

さらには、本実施形態に係る音声復号装置180の高周波数時間包絡形状決定部13aに対して、本発明の第4の実施形態の音声復号装置の第1、第2、及び第3の変形例、本発明第5の実施形態の音声復号装置の第1の変形例、及び本発明第7の実施形態の音声復号装置の第1の変形例が適用できることは明白である。 Further, the first, second, and third modifications of the voice decoding device according to the fourth embodiment of the present invention are provided with respect to the high frequency time wrapping shape determining unit 13a of the voice decoding device 180 according to the present invention. It is clear that the first modification of the voice decoding device according to the fifth embodiment of the present invention and the first modification of the voice decoding device according to the seventh embodiment of the present invention can be applied.

図119は、第19の実施形態に係る音声符号化装置280の構成を示す図である。音声符号化装置280の通信装置は、符号化の対象となる音声信号を外部から受信し、更に、符号化された符号化系列を外部に出力する。音声符号化装置280は、図119に示すように、機能的には、高周波数信号生成制御情報符号化部270a、ダウンサンプリング部20a、コア符号化部20b、分析フィルタバンク部20c及び20c1、制御パラメータ符号化部20d、包絡算出部270d、量子化/符号化部20f、コア復号信号生成部20i、サブバンド信号パワー算出部20j及び24b、擬似高周波数信号生成部24a、時間包絡情報符号化部280a、及び符号化系列多重化部270cを備える。 FIG. 119 is a diagram showing a configuration of the voice coding device 280 according to the nineteenth embodiment. The communication device of the voice coding device 280 receives the voice signal to be coded from the outside, and further outputs the coded coding sequence to the outside. As shown in FIG. 119, the voice coding device 280 functionally includes a high frequency signal generation control information coding unit 270a, a downsampling unit 20a, a core coding unit 20b, an analysis filter bank unit 20c and 20c1, and control. Parameter coding unit 20d, inclusion calculation unit 270d, quantization / coding unit 20f, core decoding signal generation unit 20i, subband signal power calculation units 20j and 24b, pseudo high frequency signal generation unit 24a, time inclusion information coding unit It includes a 280a and a coded sequence multiplexing unit 270c.

図120は、第19の実施形態に係る音声符号化装置280の動作を示すフローチャートである。 FIG. 120 is a flowchart showing the operation of the voice coding device 280 according to the nineteenth embodiment.

高周波数信号生成制御情報符号化部270aにて高周波数信号を生成すると決定した場合は、帯域拡張にて高周波数信号を生成するのに必要な情報を算出・符号化し、さらに擬似高周波数信号を生成し当該擬似高周波数信号の時間包絡を算出する。一方、高周波数信号生成制御情報符号化部270aにて高周波数信号を生成しないと判断した場合、前記帯域拡張にて高周波数信号を生成するのに必要な情報を算出・符号化、及び前記擬似高周波数信号の生成・時間包絡の算出は実施されない(ステップS280-1)。 When the high frequency signal generation control information coding unit 270a decides to generate a high frequency signal, the information required to generate the high frequency signal by band expansion is calculated and encoded, and the pseudo high frequency signal is further generated. Generate and calculate the time wrapping of the pseudo high frequency signal. On the other hand, when the high frequency signal generation control information coding unit 270a determines that the high frequency signal is not generated, the information necessary for generating the high frequency signal by the band expansion is calculated and encoded, and the pseudo. High frequency signal generation and time wrapping are not calculated (step S280-1).

時間包絡情報符号化部280aは、高周波数信号生成制御情報符号化部270aにて高周波数信号を生成すると決定した場合は、入力音声信号の低周波数信号の時間包絡、高周波数信号の時間包絡、コア復号信号の時間包絡、擬似高周波数信号の時間包絡のうち少なくとも一つ以上を算出し、算出された時間包絡より時間包絡情報を符号化する。当該時間包絡情報は、低周波数時間包絡情報と高周波数時間包絡情報を含む。第7の実施形態の音声符号化装置26の時間包絡情報符号化部26aの動作と同様に、当該低周波数時間包絡情報と高周波数時間包絡情報の符号化の方法は限定されない。一方、高周波数信号生成制御情報符号化部270aにて高周波数信号を生成しないと決定した場合は、入力音声信号の低周波数信号の時間包絡、コア復号信号の時間包絡のうち少なくとも一つ以上を算出し、算出された時間包絡より、低周波数信号に関する時間包絡情報を符号化する（ステップS280-2）。 When the time wrapping information coding unit 280a determines that the high frequency signal generation control information coding unit 270a generates a high frequency signal, the time wrapping of the low frequency signal of the input audio signal, the time wrapping of the high frequency signal, At least one or more of the time wrapping of the core decoded signal and the time wrapping of the pseudo high frequency signal is calculated, and the time wrapping information is encoded from the calculated time wrapping. The time envelope information includes low frequency time envelope information and high frequency time envelope information. Similar to the operation of the time envelope information coding unit 26a of the voice coding device 26 of the seventh embodiment, the method of encoding the low frequency time envelope information and the high frequency time envelope information is not limited. On the other hand, when the high frequency signal generation control information coding unit 270a determines that the high frequency signal is not generated, at least one of the time wrapping of the low frequency signal of the input voice signal and the time wrapping of the core decoding signal is performed. From the calculated time wrapping, the time wrapping information related to the low frequency signal is encoded (step S280-2).

なお、本実施形態に係る音声符号化装置280に対して、本発明の第7の実施形態の音声符号化装置の第1の変形例が適用できることは明白である。 It is clear that the first modification of the voice coding device of the seventh embodiment of the present invention can be applied to the voice coding device 280 according to the present embodiment.

［第19の実施形態の音声復号装置の第1の変形例］
図289は、第19の実施形態に係る音声復号装置の第1の変形例180Aの構成を示す図である。 [First Modified Example of Audio Decoding Device of 19th Embodiment]
FIG. 289 is a diagram showing a configuration of a first modification 180A of the audio decoding device according to the nineteenth embodiment.

図290は、第19の実施形態に係る音声復号装置の第1の変形例180Aの動作を示すフローチャートである。 FIG. 290 is a flowchart showing the operation of the first modification 180A of the audio decoding device according to the nineteenth embodiment.

本変形例と第19の実施形態に係る音声復号装置180との相違点は、低周波数時間包絡形状決定部10eC（10e、10eA、及び10eBでもよいことは明白）、時間包絡修正部14aにかえて、低周波数時間包絡形状決定部16b、時間包絡修正部17aを具備する点である。 The difference between this modification and the audio decoding device 180 according to the 19th embodiment is that the low frequency time envelope shape determination unit 10eC (clearly, 10e, 10eA, and 10eB may be used) and the time envelope correction unit 14a are replaced. Therefore, it is provided with a low frequency time envelope shape determining unit 16b and a time envelope correction unit 17a.

［第19の実施形態の音声復号装置の第2の変形例］
図291は、第19の実施形態に係る音声復号装置の第2の変形例180Bの構成を示す図である。 [Second variant of the audio decoding device of the 19th embodiment]
FIG. 291 is a diagram showing a configuration of a second modification 180B of the audio decoding device according to the nineteenth embodiment.

図292は、第19の実施形態に係る音声復号装置の第2の変形例180Bの動作を示すフローチャートである。 FIG. 292 is a flowchart showing the operation of the second modification 180B of the audio decoding device according to the nineteenth embodiment.

本変形例と第19の実施形態に係る音声復号装置180との相違点は、高周波数時間包絡形状決定部13aC（13a、13aA、及び13aBでもよいことは明白）、低周波数時間包絡修正部10fにかえて、高周波数時間包絡形状決定部16d、低周波数時間包絡修正部16eを具備する点である。 The difference between this modification and the audio decoding device 180 according to the 19th embodiment is the high frequency time envelope shape determining unit 13aC (it is clear that 13a, 13aA, and 13aB may be used), and the low frequency time envelope correction unit 10f. Instead, it is provided with a high frequency time envelope shape determining unit 16d and a low frequency time envelope correction unit 16e.

［第19の実施形態の音声復号装置の第3の変形例］
図293は、第19の実施形態に係る音声復号装置の第3の変形例180Cの構成を示す図である。 [Third variant of the audio decoding device of the nineteenth embodiment]
FIG. 293 is a diagram showing a configuration of a third modification 180C of the audio decoding device according to the nineteenth embodiment.

図294は、第19の実施形態に係る音声復号装置の第3の変形例180Cの動作を示すフローチャートである。 FIG. 294 is a flowchart showing the operation of the third modification 180C of the audio decoding device according to the nineteenth embodiment.

［第19の実施形態の音声復号装置の第4の変形例］
図295は、第19の実施形態に係る音声復号装置の第4の変形例180Dの構成を示す図である。 [Fourth variant of the audio decoding device of the nineteenth embodiment]
FIG. 295 is a diagram showing a configuration of a fourth modification 180D of the audio decoding device according to the nineteenth embodiment.

図296は、第19の実施形態に係る音声復号装置の第4の変形例180Dの動作を示すフローチャートである。 FIG. 296 is a flowchart showing the operation of the fourth modification 180D of the audio decoding device according to the nineteenth embodiment.

本変形例と前記第19の実施形態に係る音声復号装置180との相違点は、低周波数時間包絡形状決定部10e及び高周波数時間包絡形状決定部13aにかえて時間包絡形状決定部16fを具備する点である。 The difference between this modification and the voice decoding device 180 according to the nineteenth embodiment is that the low frequency time envelope shape determination unit 10e and the high frequency time envelope shape determination unit 13a are replaced with the time envelope shape determination unit 16f. It is a point to do.

［第20の実施形態］
図121は、第20の実施形態に係る音声復号装置190の構成を示す図である。音声復号装置190の通信装置は、下記音声符号化装置290から出力される多重化された符号化系列を受信し、更に、復号した音声信号を外部に出力する。音声復号装置190は、図121に示すように、機能的には、符号化系列逆多重化部170a、スイッチ群170b、コア復号部10b、分析フィルタバンク部10c、符号化系列解析部13c、低周波数時間包絡形状決定部10e、低周波数時間包絡修正部10f、高周波数時間包絡形状決定部13a、高周波数信号生成部10g、復号/逆量子化部10h、周波数包絡調整部10i、時間包絡修正部15a、及び合成フィルタバンク部170cを備える。 [20th Embodiment]
FIG. 121 is a diagram showing a configuration of an audio decoding device 190 according to a twentieth embodiment. The communication device of the voice decoding device 190 receives the multiplexed coding sequence output from the following voice coding device 290, and further outputs the decoded voice signal to the outside. As shown in FIG. 121, the voice decoding device 190 functionally has a coded sequence demultiplexing unit 170a, a switch group 170b, a core decoding unit 10b, an analysis filter bank unit 10c, a coded sequence analysis unit 13c, and a low frequency. Frequency time envelope shape determination unit 10e, low frequency time envelope correction unit 10f, high frequency time envelope shape determination unit 13a, high frequency signal generation unit 10g, decoding / dequantization unit 10h, frequency envelope adjustment unit 10i, time envelope correction unit It includes 15a and a synthetic filter bank unit 170c.

図122は、第20の実施形態に係る音声復号装置の動作を示すフローチャートである。なお、ステップS170-2およびS170-3の処理を行う順番については、高周波数信号の時間包絡形状の決定および帯域拡張部分の復号・逆量子化の処理の前であればよく、図122のフローチャートの順番に制限されない。 FIG. 122 is a flowchart showing the operation of the voice decoding device according to the twentieth embodiment. The order in which the processes of steps S170-2 and S170-3 are performed may be before the determination of the time envelope shape of the high frequency signal and the decoding / dequantization process of the band extension portion, and the flowchart of FIG. 122 shows. The order of is not limited.

なお、本実施形態に係る音声復号装置190の低周波数時間包絡形状決定部10eに対して、本発明の第1の実施形態の音声復号装置の第1、第2、及び第3の変形例が適用できることは明白である。 It should be noted that the first, second, and third modifications of the voice decoding device according to the first embodiment of the present invention are provided with respect to the low frequency time envelope shape determining unit 10e of the voice decoding device 190 according to the present invention. It is clear that it is applicable.

さらには、本実施形態に係る音声復号装置190の高周波数時間包絡形状決定部13aに対して、本発明の第4の実施形態の音声復号装置の第1、第2、及び第3の変形例、本発明第5の実施形態の音声復号装置の第1の変形例、及び本発明第7の実施形態の音声復号装置の第1の変形例が適用できることは明白である。 Further, the first, second, and third modifications of the voice decoding device according to the fourth embodiment of the present invention are provided with respect to the high frequency time wrapping shape determining unit 13a of the voice decoding device 190 according to the present invention. It is clear that the first modification of the voice decoding device according to the fifth embodiment of the present invention and the first modification of the voice decoding device according to the seventh embodiment of the present invention can be applied.

図123は、第20の実施形態に係る音声符号化装置290の構成を示す図である。音声符号化装置290の通信装置は、符号化の対象となる音声信号を外部から受信し、更に、符号化された符号化系列を外部に出力する。音声符号化装置290は、図123に示すように、機能的には、高周波数信号生成制御情報符号化部270a、ダウンサンプリング部20a、コア符号化部20b、分析フィルタバンク部20c及び20c1、制御パラメータ符号化部20d、包絡算出部270d、量子化/符号化部20f、コア復号信号生成部20i、サブバンド信号パワー算出部20j及び24b、擬似高周波数信号生成部24a、時間包絡情報符号化部280a、及び符号化系列多重化部270cを備える。 FIG. 123 is a diagram showing the configuration of the voice coding device 290 according to the twentieth embodiment. The communication device of the voice coding device 290 receives the voice signal to be coded from the outside, and further outputs the coded coding sequence to the outside. As shown in FIG. 123, the voice coding device 290 functionally includes a high frequency signal generation control information coding unit 270a, a downsampling unit 20a, a core coding unit 20b, an analysis filter bank unit 20c and 20c1, and control. Parameter coding unit 20d, inclusion calculation unit 270d, quantization / coding unit 20f, core decoding signal generation unit 20i, subband signal power calculation units 20j and 24b, pseudo high frequency signal generation unit 24a, time inclusion information coding unit It includes a 280a and a coded sequence multiplexing unit 270c.

図124は、第20の実施形態に係る音声符号化装置290の動作を示すフローチャートである。 FIG. 124 is a flowchart showing the operation of the voice coding device 290 according to the twentieth embodiment.

時間包絡情報符号化部290aは、高周波数信号生成制御情報符号化部270aにて高周波数信号を生成すると決定した場合は、入力音声信号の低周波数信号の時間包絡、高周波数信号の時間包絡、コア復号信号の時間包絡、周波数包絡調整された擬似高周波数信号の時間包絡のうち少なくとも一つ以上を算出し、算出された時間包絡より時間包絡情報を符号化する。当該時間包絡情報は、低周波数時間包絡情報と高周波数時間包絡情報を含む。第7の実施形態の音声符号化装置26の時間包絡情報符号化部26aの動作と同様に、当該低周波数時間包絡情報と高周波数時間包絡情報の符号化の方法は限定されない。
一方、高周波数信号生成制御情報符号化部270aにて高周波数信号を生成しないと決定した場合は、入力音声信号の低周波数信号の時間包絡、コア復号信号の時間包絡のうち少なくとも一つ以上を算出し、算出された時間包絡より、低周波数信号に関する時間包絡情報を符号化する（ステップS290-1）。 When the time wrapping information coding unit 290a determines that the high frequency signal generation control information coding unit 270a generates a high frequency signal, the time wrapping of the low frequency signal of the input audio signal, the time wrapping of the high frequency signal, At least one or more of the time wrapping of the core decoded signal and the time wrapping of the frequency wrapping adjusted pseudo high frequency signal is calculated, and the time wrapping information is encoded from the calculated time wrapping. The time envelope information includes low frequency time envelope information and high frequency time envelope information. Similar to the operation of the time envelope information coding unit 26a of the voice coding device 26 of the seventh embodiment, the method of encoding the low frequency time envelope information and the high frequency time envelope information is not limited.
On the other hand, when the high frequency signal generation control information coding unit 270a determines that the high frequency signal is not generated, at least one of the time wrapping of the low frequency signal of the input voice signal and the time wrapping of the core decoding signal is performed. From the calculated time wrapping, the time wrapping information related to the low frequency signal is encoded (step S290-1).

なお、本実施形態に係る音声符号化装置290に対して、本発明の第7の実施形態の音声符号化装置の第1の変形例が適用できることは明白である。 It is clear that the first modification of the voice coding device of the seventh embodiment of the present invention can be applied to the voice coding device 290 according to the present embodiment.

［第20の実施形態の音声復号装置の第1の変形例］
図297は、第20の実施形態に係る音声復号装置の第1の変形例190Aの構成を示す図である。 [First Modified Example of Audio Decoding Device of 20th Embodiment]
FIG. 297 is a diagram showing a configuration of a first modification 190A of the audio decoding device according to the twentieth embodiment.

図298は、第20の実施形態に係る音声復号装置の第1の変形例190Aの動作を示すフローチャートである。 FIG. 298 is a flowchart showing the operation of the first modification 190A of the audio decoding device according to the twentieth embodiment.

本変形例と前記第20の実施形態に係る音声復号装置190との相違点は、時間包絡修正部13aにかえて時間包絡修正部15aAを具備する点である。 The difference between this modification and the voice decoding device 190 according to the twentieth embodiment is that the time envelope correction unit 15aA is provided instead of the time envelope correction unit 13a.

［第20の実施形態の音声復号装置の第2の変形例］
図299は、第20の実施形態に係る音声復号装置の第2の変形例190Bの構成を示す図である。 [Second variant of the audio decoding device of the 20th embodiment]
FIG. 299 is a diagram showing a configuration of a second modification 190B of the audio decoding device according to the twentieth embodiment.

図300は、第20の実施形態に係る音声復号装置の第2の変形例190Bの動作を示すフローチャートである。 FIG. 300 is a flowchart showing the operation of the second modification 190B of the audio decoding device according to the twentieth embodiment.

本変形例と第20の実施形態に係る音声復号装置190との相違点は、低周波数時間包絡形状決定部10eC（10e、10eA、及び10eBでもよいことは明白）、時間包絡修正部15aにかえて、低周波数時間包絡形状決定部16b、時間包絡修正部18aを具備する点である。 The difference between this modification and the audio decoding device 190 according to the twentieth embodiment is that the low frequency time envelope shape determination unit 10eC (clearly, 10e, 10eA, and 10eB may be used) and the time envelope correction unit 15a are replaced. Therefore, it is provided with a low frequency time envelope shape determining unit 16b and a time envelope correction unit 18a.

［第20の実施形態の音声復号装置の第3の変形例］
図301は、第20の実施形態に係る音声復号装置の第3の変形例190Cの構成を示す図である。 [Third variant of the audio decoding device of the twentieth embodiment]
FIG. 301 is a diagram showing a configuration of a third modification 190C of the audio decoding device according to the twentieth embodiment.

図302は、第20の実施形態に係る音声復号装置の第3の変形例190Cの動作を示すフローチャートである。 FIG. 302 is a flowchart showing the operation of the third modification 190C of the audio decoding device according to the twentieth embodiment.

本変形例と第20の実施形態に係る音声復号装置190との相違点は、高周波数時間包絡形状決定部13aC（13a、13aA、及び13aBでもよいことは明白）、低周波数時間包絡修正部10fにかえて、高周波数時間包絡形状決定部16d、低周波数時間包絡修正部16eを具備する点である。 The difference between this modification and the audio decoding device 190 according to the twentieth embodiment is the high frequency time envelope shape determining unit 13aC (it is clear that 13a, 13aA, and 13aB may be used), and the low frequency time envelope correction unit 10f. Instead, it is provided with a high frequency time envelope shape determining unit 16d and a low frequency time envelope correction unit 16e.

［第20の実施形態の音声復号装置の第4の変形例］
図303は、第20の実施形態に係る音声復号装置の第4の変形例190Dの構成を示す図である。 [Fourth variant of the audio decoding device of the twentieth embodiment]
FIG. 303 is a diagram showing a configuration of a fourth modification 190D of the audio decoding device according to the twentieth embodiment.

図304は、第20の実施形態に係る音声復号装置の第4の変形例190Dの動作を示すフローチャートである。 FIG. 304 is a flowchart showing the operation of the fourth modification 190D of the audio decoding device according to the twentieth embodiment.

［第20の実施形態の音声復号装置の第5の変形例］
図305は、第20の実施形態に係る音声復号装置の第5の変形例190Eの構成を示す図である。 [Fifth variant of the audio decoding device of the twentieth embodiment]
FIG. 305 is a diagram showing a configuration of a fifth modification 190E of the audio decoding device according to the twentieth embodiment.

図306は、第20の実施形態に係る音声復号装置の第5の変形例190Eの動作を示すフローチャートである。 FIG. 306 is a flowchart showing the operation of the fifth modification 190E of the audio decoding device according to the twentieth embodiment.

本変形例と前記第20の実施形態に係る音声復号装置190との相違点は、低周波数時間包絡形状決定部10e及び高周波数時間包絡形状決定部13aにかえて時間包絡形状決定部16fを具備する点である。 The difference between this modification and the voice decoding device 190 according to the twentieth embodiment is that the time envelope shape determination unit 16f is provided instead of the low frequency time envelope shape determination unit 10e and the high frequency time envelope shape determination unit 13a. It is a point to do.

［第20の実施形態の音声復号装置の第6の変形例］
図307は、第20の実施形態に係る音声復号装置の第6の変形例190Fの構成を示す図である。 [Sixth variant of the audio decoding device of the twentieth embodiment]
FIG. 307 is a diagram showing a configuration of a sixth modification 190F of the audio decoding device according to the twentieth embodiment.

図308は、第20の実施形態に係る音声復号装置の第6の変形例190Fの動作を示すフローチャートである。 FIG. 308 is a flowchart showing the operation of the sixth modification 190F of the audio decoding device according to the twentieth embodiment.

本変形例と第20の実施形態の第1の変形例に係る音声復号装置190Aとの相違点は、低周波数時間包絡形状決定部10eC（10e、10eA、及び10eBでもよいことは明白）、時間包絡修正部15aAにかえて、低周波数時間包絡形状決定部16b、時間包絡修正部18aAを具備する点である。 The difference between this modification and the voice decoding device 190A according to the first modification of the twentieth embodiment is the low frequency time envelope shape determination unit 10eC (clearly, 10e, 10eA, and 10eB may be used), time. Instead of the envelope correction unit 15aA, the low frequency time envelope shape determination unit 16b and the time envelope correction unit 18aA are provided.

［第20の実施形態の音声復号装置の第7の変形例］
図309は、第20の実施形態に係る音声復号装置の第7の変形例190Gの構成を示す図である。 [7th variant of the audio decoding device of the 20th embodiment]
FIG. 309 is a diagram showing a configuration of a seventh modification 190G of the audio decoding device according to the twentieth embodiment.

図310は、第20の実施形態に係る音声復号装置の第7の変形例190Gの動作を示すフローチャートである。 FIG. 310 is a flowchart showing the operation of the seventh modification 190G of the voice decoding device according to the twentieth embodiment.

本変形例と第20の実施形態の第1の変形例に係る音声復号装置190Aとの相違点は、高周波数時間包絡形状決定部13aC（13a、13aA、及び13aBでもよいことは明白）、低周波数時間包絡修正部10fにかえて、高周波数時間包絡形状決定部16d、低周波数時間包絡修正部16eを具備する点である。 The difference between this modification and the voice decoding device 190A according to the first modification of the twentieth embodiment is that the high frequency time envelope shape determining unit 13aC (it is clear that 13a, 13aA, and 13aB may be used) and low. Instead of the frequency time envelope correction unit 10f, the high frequency time envelope shape determination unit 16d and the low frequency time envelope correction unit 16e are provided.

［第20の実施形態の音声復号装置の第8の変形例］
図311は、第20の実施形態に係る音声復号装置の第8の変形例190Hの構成を示す図である。 [Eighth variant of the audio decoding device of the twentieth embodiment]
FIG. 311 is a diagram showing a configuration of an eighth modification 190H of the audio decoding device according to the twentieth embodiment.

図312は、第20の実施形態に係る音声復号装置の第8の変形例190Hの動作を示すフローチャートである。 FIG. 312 is a flowchart showing the operation of the eighth modification 190H of the audio decoding device according to the twentieth embodiment.

［第20の実施形態の音声復号装置の第9の変形例］
図313は、第20の実施形態に係る音声復号装置の第9の変形例190Iの構成を示す図である。 [Ninth variant of the audio decoding device of the twentieth embodiment]
FIG. 313 is a diagram showing a configuration of a ninth modification 190I of the audio decoding device according to the twentieth embodiment.

図314は、第20の実施形態に係る音声復号装置の第9の変形例190Iの動作を示すフローチャートである。 FIG. 314 is a flowchart showing the operation of the ninth modification 190I of the audio decoding device according to the twentieth embodiment.

本変形例と前記第20の実施形態の第1の変形例に係る音声復号装置190Aとの相違点は、低周波数時間包絡形状決定部10e及び高周波数時間包絡形状決定部13aにかえて時間包絡形状決定部16fを具備する点である。 The difference between this modification and the voice decoding device 190A according to the first modification of the 20th embodiment is that the time envelopment is replaced with the low frequency time envelope shape determination unit 10e and the high frequency time envelope shape determination unit 13a. The point is that the shape determining portion 16f is provided.

［第21の実施形態］
図125は、第21の実施形態に係る音声復号装置300の構成を示す図である。音声復号装置300の通信装置は、下記音声符号化装置400から出力される多重化された符号化系列を受信し、更に、復号した音声信号を外部に出力する。音声復号装置300は、図125に示すように、機能的には、符号化系列逆多重化部10a、コア復号部10b、分析フィルタバンク部10c、符号化系列解析部13c、低周波数時間包絡形状決定部10e、低周波数時間包絡修正部10f、高周波数時間包絡形状決定部13a、時間包絡修正部300a、高周波数信号生成部10g、復号/逆量子化部10h、周波数包絡調整部10i、及び合成フィルタバンク部10jを備える。 [21st Embodiment]
FIG. 125 is a diagram showing a configuration of the audio decoding device 300 according to the 21st embodiment. The communication device of the voice decoding device 300 receives the multiplexed coding sequence output from the following voice coding device 400, and further outputs the decoded voice signal to the outside. As shown in FIG. 125, the voice decoding apparatus 300 functionally has a coded sequence demultiplexing section 10a, a core decoding section 10b, an analysis filter bank section 10c, a coded sequence analysis section 13c, and a low frequency time wrapping shape. Determination unit 10e, low frequency time wrapping correction unit 10f, high frequency time wrapping shape determination unit 13a, time wrapping correction unit 300a, high frequency signal generation unit 10g, decoding / inverse quantization unit 10h, frequency wrapping adjustment unit 10i, and synthesis. It has a filter bank section 10j.

図126は、第21の実施形態に係る音声復号装置の動作を示すフローチャートである。 FIG. 126 is a flowchart showing the operation of the voice decoding device according to the 21st embodiment.

時間包絡修正部300aは、高周波数時間包絡形状決定部13aで決定した時間包絡形状に基づいて、低周波数時間包絡修正部10fから出力され、高周波数信号生成部10gにて高周波数信号の生成に利用する時間包絡形状を修正された低周波数信号の複数のサブバンド信号の時間包絡の形状を修正する（ステップS300-1）。時間包絡修正部13bとの相違点は、入力される信号が分析フィルタバンク部10cから出力される低周波数信号の複数のサブバンド信号に代わって、低周波数時間包絡修正部10fから出力される時間包絡形状を修正された低周波数信号の複数のサブバンド信号である点である。時間包絡修正部13bにおける時間包絡の修正処理において、分析フィルタバンク部10cから出力される低周波数信号の複数のサブバンド信号を、低周波数時間包絡修正部10fから出力される時間包絡形状を修正された低周波数信号の複数のサブバンド信号にかえることにより実現できる。 The time envelope correction unit 300a is output from the low frequency time envelope correction unit 10f based on the time envelope shape determined by the high frequency time envelope shape determination unit 13a, and the high frequency signal generation unit 10g generates a high frequency signal. Corrected the time-envelope shape to be used Correct the time-envelope shape of multiple subband signals of the low-frequency signal (step S300-1). The difference from the time wrapping correction unit 13b is the time when the input signal is output from the low frequency time wrapping correction unit 10f instead of the plurality of subband signals of the low frequency signal output from the analysis filter bank unit 10c. The point is that it is a plurality of subband signals of a low frequency signal whose envelopment shape has been modified. In the time envelope correction process in the time envelope correction unit 13b, the time envelope shape output from the low frequency time envelope correction unit 10f is corrected for a plurality of subband signals of the low frequency signal output from the analysis filter bank unit 10c. This can be achieved by replacing the low frequency signal with a plurality of subband signals.

なお、本実施形態に係る音声復号装置300の低周波数時間包絡形状決定部10eに対して、本発明の第1の実施形態の音声復号装置の第1、第2、及び第3の変形例が適用できることは明白である。 It should be noted that the first, second, and third modifications of the voice decoding device of the first embodiment of the present invention are provided with respect to the low frequency time envelope shape determining unit 10e of the voice decoding device 300 according to the present invention. It is clear that it is applicable.

さらには、本実施形態に係る音声復号装置300の高周波数時間包絡形状決定部13aに対して、本発明の第4の実施形態の音声復号装置の第1、第2、及び第3の変形例、及び本発明第7の実施形態の音声復号装置の第1の変形例が適用できることは明白である。 Further, the first, second, and third modifications of the voice decoding device according to the fourth embodiment of the present invention are made with respect to the high frequency time-enclosed shape determining unit 13a of the voice decoding device 300 according to the present invention. , And the first modification of the voice decoding apparatus of the seventh embodiment of the present invention is clearly applicable.

図127は、第21の実施形態に係る音声符号化装置400の構成を示す図である。音声符号化装置400の通信装置は、符号化の対象となる音声信号を外部から受信し、更に、符号化された符号化系列を外部に出力する。音声符号化装置400は、図127に示すように、機能的には、ダウンサンプリング部20a、コア符号化部20b、分析フィルタバンク部20c及び20c1、制御パラメータ符号化部20d、包絡算出部20e、量子化/符号化部20f、コア復号信号生成部20i、サブバンド信号パワー算出部20j、時間包絡情報符号化部400a、及び符号化系列多重化部20hを備える。 FIG. 127 is a diagram showing a configuration of the voice coding device 400 according to the 21st embodiment. The communication device of the voice coding device 400 receives the voice signal to be coded from the outside, and further outputs the coded coding sequence to the outside. As shown in FIG. 127, the voice coding device 400 functionally includes a downsampling unit 20a, a core coding unit 20b, an analysis filter bank unit 20c and 20c1, a control parameter coding unit 20d, and an entrapment calculation unit 20e. It includes a quantization / coding unit 20f, a core decoding signal generation unit 20i, a subband signal power calculation unit 20j, a time wrapping information coding unit 400a, and a coding sequence multiplexing unit 20h.

図128は、第21の実施形態に係る音声符号化装置400の動作を示すフローチャートである。 FIG. 128 is a flowchart showing the operation of the voice coding device 400 according to the 21st embodiment.

時間包絡情報符号化部400aは、低周波数信号の時間包絡と高周波数信号の時間包絡のうち少なくとも一つ以上を算出し、さらにサブバンド信号パワー算出部20jにて算出されたコア復号信号のサブバンド信号のパワーを用いてコア復号信号の時間包絡を算出し、当該低周波数信号の時間包絡及び高周波数信号の時間包絡のうち少なくとも一つ以上とコア復号信号の時間包絡より時間包絡情報を符号化する（ステップS400-1）。当該時間包絡情報は、低周波数時間包絡情報と高周波数時間包絡情報を含む。第7の実施形態の音声符号化装置26の時間包絡情報符号化部26aの動作と同様に、当該低周波数時間包絡情報と高周波数時間包絡情報の符号化の方法は限定されない。時間包絡情報符号化部26aとの相違点は、高周波数信号に関する時間包絡情報を算出する場合には、コア復号信号の時間包絡と低周波数信号に関する時間包絡情報のうち少なくとも一つ以上を用いて、時間包絡形状を修正されたコア復号信号の時間包絡を用いることができる点である。なお、高周波数信号の時間包絡情報は、低周波数信号の時間包絡情報を元に生成可能である。 The time wrapping information coding unit 400a calculates at least one of the time wrapping of the low frequency signal and the time wrapping of the high frequency signal, and further sub-band signal power calculation unit 20j of the core decoded signal. The time wrapping of the core decoding signal is calculated using the power of the band signal, and the time wrapping information is coded from at least one of the time wrapping of the low frequency signal and the time wrapping of the high frequency signal and the time wrapping of the core decoding signal. (Step S400-1). The time envelope information includes low frequency time envelope information and high frequency time envelope information. Similar to the operation of the time envelope information coding unit 26a of the voice coding device 26 of the seventh embodiment, the method of encoding the low frequency time envelope information and the high frequency time envelope information is not limited. The difference from the time envelope information coding unit 26a is that when calculating the time envelope information related to the high frequency signal, at least one of the time envelope information related to the core decoded signal and the time envelope information related to the low frequency signal is used. The point is that the time envelope of the core decoded signal whose time envelope shape is modified can be used. The time envelope information of the high frequency signal can be generated based on the time envelope information of the low frequency signal.

［第21の実施形態の音声復号装置の第1の変形例］
図315は、第21の実施形態に係る音声復号装置の第1の変形例300Aの構成を示す図である。 [First variant of the audio decoding device of the 21st embodiment]
FIG. 315 is a diagram showing a configuration of a first modification 300A of the audio decoding device according to the 21st embodiment.

図316は、第21の実施形態に係る音声復号装置の第1の変形例300Aの動作を示すフローチャートである。 FIG. 316 is a flowchart showing the operation of the first modification 300A of the audio decoding device according to the 21st embodiment.

本変形例と第21の実施形態に係る音声復号装置300との相違点は、低周波数時間包絡形状決定部10eC（10e、10eA、及び10eBでもよいことは明白）、時間包絡修正部300aにかえて、低周波数時間包絡形状決定部16b、時間包絡修正部300aAを具備する点である。 The difference between this modification and the audio decoding device 300 according to the 21st embodiment is that the low frequency time envelope shape determination unit 10eC (clearly, 10e, 10eA, and 10eB may be used) and the time envelope correction unit 300a are replaced. Therefore, it is provided with a low frequency time envelope shape determination unit 16b and a time envelope correction unit 300aA.

本変形例においては、時間包絡修正部300aAと前記時間包絡修正部300aとの相違点は、高周波数時間包絡形状決定部13aC（13a、13aA、13aBでもよいことは明白）から受け取る時間包絡形状と低周波数時間包絡形状決定部16bから受け取る時間包絡形状のうち少なくとも一つ以上に基づいて、低周波数時間包絡修正部10fから出力され、高周波数信号生成部10gにて高周波数信号の生成に利用する時間包絡形状を修正された低周波数信号の複数のサブバンド信号の時間包絡の形状を修正する点である(S300-1a)。 In this modification, the difference between the time wrapping correction unit 300aA and the time wrapping correction unit 300a is the time wrapping shape received from the high frequency time wrapping shape determining unit 13aC (it is clear that 13a, 13aA, 13aB may be used). Based on at least one of the time wrapping shapes received from the low frequency time wrapping shape determination unit 16b, it is output from the low frequency time wrapping correction unit 10f and used by the high frequency signal generation unit 10g to generate a high frequency signal. Corrected the time-wrapping shape The point is to correct the time-wrapping shape of multiple subband signals of the low frequency signal (S300-1a).

［第21の実施形態の音声復号装置の第2の変形例］
図317は、第21の実施形態に係る音声復号装置の第2の変形例300Bの構成を示す図である。 [Second variant of the audio decoding device of the 21st embodiment]
FIG. 317 is a diagram showing a configuration of a second modification 300B of the audio decoding device according to the 21st embodiment.

図318は、第21の実施形態に係る音声復号装置の第2の変形例300Bの動作を示すフローチャートである。 FIG. 318 is a flowchart showing the operation of the second modification 300B of the audio decoding device according to the 21st embodiment.

本変形例と第21の実施形態に係る音声復号装置300との相違点は、高周波数時間包絡形状決定部13aC（13a、13aA、及び13aBでもよいことは明白）、低周波数時間包絡修正部10fにかえて、高周波数時間包絡形状決定部16d、低周波数時間包絡修正部16eを具備する点である。 The difference between this modification and the audio decoding device 300 according to the 21st embodiment is the high frequency time envelope shape determining unit 13aC (it is clear that 13a, 13aA, and 13aB may be used), and the low frequency time envelope correction unit 10f. Instead, it is provided with a high frequency time envelope shape determining unit 16d and a low frequency time envelope correction unit 16e.

［第21の実施形態の音声復号装置の第3の変形例］
図319は、第21の実施形態に係る音声復号装置の第3の変形例300Cの構成を示す図である。 [Third variant of the audio decoding device of the 21st embodiment]
FIG. 319 is a diagram showing a configuration of a third modification 300C of the audio decoding device according to the 21st embodiment.

図320は、第21の実施形態に係る音声復号装置の第3の変形例300Cの動作を示すフローチャートである。 FIG. 320 is a flowchart showing the operation of the third modification 300C of the audio decoding device according to the 21st embodiment.

本変形例においては、前記低周波数時間包絡形状決定部16b、前記時間包絡修正部300aA、前記高周波数時間包絡形状決定部16d、及び前記低周波数時間包絡修正部16eを具備する。 In this modification, the low frequency time envelope shape determination unit 16b, the time envelope correction unit 300aA, the high frequency time envelope shape determination unit 16d, and the low frequency time envelope correction unit 16e are provided.

［第21の実施形態の音声復号装置の第4の変形例］
図321は、第21の実施形態に係る音声復号装置の第4の変形例300Dの構成を示す図である。 [Fourth variant of the audio decoding device of the 21st embodiment]
FIG. 321 is a diagram showing a configuration of a fourth modification 300D of the audio decoding device according to the 21st embodiment.

図322は、第21の実施形態に係る音声復号装置の第4の変形例300Dの動作を示すフローチャートである。 FIG. 322 is a flowchart showing the operation of the fourth modification 300D of the audio decoding device according to the 21st embodiment.

本変形例と前記第21の実施形態に係る音声復号装置300との相違点は、低周波数時間包絡形状決定部10e及び高周波数時間包絡形状決定部13aにかえて時間包絡形状決定部16fを具備する点である。 The difference between this modification and the voice decoding device 300 according to the 21st embodiment is that the time envelope shape determination unit 16f is provided instead of the low frequency time envelope shape determination unit 10e and the high frequency time envelope shape determination unit 13a. It is a point to do.

［第22の実施形態］
図129は、第22の実施形態に係る音声復号装置310の構成を示す図である。音声復号装置310の通信装置は、下記音声符号化装置410から出力される多重化された符号化系列を受信し、更に、復号した音声信号を外部に出力する。音声復号装置310は、図129に示すように、機能的には、符号化系列逆多重化部10a、コア復号部10b、分析フィルタバンク部10c、符号化系列解析部13c、低周波数時間包絡形状決定部10e、低周波数時間包絡修正部10f、高周波数時間包絡形状決定部13a、高周波数信号生成部10g、時間包絡修正部14a、復号/逆量子化部10h、周波数包絡調整部10i、及び合成フィルタバンク部10jを備える。 [22nd Embodiment]
FIG. 129 is a diagram showing a configuration of the audio decoding device 310 according to the 22nd embodiment. The communication device of the voice decoding device 310 receives the multiplexed coding sequence output from the following voice coding device 410, and further outputs the decoded voice signal to the outside. As shown in FIG. 129, the voice decoding device 310 functionally has a coded sequence demultiplexing section 10a, a core decoding section 10b, an analysis filter bank section 10c, a coded sequence analysis section 13c, and a low frequency time envelope shape. Determination unit 10e, low frequency time envelope correction unit 10f, high frequency time envelope shape determination unit 13a, high frequency signal generation unit 10g, time envelope correction unit 14a, decoding / inverse quantization unit 10h, frequency envelope adjustment unit 10i, and synthesis. It has a filter bank section 10j.

図130は、第22の実施形態に係る音声復号装置の動作を示すフローチャートである。 FIG. 130 is a flowchart showing the operation of the voice decoding device according to the 22nd embodiment.

本発明第8の実施形態の音声復号装置17との相違点は、高周波数信号生成部10gが、分析フィルタバンク部10cから出力される低周波数信号の複数のサブバンド信号に代えて、低周波数時間包絡修正部10fから出力される時間包絡形状を修正された低周波数信号の複数のサブバンド信号を用いて高周波数信号を生成する点である。 The difference from the voice decoding device 17 of the eighth embodiment of the present invention is that the high frequency signal generation unit 10g replaces a plurality of subband signals of the low frequency signal output from the analysis filter bank unit 10c with a low frequency. The point is that a high frequency signal is generated by using a plurality of subband signals of the low frequency signal whose time wrapping shape is corrected, which is output from the time wrapping correction unit 10f.

なお、本実施形態に係る音声復号装置310の低周波数時間包絡形状決定部10eに対して、本発明の第1の実施形態の音声復号装置の第1、第2、及び第3の変形例が適用できることは明白である。 It should be noted that the first, second, and third modifications of the voice decoding device according to the first embodiment of the present invention are provided with respect to the low frequency time envelope shape determining unit 10e of the voice decoding device 310 according to the present invention. It is clear that it is applicable.

さらには、本実施形態に係る音声復号装置310の高周波数時間包絡形状決定部13aに対して、本発明の第4の実施形態の音声復号装置の第1、第2、及び第3の変形例、本発明第5の実施形態の音声復号装置の第1の変形例、及び本発明第7の実施形態の音声復号装置の第1の変形例が適用できることは明白である。 Further, the first, second, and third modifications of the voice decoding device according to the fourth embodiment of the present invention are provided with respect to the high frequency time wrapping shape determining unit 13a of the voice decoding device 310 according to the present invention. It is clear that the first modification of the voice decoding device according to the fifth embodiment of the present invention and the first modification of the voice decoding device according to the seventh embodiment of the present invention can be applied.

図131は、第19の実施形態に係る音声符号化装置410の構成を示す図である。音声符号化装置410の通信装置は、符号化の対象となる音声信号を外部から受信し、更に、符号化された符号化系列を外部に出力する。音声符号化装置410は、図131に示すように、機能的には、ダウンサンプリング部20a、コア符号化部20b、分析フィルタバンク部20c及び20c1、制御パラメータ符号化部20d、包絡算出部270d、量子化/符号化部20f、コア復号信号生成部20i、サブバンド信号パワー算出部20j及び24b、擬似高周波数信号生成部410b、時間包絡情報符号化部410a、及び符号化系列多重化部270cを備える。 FIG. 131 is a diagram showing the configuration of the voice coding device 410 according to the nineteenth embodiment. The communication device of the voice coding device 410 receives the voice signal to be coded from the outside, and further outputs the coded coding sequence to the outside. As shown in FIG. 131, the voice coding device 410 functionally includes a downsampling unit 20a, a core coding unit 20b, an analysis filter bank unit 20c and 20c1, a control parameter coding unit 20d, and an entrapment calculation unit 270d. Quantization / coding unit 20f, core decoding signal generation unit 20i, subband signal power calculation unit 20j and 24b, pseudo high frequency signal generation unit 410b, time wrapping information coding unit 410a, and coding sequence multiplexing unit 270c. Be prepared.

図132は、第22の実施形態に係る音声符号化装置410の動作を示すフローチャートである。 FIG. 132 is a flowchart showing the operation of the voice coding device 410 according to the 22nd embodiment.

時間包絡情報符号化部410aは、入力音声信号の低周波数信号の時間包絡、コア復号信号の時間包絡のうち少なくとも一つ以上を算出し、算出された時間包絡より、低周波数信号に関する時間包絡情報を符号化する（ステップS410-1）。 The time envelope information coding unit 410a calculates at least one of the time envelope of the low frequency signal of the input voice signal and the time envelope of the core decoded signal, and from the calculated time envelope, the time envelope information regarding the low frequency signal. Is encoded (step S410-1).

擬似高周波数信号生成部410bは、分析フィルタバンク部20cで得られる入力音声信号の低周波数信号のサブバンド信号と、制御パラメータ符号化部20dで得られる高周波数信号を生成するために必要な制御パラメータに基づいて、擬似高周波数信号を生成する（ステップS410-2）。擬似高周波数信号生成部24aとの相違点は、擬似高周波数信号を生成する際に、時間包絡情報符号化部410aにて符号化された低周波数信号に関する時間包絡情報を用いて、分析フィルタバンク部20cで得られる入力音声信号の低周波数信号のサブバンド信号を修正することができる点である。 The pseudo high frequency signal generation unit 410b is a control required to generate a subband signal of a low frequency signal of the input audio signal obtained by the analysis filter bank unit 20c and a high frequency signal obtained by the control parameter coding unit 20d. Generate a pseudo high frequency signal based on the parameters (step S410-2). The difference from the pseudo high frequency signal generation unit 24a is that when generating the pseudo high frequency signal, the analysis filter bank uses the time wrapping information related to the low frequency signal encoded by the time wrapping information coding unit 410a. The point is that the sub-band signal of the low frequency signal of the input audio signal obtained in the part 20c can be corrected.

時間包絡情報符号化部410aは、入力音声信号の高周波数信号の時間包絡、擬似高周波数信号の時間包絡のうち少なくとも一つ以上を算出し、算出された時間包絡より、高周波数信号に関する時間包絡情報を符号化する（ステップS410-3）。 The time envelope information coding unit 410a calculates at least one of the time envelope of the high frequency signal of the input voice signal and the time envelope of the pseudo high frequency signal, and from the calculated time envelope, the time envelope related to the high frequency signal. Encode the information (step S410-3).

なお、時間包絡情報符号化部410aは、低周波数信号に関する時間包絡情報と高周波数信号に関する時間包絡情報とを別々に符号化した符号化系列として出力することができ、また当該低周波数信号に関する時間包絡情報と高周波数信号に関する時間包絡情報とをあわせて符号化した符号化系列として出力することもでき、本発明において時間包絡情報の符号化系列の形式は限定されない。また、第7の実施形態の音声符号化装置26の時間包絡情報符号化部26aの動作と同様に、当該低周波数時間包絡情報と高周波数時間包絡情報の符号化の方法は限定されない。 The time-wrapping information coding unit 410a can output the time-wrapping information related to the low-frequency signal and the time-wrapping information related to the high-frequency signal as separately encoded coding sequences, and the time related to the low-frequency signal. It is also possible to output the inclusion information and the time inclusion information related to the high frequency signal as a coded sequence, and the format of the coding sequence of the time inclusion information is not limited in the present invention. Further, the method of encoding the low frequency time envelope information and the high frequency time envelope information is not limited as in the operation of the time envelope information coding unit 26a of the voice coding device 26 of the seventh embodiment.

なお、擬似高周波数信号生成部410bにて擬似高周波数信号を生成する際に、時間包絡情報符号化部410aにて符号化された低周波数信号に関する時間包絡情報を用いない場合は、時間包絡情報符号化部410aはステップS410-1及びS410-3の処理を一緒に実施することができる。例えば、時間包絡情報符号化部27aと同様にして、入力音声信号の低周波数信号の時間包絡、高周波数信号の時間包絡、コア復号信号の時間包絡、擬似高周波数信号の時間包絡のうち少なくとも一つ以上を算出し、算出された時間包絡より、時間包絡情報を符号化することができる。 When the pseudo high frequency signal generation unit 410b generates the pseudo high frequency signal, if the time inclusion information related to the low frequency signal encoded by the time inclusion information coding unit 410a is not used, the time inclusion information The coding unit 410a can carry out the processes of steps S410-1 and S410-3 together. For example, similarly to the time envelope information coding unit 27a, at least one of the time envelope of the low frequency signal of the input audio signal, the time envelope of the high frequency signal, the time envelope of the core decoded signal, and the time envelope of the pseudo high frequency signal. One or more can be calculated, and the time envelope information can be encoded from the calculated time envelope.

なお、本実施形態に係る音声符号化装置410に対して、本発明の第7の実施形態の音声符号化装置の第1の変形例が適用できることは明白である。また、高周波数信号の時間包絡情報は、低周波数信号の時間包絡情報を元に生成可能である。 It is clear that the first modification of the voice coding device of the seventh embodiment of the present invention can be applied to the voice coding device 410 according to the present embodiment. Further, the time envelope information of the high frequency signal can be generated based on the time envelope information of the low frequency signal.

［第22の実施形態の音声復号装置の第1の変形例］
図323は、第22の実施形態に係る音声復号装置の第1の変形例310Aの構成を示す図である。 [First modification of the audio decoding device of the 22nd embodiment]
FIG. 323 is a diagram showing a configuration of a first modification 310A of the audio decoding device according to the 22nd embodiment.

図324は、第22の実施形態に係る音声復号装置の第1の変形例310Aの動作を示すフローチャートである。 FIG. 324 is a flowchart showing the operation of the first modification 310A of the audio decoding device according to the 22nd embodiment.

本変形例と第22の実施形態に係る音声復号装置310との相違点は、低周波数時間包絡形状決定部10eC（10e、10eA、及び10eBでもよいことは明白）、時間包絡修正部14aにかえて、低周波数時間包絡形状決定部16b、時間包絡修正部17aを具備する点である。 The difference between this modification and the audio decoding device 310 according to the 22nd embodiment is that the low frequency time envelope shape determination unit 10eC (clearly, 10e, 10eA, and 10eB may be used) and the time envelope correction unit 14a are replaced. Therefore, it is provided with a low frequency time envelope shape determining unit 16b and a time envelope correction unit 17a.

［第22の実施形態の音声復号装置の第2の変形例］
図325は、第22の実施形態に係る音声復号装置の第2の変形例310Bの構成を示す図である。 [Second variant of the audio decoding device of the 22nd embodiment]
FIG. 325 is a diagram showing a configuration of a second modification 310B of the audio decoding device according to the 22nd embodiment.

図326は、第22の実施形態に係る音声復号装置の第2の変形例310Bの動作を示すフローチャートである。 FIG. 326 is a flowchart showing the operation of the second modification 310B of the audio decoding device according to the 22nd embodiment.

本変形例と第22の実施形態に係る音声復号装置310との相違点は、高周波数時間包絡形状決定部13aC（13a、13aA、及び13aBでもよいことは明白）、低周波数時間包絡修正部10fにかえて、高周波数時間包絡形状決定部16d、低周波数時間包絡修正部16eを具備する点である。 The difference between this modification and the audio decoding device 310 according to the 22nd embodiment is the high frequency time envelope shape determination unit 13aC (it is clear that 13a, 13aA, and 13aB may be used), and the low frequency time envelope correction unit 10f. Instead, it is provided with a high frequency time envelope shape determining unit 16d and a low frequency time envelope correction unit 16e.

［第22の実施形態の音声復号装置の第3の変形例］
図327は、第22の実施形態に係る音声復号装置の第3の変形例310Cの構成を示す図である。 [Third variant of the audio decoding device of the 22nd embodiment]
FIG. 327 is a diagram showing a configuration of a third modification 310C of the audio decoding device according to the 22nd embodiment.

図328は、第22の実施形態に係る音声復号装置の第3の変形例310Cの動作を示すフローチャートである。 FIG. 328 is a flowchart showing the operation of the third modification 310C of the audio decoding device according to the 22nd embodiment.

［第22の実施形態の音声復号装置の第4の変形例］
図329は、第22の実施形態に係る音声復号装置の第4の変形例310Dの構成を示す図である。 [Fourth variant of the audio decoding device of the 22nd embodiment]
FIG. 329 is a diagram showing a configuration of a fourth modification 310D of the audio decoding device according to the 22nd embodiment.

図330は、第22の実施形態に係る音声復号装置の第4の変形例310Dの動作を示すフローチャートである。 FIG. 330 is a flowchart showing the operation of the fourth modification 310D of the audio decoding device according to the 22nd embodiment.

本変形例と前記第22の実施形態に係る音声復号装置310との相違点は、低周波数時間包絡形状決定部10e及び高周波数時間包絡形状決定部13aにかえて時間包絡形状決定部16fを具備する点である。 The difference between this modification and the voice decoding device 310 according to the 22nd embodiment is that the time envelope shape determination unit 16f is provided instead of the low frequency time envelope shape determination unit 10e and the high frequency time envelope shape determination unit 13a. It is a point to do.

［第23の実施形態］
図133は、第23の実施形態に係る音声復号装置320の構成を示す図である。音声復号装置320の通信装置は、下記音声符号化装置420から出力される多重化された符号化系列を受信し、更に、復号した音声信号を外部に出力する。音声復号装置320は、図133に示すように、機能的には、符号化系列逆多重化部10a、コア復号部10b、分析フィルタバンク部10c、符号化系列解析部13c、低周波数時間包絡形状決定部10e、低周波数時間包絡修正部10f、高周波数信号生成部10g、復号/逆量子化部10h、周波数包絡調整部10i、高周波数時間包絡形状決定部13a、時間包絡修正部14a、及び合成フィルタバンク部10jを備える。 [23rd Embodiment]
FIG. 133 is a diagram showing the configuration of the audio decoding device 320 according to the 23rd embodiment. The communication device of the voice decoding device 320 receives the multiplexed coding sequence output from the following voice coding device 420, and further outputs the decoded voice signal to the outside. As shown in FIG. 133, the voice decoding device 320 functionally has a coded sequence demultiplexing section 10a, a core decoding section 10b, an analysis filter bank section 10c, a coded sequence analysis section 13c, and a low frequency time envelope shape. Determination unit 10e, low frequency time envelope correction unit 10f, high frequency signal generation unit 10g, decoding / dequantization unit 10h, frequency envelope adjustment unit 10i, high frequency time envelope shape determination unit 13a, time envelope correction unit 14a, and synthesis. It has a filter bank section 10j.

図134は、第23の実施形態に係る音声復号装置の動作を示すフローチャートである。 FIG. 134 is a flowchart showing the operation of the voice decoding device according to the 23rd embodiment.

前記第9の実施形態の音声復号装置18との相違点は、高周波数信号生成部10gが、分析フィルタバンク部10cから出力される低周波数信号の複数のサブバンド信号に代えて、低周波数時間包絡修正部10fから出力される時間包絡形状を修正された低周波数信号の複数のサブバンド信号を用いて高周波数信号を生成する点である。 The difference from the voice decoding device 18 of the ninth embodiment is that the high frequency signal generation unit 10g replaces the plurality of subband signals of the low frequency signal output from the analysis filter bank unit 10c with the low frequency time. The point is that a high frequency signal is generated by using a plurality of subband signals of the low frequency signal whose time wrapping shape is corrected, which is output from the wrapping correction unit 10f.

なお、本実施形態に係る音声復号装置320の低周波数時間包絡形状決定部10eに対して、本発明の第1の実施形態の音声復号装置の第1、第2、及び第3の変形例が適用できることは明白である。 It should be noted that the first, second, and third modifications of the voice decoding device according to the first embodiment of the present invention are provided for the low frequency time envelope shape determining unit 10e of the voice decoding device 320 according to the present invention. It is clear that it is applicable.

さらには、本実施形態に係る音声復号装置320の高周波数時間包絡形状決定部13aに対して、本発明の第4の実施形態の音声復号装置の第1、第2、及び第3の変形例、本発明第5の実施形態の音声復号装置の第1の変形例、及び本発明第7の実施形態の音声復号装置の第1の変形例が適用できることは明白である。 Further, the first, second, and third modifications of the voice decoding device according to the fourth embodiment of the present invention are provided with respect to the high frequency time wrapping shape determining unit 13a of the voice decoding device 320 according to the present invention. It is clear that the first modification of the voice decoding device according to the fifth embodiment of the present invention and the first modification of the voice decoding device according to the seventh embodiment of the present invention can be applied.

図135は、第23の実施形態に係る音声符号化装置420の構成を示す図である。音声符号化装置420の通信装置は、符号化の対象となる音声信号を外部から受信し、更に、符号化された符号化系列を外部に出力する。音声符号化装置420は、図135に示すように、機能的には、ダウンサンプリング部20a、コア符号化部20b、分析フィルタバンク部20c及び20c1、制御パラメータ符号化部20d、包絡算出部20e、量子化/符号化部20f、擬似高周波数信号生成部410b、周波数包絡調整部25a、コア復号信号生成部20i、サブバンド信号パワー算出部20j及び24b、時間包絡情報符号化部420a、及び符号化系列多重化部20hを備える。 FIG. 135 is a diagram showing the configuration of the voice coding device 420 according to the 23rd embodiment. The communication device of the voice coding device 420 receives the voice signal to be coded from the outside, and further outputs the coded coding sequence to the outside. As shown in FIG. 135, the voice coding device 420 functionally includes a downsampling unit 20a, a core coding unit 20b, an analysis filter bank unit 20c and 20c1, a control parameter coding unit 20d, and an entrapment calculation unit 20e. Quantization / coding unit 20f, pseudo high frequency signal generation unit 410b, frequency wrapping adjustment unit 25a, core decoding signal generation unit 20i, subband signal power calculation unit 20j and 24b, time wrapping information coding unit 420a, and coding It is equipped with a sequence multiplexing unit 20h.

図136は、第23の実施形態に係る音声符号化装置420の動作を示すフローチャートである。 FIG. 136 is a flowchart showing the operation of the voice coding device 420 according to the 23rd embodiment.

時間包絡情報符号化部420aは、入力音声信号の高周波数信号の時間包絡、波数包絡調整された擬似高周波数信号の時間包絡のうち少なくとも一つ以上を算出し、算出された時間包絡より、高周波数信号に関する時間包絡情報を符号化する（ステップS420-1）。 The time envelope information coding unit 420a calculates at least one of the time envelope of the high frequency signal of the input voice signal and the time envelope of the pseudo high frequency signal adjusted by the wave number envelope, and is higher than the calculated time envelope. Encode the time envelope information about the frequency signal (step S420-1).

なお、時間包絡情報符号化部420aは、低周波数信号に関する時間包絡情報と高周波数信号に関する時間包絡情報とを別々に符号化した符号化系列として出力することができ、また当該低周波数信号に関する時間包絡情報と高周波数信号に関する時間包絡情報とをあわせて符号化した符号化系列として出力することもでき、本発明において時間包絡情報の符号化系列の形式は限定されない。また、第7の実施形態の音声符号化装置26の時間包絡情報符号化部26aの動作と同様に、当該低周波数時間包絡情報と高周波数時間包絡情報の符号化の方法は限定されない。 The time-wrapping information coding unit 420a can output the time-wrapping information related to the low-frequency signal and the time-wrapping information related to the high-frequency signal as separately encoded coding sequences, and the time related to the low-frequency signal. It is also possible to output the inclusion information and the time inclusion information related to the high frequency signal as a coded sequence, and the format of the coding sequence of the time inclusion information is not limited in the present invention. Further, the method of encoding the low frequency time envelope information and the high frequency time envelope information is not limited as in the operation of the time envelope information coding unit 26a of the voice coding device 26 of the seventh embodiment.

なお、前記第22の実施形態に係る音声符号化装置410と同様に、時間包絡情報符号化部420aはステップS410-1及びS420-1の処理を一緒に実施することができる。また、本実施形態に係る音声符号化装置420に対して、本発明の第7の実施形態の音声符号化装置の第1の変形例が適用できることは明白である。また、高周波数信号の時間包絡情報は、低周波数信号の時間包絡情報を元に生成可能である。 Similarly to the voice coding device 410 according to the 22nd embodiment, the time-envelope information coding unit 420a can carry out the processes of steps S410-1 and S420-1 together. Further, it is clear that the first modification of the voice coding device of the seventh embodiment of the present invention can be applied to the voice coding device 420 according to the present embodiment. Further, the time envelope information of the high frequency signal can be generated based on the time envelope information of the low frequency signal.

［第23の実施形態の音声復号装置の第1の変形例］
図137は、第23の実施形態の第1の変形例に係る音声復号装置320Aの構成を示す図である。 [First modification of the audio decoding device of the 23rd embodiment]
FIG. 137 is a diagram showing a configuration of an audio decoding device 320A according to a first modification of the 23rd embodiment.

図138は、第23の実施形態の第1の変形例に係る音声復号装置320Aの動作を示すフローチャートである。 FIG. 138 is a flowchart showing the operation of the audio decoding device 320A according to the first modification of the 23rd embodiment.

前記第23の実施形態に係る音声復号装置320との相違点は、時間包絡修正部15aに代えて、時間包絡修正部15aAを用いている点である。 The difference from the voice decoding device 320 according to the 23rd embodiment is that the time envelope correction unit 15aA is used instead of the time envelope correction unit 15a.

なお、本変形例に係る音声復号装置320Aの低周波数時間包絡形状決定部10eに対して、本発明の第1の実施形態の音声復号装置の第1、第2、及び第3の変形例が適用できることは明白である。 It should be noted that the first, second, and third modifications of the voice decoding device according to the first embodiment of the present invention are provided with respect to the low frequency time envelope shape determining unit 10e of the voice decoding device 320A according to the present modification. It is clear that it is applicable.

さらには、本変形例に係る音声復号装置320Aの高周波数時間包絡形状決定部13aに対して、本発明の第4の実施形態の音声復号装置の第1、第2、及び第3の変形例、本発明第5の実施形態の音声復号装置の第1の変形例、及び本発明第7の実施形態の音声復号装置の第1の変形例が適用できることは明白である。 Further, with respect to the high frequency time wrapping shape determining unit 13a of the voice decoding device 320A according to the present modification, the first, second, and third modifications of the voice decoding device according to the fourth embodiment of the present invention. It is clear that the first modification of the voice decoding device according to the fifth embodiment of the present invention and the first modification of the voice decoding device according to the seventh embodiment of the present invention can be applied.

［第23の実施形態の音声復号装置の第2の変形例］
図331は、第23の実施形態に係る音声復号装置の第2の変形例320Bの構成を示す図である。 [Second variant of the audio decoding device of the 23rd embodiment]
FIG. 331 is a diagram showing a configuration of a second modification 320B of the audio decoding device according to the 23rd embodiment.

図332は、第23の実施形態に係る音声復号装置の第2の変形例320Bの動作を示すフローチャートである。 FIG. 332 is a flowchart showing the operation of the second modification 320B of the audio decoding device according to the 23rd embodiment.

本変形例と第23の実施形態に係る音声復号装置320との相違点は、低周波数時間包絡形状決定部10eC（10e、10eA、及び10eBでもよいことは明白）、時間包絡修正部15aにかえて、低周波数時間包絡形状決定部16b、時間包絡修正部18aを具備する点である。 The difference between this modification and the audio decoding device 320 according to the 23rd embodiment is that the low frequency time envelope shape determination unit 10eC (clearly, 10e, 10eA, and 10eB may be used) and the time envelope correction unit 15a are replaced. Therefore, it is provided with a low frequency time envelope shape determining unit 16b and a time envelope correction unit 18a.

［第23の実施形態の音声復号装置の第3の変形例］
図333は、第23の実施形態に係る音声復号装置の第3の変形例320Cの構成を示す図である。 [Third variant of the audio decoding device of the 23rd embodiment]
FIG. 333 is a diagram showing a configuration of a third modification 320C of the audio decoding device according to the 23rd embodiment.

図334は、第23の実施形態に係る音声復号装置の第3の変形例320Cの動作を示すフローチャートである。 FIG. 334 is a flowchart showing the operation of the third modification 320C of the audio decoding device according to the 23rd embodiment.

本変形例と第23の実施形態に係る音声復号装置320との相違点は、高周波数時間包絡形状決定部13aC（13a、13aA、及び13aBでもよいことは明白）、低周波数時間包絡修正部10fにかえて、高周波数時間包絡形状決定部16d、低周波数時間包絡修正部16eを具備する点である。 The difference between this modification and the audio decoding device 320 according to the 23rd embodiment is the high frequency time envelope shape determination unit 13aC (it is clear that 13a, 13aA, and 13aB may be used), and the low frequency time envelope correction unit 10f. Instead, it is provided with a high frequency time envelope shape determining unit 16d and a low frequency time envelope correction unit 16e.

［第23の実施形態の音声復号装置の第4の変形例］
図335は、第23の実施形態に係る音声復号装置の第4の変形例320Dの構成を示す図である。 [Fourth variant of the audio decoding device of the 23rd embodiment]
FIG. 335 is a diagram showing a configuration of a fourth modification 320D of the audio decoding device according to the 23rd embodiment.

図336は、第23の実施形態に係る音声復号装置の第4の変形例320Dの動作を示すフローチャートである。 FIG. 336 is a flowchart showing the operation of the fourth modification 320D of the audio decoding device according to the 23rd embodiment.

［第23の実施形態の音声復号装置の第5の変形例］
図337は、第23の実施形態に係る音声復号装置の第5の変形例320Eの構成を示す図である。 [Fifth variant of the audio decoding device of the 23rd embodiment]
FIG. 337 is a diagram showing a configuration of a fifth modification 320E of the audio decoding device according to the 23rd embodiment.

図338は、第23の実施形態に係る音声復号装置の第5の変形例320Eの動作を示すフローチャートである。 FIG. 338 is a flowchart showing the operation of the fifth modification 320E of the audio decoding device according to the 23rd embodiment.

本変形例と前記第23の実施形態に係る音声復号装置320との相違点は、低周波数時間包絡形状決定部10e及び高周波数時間包絡形状決定部13aにかえて時間包絡形状決定部16fを具備する点である。 The difference between this modification and the voice decoding device 320 according to the 23rd embodiment is that the time envelope shape determination unit 16f is provided instead of the low frequency time envelope shape determination unit 10e and the high frequency time envelope shape determination unit 13a. It is a point to do.

［第23の実施形態の音声復号装置の第6の変形例］
図339は、第23の実施形態に係る音声復号装置の第6の変形例320Fの構成を示す図である。 [Sixth variant of the audio decoding device of the 23rd embodiment]
FIG. 339 is a diagram showing a configuration of a sixth modification 320F of the audio decoding device according to the 23rd embodiment.

図340は、第23の実施形態に係る音声復号装置の第6の変形例320Fの動作を示すフローチャートである。 FIG. 340 is a flowchart showing the operation of the sixth modification 320F of the audio decoding device according to the 23rd embodiment.

本変形例と第23の実施形態の第1の変形例に係る音声復号装置320Aとの相違点は、低周波数時間包絡形状決定部10eC（10e、10eA、及び10eBでもよいことは明白）、時間包絡修正部15aAにかえて、低周波数時間包絡形状決定部16b、時間包絡修正部18aAを具備する点である。 The difference between this modification and the voice decoding device 320A according to the first modification of the 23rd embodiment is the low frequency time envelope shape determination unit 10eC (clearly, 10e, 10eA, and 10eB may be used), time. Instead of the envelope correction unit 15aA, the low frequency time envelope shape determination unit 16b and the time envelope correction unit 18aA are provided.

［第23の実施形態の音声復号装置の第7の変形例］
図341は、第23の実施形態に係る音声復号装置の第7の変形例320Gの構成を示す図である。 [7th variant of the audio decoding device of the 23rd embodiment]
FIG. 341 is a diagram showing a configuration of a seventh modification 320G of the audio decoding device according to the 23rd embodiment.

図342は、第23の実施形態に係る音声復号装置の第7の変形例320Gの動作を示すフローチャートである。 FIG. 342 is a flowchart showing the operation of the seventh modification 320G of the voice decoding device according to the 23rd embodiment.

本変形例と第23の実施形態の第1の変形例に係る音声復号装置320Aとの相違点は、高周波数時間包絡形状決定部13aC（13a、13aA、及び13aBでもよいことは明白）、低周波数時間包絡修正部10fにかえて、高周波数時間包絡形状決定部16d、低周波数時間包絡修正部16eを具備する点である。 The difference between this modification and the voice decoding device 320A according to the first modification of the 23rd embodiment is the high frequency time envelope shape determination unit 13aC (clearly, 13a, 13aA, and 13aB may be used) and low. Instead of the frequency time envelope correction unit 10f, the high frequency time envelope shape determination unit 16d and the low frequency time envelope correction unit 16e are provided.

［第23の実施形態の音声復号装置の第8の変形例］
図343は、第23の実施形態に係る音声復号装置の第8の変形例320Hの構成を示す図である。 [8th variant of the audio decoding device of the 23rd embodiment]
FIG. 343 is a diagram showing a configuration of an eighth modification 320H of the audio decoding device according to the 23rd embodiment.

図344は、第23の実施形態に係る音声復号装置の第8の変形例320Hの動作を示すフローチャートである。 FIG. 344 is a flowchart showing the operation of the eighth modification 320H of the audio decoding device according to the 23rd embodiment.

［第23の実施形態の音声復号装置の第9の変形例］
図345は、第23の実施形態に係る音声復号装置の第9の変形例320Iの構成を示す図である。 [Ninth variant of the audio decoding device of the 23rd embodiment]
FIG. 345 is a diagram showing a configuration of a ninth modification 320I of the audio decoding device according to the 23rd embodiment.

図346は、第23の実施形態に係る音声復号装置の第9の変形例320Iの動作を示すフローチャートである。 FIG. 346 is a flowchart showing the operation of the ninth modification 320I of the audio decoding device according to the 23rd embodiment.

本変形例と前記第23の実施形態の第1の変形例に係る音声復号装置320Aとの相違点は、低周波数時間包絡形状決定部10e及び高周波数時間包絡形状決定部13aにかえて時間包絡形状決定部16fを具備する点である。 The difference between this modification and the voice decoding device 320A according to the first modification of the 23rd embodiment is that the time envelopment is replaced with the low frequency time envelope shape determination unit 10e and the high frequency time envelope shape determination unit 13a. The point is that the shape determining portion 16f is provided.

［第24の実施形態］
図139は、第24の実施形態に係る音声復号装置330の構成を示す図である。音声復号装置330の通信装置は、下記音声符号化装置430から出力される多重化された符号化系列を受信し、更に、復号した音声信号を外部に出力する。音声復号装置330は、図139に示すように、機能的には、符号化系列逆多重化部170a、スイッチ群170b、コア復号部10b、分析フィルタバンク部10c、符号化系列解析部13c、低周波数時間包絡形状決定部10e、低周波数時間包絡修正部10f、高周波数時間包絡形状決定部13a、時間包絡修正部300a、高周波数信号生成部10g、復号/逆量子化部10h、周波数包絡調整部10i、及び合成フィルタバンク部170cを備える。 [24th Embodiment]
FIG. 139 is a diagram showing a configuration of the audio decoding device 330 according to the 24th embodiment. The communication device of the voice decoding device 330 receives the multiplexed coding sequence output from the following voice coding device 430, and further outputs the decoded voice signal to the outside. As shown in FIG. 139, the voice decoding device 330 functionally has a coded sequence demultiplexing section 170a, a switch group 170b, a core decoding section 10b, an analysis filter bank section 10c, a coded sequence analysis section 13c, and a low frequency. Frequency time envelope shape determination unit 10e, low frequency time envelope correction unit 10f, high frequency time envelope shape determination unit 13a, time envelope correction unit 300a, high frequency signal generation unit 10g, decoding / inverse quantization unit 10h, frequency envelope adjustment unit It is provided with 10i and a synthetic filter bank unit 170c.

図140は、第24の実施形態に係る音声復号装置の動作を示すフローチャートである。なお、ステップS170-2およびS170-3の処理を行う順番については、高周波数信号の時間包絡形状の決定および帯域拡張部分の復号・逆量子化の処理の前であればよく、図140のフローチャートの順番に制限されない。 FIG. 140 is a flowchart showing the operation of the voice decoding device according to the 24th embodiment. The order in which the processes of steps S170-2 and S170-3 are performed may be before the determination of the time envelope shape of the high frequency signal and the decoding / dequantization process of the band extension portion, and the flowchart of FIG. 140 The order of is not limited.

なお、本変形例に係る音声復号装置330の低周波数時間包絡形状決定部10eに対して、本発明の第1の実施形態の音声復号装置の第1、第2、及び第3の変形例が適用できることは明白である。 It should be noted that the first, second, and third modifications of the voice decoding device according to the first embodiment of the present invention are provided with respect to the low frequency time envelope shape determining unit 10e of the voice decoding device 330 according to the present modification. It is clear that it is applicable.

さらには、本変形例に係る音声復号装置330の高周波数時間包絡形状決定部13aに対して、本発明の第4の実施形態の音声復号装置の第1、第2、及び第3の変形例、及び本発明第7の実施形態の音声復号装置の第1の変形例が適用できることは明白である。 Further, with respect to the high frequency time entrainment shape determining unit 13a of the voice decoding device 330 according to the present modification, the first, second, and third modifications of the voice decoding device according to the fourth embodiment of the present invention. , And the first modification of the voice decoding apparatus of the seventh embodiment of the present invention is clearly applicable.

図141は、第24の実施形態に係る音声符号化装置430の構成を示す図である。音声符号化装置430の通信装置は、符号化の対象となる音声信号を外部から受信し、更に、符号化された符号化系列を外部に出力する。音声符号化装置430は、図141に示すように、機能的には、高周波数信号生成制御情報符号化部270a、ダウンサンプリング部20a、コア符号化部20b、分析フィルタバンク部20c及び20c1、制御パラメータ符号化部20d、包絡算出部20e、量子化/符号化部20f、コア復号信号生成部20i、サブバンド信号パワー算出部20j、時間包絡情報符号化部400a、及び符号化系列多重化部270cを備える。 FIG. 141 is a diagram showing the configuration of the voice coding device 430 according to the 24th embodiment. The communication device of the voice coding device 430 receives the voice signal to be coded from the outside, and further outputs the coded coding sequence to the outside. As shown in FIG. 141, the voice coding device 430 functionally includes a high frequency signal generation control information coding unit 270a, a downsampling unit 20a, a core coding unit 20b, an analysis filter bank unit 20c and 20c1, and control. Parameter coding unit 20d, inclusion calculation unit 20e, quantization / coding unit 20f, core decoding signal generation unit 20i, subband signal power calculation unit 20j, time inclusion information coding unit 400a, and coding sequence multiplexing unit 270c. To be equipped.

図142は、第24の実施形態に係る音声符号化装置430の動作を示すフローチャートである。時間包絡情報符号化部400aは、ステップS400-1にて時間包絡情報を算出・符号化する。なお、高周波数信号の時間包絡情報は、低周波数信号の時間包絡情報を元に生成可能である。 FIG. 142 is a flowchart showing the operation of the voice coding device 430 according to the 24th embodiment. The time envelope information coding unit 400a calculates and encodes the time envelope information in step S400-1. The time envelope information of the high frequency signal can be generated based on the time envelope information of the low frequency signal.

［第24の実施形態の音声復号装置の第1の変形例］
図347は、第24の実施形態に係る音声復号装置の第1の変形例330Aの構成を示す図である。 [First modification of the audio decoding device of the 24th embodiment]
FIG. 347 is a diagram showing a configuration of a first modification 330A of the audio decoding device according to the 24th embodiment.

図348は、第24の実施形態に係る音声復号装置の第1の変形例330Aの動作を示すフローチャートである。 FIG. 348 is a flowchart showing the operation of the first modification 330A of the audio decoding device according to the 24th embodiment.

本変形例と第24の実施形態に係る音声復号装置330との相違点は、低周波数時間包絡形状決定部10eC（10e、10eA、及び10eBでもよいことは明白）、時間包絡修正部300aにかえて、低周波数時間包絡形状決定部16b、時間包絡修正部300aAを具備する点である。 The difference between this modification and the audio decoding device 330 according to the 24th embodiment is that the low frequency time envelope shape determination unit 10eC (clearly, 10e, 10eA, and 10eB may be used) and the time envelope correction unit 300a are replaced. Therefore, it is provided with a low frequency time envelope shape determination unit 16b and a time envelope correction unit 300aA.

［第24の実施形態の音声復号装置の第2の変形例］
図349は、第24の実施形態に係る音声復号装置の第2の変形例330Bの構成を示す図である。 [Second variant of the audio decoding device of the 24th embodiment]
FIG. 349 is a diagram showing a configuration of a second modification 330B of the audio decoding device according to the 24th embodiment.

図350は、第24の実施形態に係る音声復号装置の第2の変形例330Bの動作を示すフローチャートである。 FIG. 350 is a flowchart showing the operation of the second modification 330B of the audio decoding device according to the 24th embodiment.

本変形例と第24の実施形態に係る音声復号装置330との相違点は、高周波数時間包絡形状決定部13aC（13a、13aA、及び13aBでもよいことは明白）、低周波数時間包絡修正部10fにかえて、高周波数時間包絡形状決定部16d、低周波数時間包絡修正部16eを具備する点である。 The difference between this modification and the audio decoding device 330 according to the 24th embodiment is the high frequency time envelope shape determining unit 13aC (it is clear that 13a, 13aA, and 13aB may be used), and the low frequency time envelope correction unit 10f. Instead, it is provided with a high frequency time envelope shape determining unit 16d and a low frequency time envelope correction unit 16e.

［第24の実施形態の音声復号装置の第3の変形例］
図351は、第24の実施形態に係る音声復号装置の第3の変形例330Cの構成を示す図である。 [Third variant of the audio decoding device of the 24th embodiment]
FIG. 351 is a diagram showing a configuration of a third modification 330C of the audio decoding device according to the 24th embodiment.

図352は、第24の実施形態に係る音声復号装置の第3の変形例330Cの動作を示すフローチャートである。 FIG. 352 is a flowchart showing the operation of the third modification 330C of the audio decoding device according to the 24th embodiment.

［第24の実施形態の音声復号装置の第4の変形例］
図353は、第24の実施形態に係る音声復号装置の第4の変形例330Dの構成を示す図である。 [Fourth Modification Example of Audio Decoding Device of the 24th Embodiment]
FIG. 353 is a diagram showing a configuration of a fourth modification 330D of the audio decoding device according to the 24th embodiment.

図354は、第24の実施形態に係る音声復号装置の第4の変形例330Dの動作を示すフローチャートである。 FIG. 354 is a flowchart showing the operation of the fourth modification 330D of the audio decoding device according to the 24th embodiment.

本変形例と前記第24の実施形態に係る音声復号装置330との相違点は、低周波数時間包絡形状決定部10e及び高周波数時間包絡形状決定部13aにかえて時間包絡形状決定部16fを具備する点である。 The difference between this modification and the audio decoding device 330 according to the 24th embodiment is that the time envelope shape determination unit 16f is provided instead of the low frequency time envelope shape determination unit 10e and the high frequency time envelope shape determination unit 13a. It is a point to do.

［第25の実施形態］
図143は、第25の実施形態に係る音声復号装置340の構成を示す図である。音声復号装置340の通信装置は、下記音声符号化装置440から出力される多重化された符号化系列を受信し、更に、復号した音声信号を外部に出力する。音声復号装置340は、図143に示すように、機能的には、符号化系列逆多重化部170a、スイッチ群170b、コア復号部10b、分析フィルタバンク部10c、符号化系列解析部13c、低周波数時間包絡形状決定部10e、低周波数時間包絡修正部10f、高周波数時間包絡形状決定部13a、時間包絡修正部14a、高周波数信号生成部10g、復号/逆量子化部10h、周波数包絡調整部10i、及び合成フィルタバンク部170cを備える。 [25th Embodiment]
FIG. 143 is a diagram showing the configuration of the audio decoding device 340 according to the 25th embodiment. The communication device of the voice decoding device 340 receives the multiplexed coding sequence output from the following voice coding device 440, and further outputs the decoded voice signal to the outside. As shown in FIG. 143, the voice decoding device 340 functionally has a coded sequence demultiplexing unit 170a, a switch group 170b, a core decoding unit 10b, an analysis filter bank unit 10c, a coded sequence analysis unit 13c, and a low frequency. Frequency time envelope shape determination unit 10e, low frequency time envelope correction unit 10f, high frequency time envelope shape determination unit 13a, time envelope correction unit 14a, high frequency signal generation unit 10g, decoding / inverse quantization unit 10h, frequency envelope adjustment unit It is provided with 10i and a synthetic filter bank unit 170c.

図144は、第25の実施形態に係る音声復号装置の動作を示すフローチャートである。なお、ステップS170-2およびS170-3の処理を行う順番については、高周波数信号の時間包絡形状の決定および帯域拡張部分の復号・逆量子化の処理の前であればよく、図144のフローチャートの順番に制限されない。 FIG. 144 is a flowchart showing the operation of the voice decoding device according to the 25th embodiment. The order in which the processes of steps S170-2 and S170-3 are performed may be before the determination of the time envelope shape of the high frequency signal and the decoding / dequantization process of the band extension portion, and the flowchart of FIG. 144 The order of is not limited.

なお、本変形例に係る音声復号装置340の低周波数時間包絡形状決定部10eに対して、本発明の第1の実施形態の音声復号装置の第1、第2、及び第3の変形例が適用できることは明白である。 The first, second, and third modifications of the voice decoding device according to the first embodiment of the present invention are provided with respect to the low frequency time envelope shape determining unit 10e of the voice decoding device 340 according to the present modification. It is clear that it is applicable.

さらには、本変形例に係る音声復号装置340の高周波数時間包絡形状決定部13aに対して、本発明の第4の実施形態の音声復号装置の第1、第2、及び第3の変形例、本発明第5の実施形態の音声復号装置の第1の変形例、及び本発明第7の実施形態の音声復号装置の第1の変形例が適用できることは明白である。 Further, with respect to the high frequency time entanglement shape determining unit 13a of the voice decoding device 340 according to the present modification, the first, second, and third modifications of the voice decoding device according to the fourth embodiment of the present invention. It is clear that the first modification of the voice decoding device according to the fifth embodiment of the present invention and the first modification of the voice decoding device according to the seventh embodiment of the present invention can be applied.

図145は、第25の実施形態に係る音声符号化装置440の構成を示す図である。音声符号化装置440の通信装置は、符号化の対象となる音声信号を外部から受信し、更に、符号化された符号化系列を外部に出力する。音声符号化装置440は、図145に示すように、機能的には、高周波数信号生成制御情報符号化部270a、ダウンサンプリング部20a、コア符号化部20b、分析フィルタバンク部20c及び20c1、制御パラメータ符号化部20d、包絡算出部20e、量子化/符号化部20f、コア復号信号生成部20i、サブバンド信号パワー算出部20j及び24b、擬似高周波数信号生成部410b、時間包絡情報符号化部410a、並びに、符号化系列多重化部270cを備える。 FIG. 145 is a diagram showing the configuration of the voice coding device 440 according to the 25th embodiment. The communication device of the voice coding device 440 receives the voice signal to be coded from the outside, and further outputs the coded coding sequence to the outside. As shown in FIG. 145, the voice coding device 440 functionally includes a high frequency signal generation control information coding unit 270a, a downsampling unit 20a, a core coding unit 20b, an analysis filter bank unit 20c and 20c1, and control. Parameter coding unit 20d, inclusion calculation unit 20e, quantization / coding unit 20f, core decoding signal generation unit 20i, subband signal power calculation unit 20j and 24b, pseudo high frequency signal generation unit 410b, time inclusion information coding unit It includes 410a and a coded sequence multiplexing unit 270c.

図146は、第25の実施形態に係る音声符号化装置440の動作を示すフローチャートである。なお、本実施形態に係る音声符号化装置440に対して、本発明の第7の実施形態の音声符号化装置の第1の変形例が適用できることは明白である。また、高周波数信号の時間包絡情報は、低周波数信号の時間包絡情報を元に生成可能である。 FIG. 146 is a flowchart showing the operation of the voice coding device 440 according to the 25th embodiment. It is clear that the first modification of the voice coding device of the seventh embodiment of the present invention can be applied to the voice coding device 440 according to the present embodiment. Further, the time envelope information of the high frequency signal can be generated based on the time envelope information of the low frequency signal.

［第25の実施形態の音声復号装置の第1の変形例］
図355は、第25の実施形態に係る音声復号装置の第1の変形例340Aの構成を示す図である。 [First variant of the audio decoding device of the 25th embodiment]
FIG. 355 is a diagram showing a configuration of a first modification 340A of the audio decoding device according to the 25th embodiment.

図356は、第25の実施形態に係る音声復号装置の第1の変形例340Aの動作を示すフローチャートである。 FIG. 356 is a flowchart showing the operation of the first modification 340A of the audio decoding device according to the 25th embodiment.

本変形例と第25の実施形態に係る音声復号装置340との相違点は、低周波数時間包絡形状決定部10eC（10e、10eA、及び10eBでもよいことは明白）、時間包絡修正部14aにかえて、低周波数時間包絡形状決定部16b、時間包絡修正部17aを具備する点である。 The difference between this modification and the audio decoding device 340 according to the 25th embodiment is that the low frequency time envelope shape determination unit 10eC (clearly, 10e, 10eA, and 10eB may be used) and the time envelope correction unit 14a are replaced. Therefore, it is provided with a low frequency time envelope shape determining unit 16b and a time envelope correction unit 17a.

［第25の実施形態の音声復号装置の第2の変形例］
図357は、第25の実施形態に係る音声復号装置の第2の変形例340Bの構成を示す図である。 [Second variant of the audio decoding device of the 25th embodiment]
FIG. 357 is a diagram showing a configuration of a second modification 340B of the audio decoding device according to the 25th embodiment.

図358は、第25の実施形態に係る音声復号装置の第2の変形例340Bの動作を示すフローチャートである。 FIG. 358 is a flowchart showing the operation of the second modification 340B of the audio decoding device according to the 25th embodiment.

本変形例と第25の実施形態に係る音声復号装置340との相違点は、高周波数時間包絡形状決定部13aC（13a、13aA、及び13aBでもよいことは明白）、低周波数時間包絡修正部10fにかえて、高周波数時間包絡形状決定部16d、低周波数時間包絡修正部16eを具備する点である。 The difference between this modification and the audio decoding device 340 according to the 25th embodiment is the high frequency time envelope shape determining unit 13aC (it is clear that 13a, 13aA, and 13aB may be used), and the low frequency time envelope correction unit 10f. Instead, it is provided with a high frequency time envelope shape determining unit 16d and a low frequency time envelope correction unit 16e.

［第25の実施形態の音声復号装置の第3の変形例］
図359は、第25の実施形態に係る音声復号装置の第3の変形例340Cの構成を示す図である。 [Third variant of the audio decoding device of the 25th embodiment]
FIG. 359 is a diagram showing a configuration of a third modification 340C of the audio decoding device according to the 25th embodiment.

図360は、第25の実施形態に係る音声復号装置の第3の変形例340Cの動作を示すフローチャートである。 FIG. 360 is a flowchart showing the operation of the third modification 340C of the audio decoding device according to the 25th embodiment.

［第25の実施形態の音声復号装置の第4の変形例］
図361は、第25の実施形態に係る音声復号装置の第4の変形例340Dの構成を示す図である。 [Fourth variant of the audio decoding device of the 25th embodiment]
FIG. 361 is a diagram showing a configuration of a fourth modification 340D of the audio decoding device according to the 25th embodiment.

図362は、第25の実施形態に係る音声復号装置の第4の変形例340Dの動作を示すフローチャートである。 FIG. 362 is a flowchart showing the operation of the fourth modification 340D of the audio decoding device according to the 25th embodiment.

本変形例と前記第25の実施形態に係る音声復号装置340との相違点は、低周波数時間包絡形状決定部10e及び高周波数時間包絡形状決定部13aにかえて時間包絡形状決定部16fを具備する点である。 The difference between this modification and the voice decoding device 340 according to the 25th embodiment is that the time envelope shape determination unit 16f is provided instead of the low frequency time envelope shape determination unit 10e and the high frequency time envelope shape determination unit 13a. It is a point to do.

［第26の実施形態］
図147は、第26の実施形態に係る音声復号装置350の構成を示す図である。音声復号装置350の通信装置は、下記音声符号化装置450から出力される多重化された符号化系列を受信し、更に、復号した音声信号を外部に出力する。音声復号装置350は、図147に示すように、機能的には、符号化系列逆多重化部170a、スイッチ群170b、コア復号部10b、分析フィルタバンク部10c、符号化系列解析部13c、低周波数時間包絡形状決定部10e、低周波数時間包絡修正部10f、高周波数時間包絡形状決定部13a、高周波数信号生成部10g、復号/逆量子化部10h、周波数包絡調整部10i、時間包絡修正部15a、及び合成フィルタバンク部170cを備える。 [26th Embodiment]
FIG. 147 is a diagram showing the configuration of the audio decoding device 350 according to the 26th embodiment. The communication device of the voice decoding device 350 receives the multiplexed coding sequence output from the following voice coding device 450, and further outputs the decoded voice signal to the outside. As shown in FIG. 147, the voice decoding apparatus 350 functionally has a coded sequence demultiplexing unit 170a, a switch group 170b, a core decoding unit 10b, an analysis filter bank unit 10c, a coded sequence analysis unit 13c, and a low frequency. Frequency time envelope shape determination unit 10e, low frequency time envelope correction unit 10f, high frequency time envelope shape determination unit 13a, high frequency signal generation unit 10g, decoding / dequantization unit 10h, frequency envelope adjustment unit 10i, time envelope correction unit It includes 15a and a synthetic filter bank unit 170c.

図148は、第26の実施形態に係る音声復号装置の動作を示すフローチャートである。なお、ステップS170-2およびS170-3の処理を行う順番については、高周波数信号の時間包絡形状の決定および帯域拡張部分の復号・逆量子化の処理の前であればよく、図148のフローチャートの順番に制限されない。 FIG. 148 is a flowchart showing the operation of the voice decoding device according to the 26th embodiment. The order in which the processes of steps S170-2 and S170-3 are performed may be before the determination of the time envelope shape of the high frequency signal and the decoding / dequantization process of the band extension portion, and the flowchart of FIG. 148. The order of is not limited.

なお、本実施形態に係る音声復号装置350の低周波数時間包絡形状決定部10eに対して、本発明の第1の実施形態の音声復号装置の第1、第2、及び第3の変形例が適用できることは明白である。 It should be noted that the first, second, and third modifications of the voice decoding device of the first embodiment of the present invention are provided with respect to the low frequency time envelope shape determining unit 10e of the voice decoding device 350 according to the present invention. It is clear that it is applicable.

さらには、本実施形態に係る音声復号装置350の高周波数時間包絡形状決定部13aに対して、本発明の第4の実施形態の音声復号装置の第1、第2、及び第3の変形例、本発明第5の実施形態の音声復号装置の第1の変形例、及び本発明第7の実施形態の音声復号装置の第1の変形例が適用できることは明白である。 Further, the first, second, and third modifications of the voice decoding device according to the fourth embodiment of the present invention are provided with respect to the high frequency time wrapping shape determining unit 13a of the voice decoding device 350 according to the present invention. It is clear that the first modification of the voice decoding device according to the fifth embodiment of the present invention and the first modification of the voice decoding device according to the seventh embodiment of the present invention can be applied.

図149は、第26の実施形態に係る音声符号化装置450の構成を示す図である。音声符号化装置450の通信装置は、符号化の対象となる音声信号を外部から受信し、更に、符号化された符号化系列を外部に出力する。音声符号化装置450は、図149に示すように、機能的には、高周波数信号生成制御情報符号化部270a、ダウンサンプリング部20a、コア符号化部20b、分析フィルタバンク部20c及び20c1、制御パラメータ符号化部20d、包絡算出部270d、量子化/符号化部20f、コア復号信号生成部20i、サブバンド信号パワー算出部20j及び24b、擬似高周波数信号生成部410b、時間包絡情報符号化部420a、並びに、符号化系列多重化部270cを備える。 FIG. 149 is a diagram showing the configuration of the voice coding device 450 according to the 26th embodiment. The communication device of the voice coding device 450 receives the voice signal to be coded from the outside, and further outputs the coded coding sequence to the outside. As shown in FIG. 149, the voice coding device 450 functionally includes a high frequency signal generation control information coding unit 270a, a downsampling unit 20a, a core coding unit 20b, an analysis filter bank unit 20c and 20c1, and control. Parameter coding unit 20d, inclusion calculation unit 270d, quantization / coding unit 20f, core decoding signal generation unit 20i, subband signal power calculation units 20j and 24b, pseudo high frequency signal generation unit 410b, time inclusion information coding unit It includes 420a and a coded sequence multiplexing unit 270c.

図150は、第26の実施形態に係る音声符号化装置450の動作を示すフローチャートである。なお、本実施形態に係る音声符号化装置450に対して、本発明の第7の実施形態の音声符号化装置の第1の変形例が適用できることは明白である。また、高周波数信号の時間包絡情報は、低周波数信号の時間包絡情報を元に生成可能である。 FIG. 150 is a flowchart showing the operation of the voice coding device 450 according to the 26th embodiment. It is clear that the first modification of the voice coding device of the seventh embodiment of the present invention can be applied to the voice coding device 450 according to the present embodiment. Further, the time envelope information of the high frequency signal can be generated based on the time envelope information of the low frequency signal.

［第26の実施形態の音声復号装置の第1の変形例］
図151は、第26の実施形態の第1の変形例に係る音声復号装置350Aの構成を示す図である。 [First modification of the audio decoding device of the 26th embodiment]
FIG. 151 is a diagram showing a configuration of an audio decoding device 350A according to a first modification of the 26th embodiment.

図152は、第26の実施形態の第1の変形例に係る音声復号装置350Aの動作を示すフローチャートである。なお、ステップS170-2およびS170-3の処理を行う順番については、高周波数信号の時間包絡形状の決定および帯域拡張部分の復号・逆量子化の処理の前であればよく、図152のフローチャートの順番に制限されない。 FIG. 152 is a flowchart showing the operation of the audio decoding device 350A according to the first modification of the 26th embodiment. The order in which the processes of steps S170-2 and S170-3 are performed may be before the determination of the time envelope shape of the high frequency signal and the decoding / dequantization of the band extension portion, and the flowchart of FIG. The order of is not limited.

前記第26の実施形態に係る音声復号装置350との相違点は、時間包絡修正部15aに代えて、時間包絡修正部15aAを用いている点である。 The difference from the voice decoding device 350 according to the 26th embodiment is that the time envelope correction unit 15aA is used instead of the time envelope correction unit 15a.

なお、本変形例に係る音声復号装置350Aの低周波数時間包絡形状決定部10eに対して、本発明の第1の実施形態の音声復号装置の第1、第2、及び第3の変形例が適用できることは明白である。 It should be noted that the first, second, and third modifications of the voice decoding device according to the first embodiment of the present invention are provided with respect to the low frequency time envelope shape determining unit 10e of the voice decoding device 350A according to the present modification. It is clear that it is applicable.

さらには、本変形例に係る音声復号装置350Aの高周波数時間包絡形状決定部13aに対して、本発明の第4の実施形態の音声復号装置の第1、第2、及び第3の変形例、本発明第5の実施形態の音声復号装置の第1の変形例、及び本発明第7の実施形態の音声復号装置の第1の変形例が適用できることは明白である。 Further, with respect to the high frequency time wrapping shape determining unit 13a of the voice decoding device 350A according to the present modification, the first, second, and third modifications of the voice decoding device according to the fourth embodiment of the present invention. It is clear that the first modification of the voice decoding device according to the fifth embodiment of the present invention and the first modification of the voice decoding device according to the seventh embodiment of the present invention can be applied.

［第26の実施形態の音声復号装置の第2の変形例］
図363は、第26の実施形態に係る音声復号装置の第2の変形例350Bの構成を示す図である。 [Second variant of the audio decoding device of the 26th embodiment]
FIG. 363 is a diagram showing a configuration of a second modification 350B of the audio decoding device according to the 26th embodiment.

図364は、第26の実施形態に係る音声復号装置の第2の変形例350Bの動作を示すフローチャートである。 FIG. 364 is a flowchart showing the operation of the second modification 350B of the audio decoding device according to the 26th embodiment.

本変形例と第26の実施形態に係る音声復号装置350との相違点は、低周波数時間包絡形状決定部10eC（10e、10eA、及び10eBでもよいことは明白）、時間包絡修正部15aにかえて、低周波数時間包絡形状決定部16b、時間包絡修正部18aを具備する点である。 The difference between this modification and the audio decoding device 350 according to the 26th embodiment is that the low frequency time envelope shape determination unit 10eC (clearly, 10e, 10eA, and 10eB may be used) and the time envelope correction unit 15a are replaced. Therefore, it is provided with a low frequency time envelope shape determining unit 16b and a time envelope correction unit 18a.

［第26の実施形態の音声復号装置の第3の変形例］
図365は、第26の実施形態に係る音声復号装置の第3の変形例350Cの構成を示す図である。 [Third variant of the audio decoding device of the 26th embodiment]
FIG. 365 is a diagram showing a configuration of a third modification 350C of the audio decoding device according to the 26th embodiment.

図366は、第26の実施形態に係る音声復号装置の第3の変形例350Cの動作を示すフローチャートである。 FIG. 366 is a flowchart showing the operation of the third modification 350C of the audio decoding device according to the 26th embodiment.

本変形例と第26の実施形態に係る音声復号装置350との相違点は、高周波数時間包絡形状決定部13aC（13a、13aA、及び13aBでもよいことは明白）、低周波数時間包絡修正部10fにかえて、高周波数時間包絡形状決定部16d、低周波数時間包絡修正部16eを具備する点である。 The difference between this modification and the audio decoding device 350 according to the 26th embodiment is the high frequency time envelope shape determination unit 13aC (it is clear that 13a, 13aA, and 13aB may be used), and the low frequency time envelope correction unit 10f. Instead, it is provided with a high frequency time envelope shape determining unit 16d and a low frequency time envelope correction unit 16e.

［第26の実施形態の音声復号装置の第4の変形例］
図367は、第26の実施形態に係る音声復号装置の第4の変形例350Dの構成を示す図である。 [Fourth variant of the audio decoding device of the 26th embodiment]
FIG. 367 is a diagram showing a configuration of a fourth modification 350D of the audio decoding device according to the 26th embodiment.

図368は、第26の実施形態に係る音声復号装置の第4の変形例350Dの動作を示すフローチャートである。 FIG. 368 is a flowchart showing the operation of the fourth modification 350D of the audio decoding device according to the 26th embodiment.

［第26の実施形態の音声復号装置の第5の変形例］
図369は、第26の実施形態に係る音声復号装置の第5の変形例350Eの構成を示す図である。 [Fifth variant of the audio decoding device of the 26th embodiment]
FIG. 369 is a diagram showing a configuration of a fifth modification 350E of the audio decoding device according to the 26th embodiment.

図370は、第26の実施形態に係る音声復号装置の第5の変形例350Eの動作を示すフローチャートである。 FIG. 370 is a flowchart showing the operation of the fifth modification 350E of the audio decoding device according to the 26th embodiment.

本変形例と前記第26の実施形態に係る音声復号装置350との相違点は、低周波数時間包絡形状決定部10e及び高周波数時間包絡形状決定部13aにかえて時間包絡形状決定部16fを具備する点である。 The difference between this modification and the voice decoding device 350 according to the 26th embodiment is that the time envelope shape determination unit 16f is provided instead of the low frequency time envelope shape determination unit 10e and the high frequency time envelope shape determination unit 13a. It is a point to do.

［第26の実施形態の音声復号装置の第6の変形例］
図371は、第26の実施形態に係る音声復号装置の第6の変形例350Fの構成を示す図である。 [Sixth variant of the audio decoding device of the 26th embodiment]
FIG. 371 is a diagram showing a configuration of a sixth modification 350F of the audio decoding device according to the 26th embodiment.

図372は、第26の実施形態に係る音声復号装置の第6の変形例350Fの動作を示すフローチャートである。 FIG. 372 is a flowchart showing the operation of the sixth modification 350F of the audio decoding device according to the 26th embodiment.

本変形例と第26の実施形態の第1の変形例に係る音声復号装置350Aとの相違点は、低周波数時間包絡形状決定部10eC（10e、10eA、及び10eBでもよいことは明白）、時間包絡修正部15aAにかえて、低周波数時間包絡形状決定部16b、時間包絡修正部18aAを具備する点である。 The difference between this modification and the voice decoding device 350A according to the first modification of the 26th embodiment is the low frequency time envelope shape determination unit 10eC (clearly, 10e, 10eA, and 10eB may be used), time. Instead of the envelope correction unit 15aA, the low frequency time envelope shape determination unit 16b and the time envelope correction unit 18aA are provided.

［第26の実施形態の音声復号装置の第7の変形例］
図373は、第26の実施形態に係る音声復号装置の第7の変形例350Gの構成を示す図である。 [7th variant of the audio decoding device of the 26th embodiment]
FIG. 373 is a diagram showing a configuration of a seventh modification 350G of the audio decoding device according to the 26th embodiment.

図374は、第26の実施形態に係る音声復号装置の第7の変形例350Gの動作を示すフローチャートである。 FIG. 374 is a flowchart showing the operation of the seventh modification 350G of the voice decoding device according to the 26th embodiment.

本変形例と第26の実施形態の第1の変形例に係る音声復号装置350Aとの相違点は、高周波数時間包絡形状決定部13aC（13a、13aA、及び13aBでもよいことは明白）、低周波数時間包絡修正部10fにかえて、高周波数時間包絡形状決定部16d、低周波数時間包絡修正部16eを具備する点である。 The difference between this modification and the voice decoding device 350A according to the first modification of the 26th embodiment is that the high frequency time envelope shape determining unit 13aC (it is clear that 13a, 13aA, and 13aB may be used) and low. Instead of the frequency time envelope correction unit 10f, the high frequency time envelope shape determination unit 16d and the low frequency time envelope correction unit 16e are provided.

［第26の実施形態の音声復号装置の第8の変形例］
図375は、第26の実施形態に係る音声復号装置の第8の変形例350Hの構成を示す図である。 [Eighth variant of the audio decoding device of the 26th embodiment]
FIG. 375 is a diagram showing a configuration of an eighth modification 350H of the audio decoding device according to the 26th embodiment.

図376は、第26の実施形態に係る音声復号装置の第8の変形例350Hの動作を示すフローチャートである。 FIG. 376 is a flowchart showing the operation of the eighth modification 350H of the audio decoding device according to the 26th embodiment.

［第26の実施形態の音声復号装置の第9の変形例］
図377は、第26の実施形態に係る音声復号装置の第9の変形例350Iの構成を示す図である。 [Ninth variant of the audio decoding device of the 26th embodiment]
FIG. 377 is a diagram showing a configuration of a ninth modification 350I of the audio decoding device according to the 26th embodiment.

図378は、第26の実施形態に係る音声復号装置の第9の変形例350Iの動作を示すフローチャートである。 FIG. 378 is a flowchart showing the operation of the ninth modification 350I of the audio decoding device according to the 26th embodiment.

本変形例と前記第26の実施形態の第1の変形例に係る音声復号装置350Aとの相違点は、低周波数時間包絡形状決定部10e及び高周波数時間包絡形状決定部13aにかえて時間包絡形状決定部16fを具備する点である。 The difference between this modification and the voice decoding device 350A according to the first modification of the 26th embodiment is that the time envelopment is replaced with the low frequency time envelope shape determination unit 10e and the high frequency time envelope shape determination unit 13a. The point is that the shape determining portion 16f is provided.

［第27の実施形態の音声復号装置］
図379は、第27の実施形態に係る音声復号装置360の構成を示す図である。 [Audio Decoding Device of the 27th Embodiment]
FIG. 379 is a diagram showing a configuration of the audio decoding device 360 according to the 27th embodiment.

図380は、第27の実施形態に係る音声復号装置360の動作を示すフローチャートである。 FIG. 380 is a flowchart showing the operation of the voice decoding device 360 according to the 27th embodiment.

時間包絡修正部360aは、低周波数時間包絡形状決定部10eC（10e、10eA、10eBでも良いことは明白）から受け取る時間包絡形状と、高周波数時間包絡形状決定部13aC（13a、13aA、13aBでもよいことは明白）から受け取る時間包絡形状のうち少なくとも一つ以上に基づいて、分析フィルタバンク部10cから出力される低周波数信号の複数のサブバンド信号と周波数包絡調整部10iから出力される高周波数信号の複数のサブバンド信号の時間包絡の形状を修正する(S360-1)。 The time wrapping correction unit 360a may be a time wrapping shape received from the low frequency time wrapping shape determining unit 10eC (clearly 10e, 10eA, 10eB may be used) and a high frequency time wrapping shape determining unit 13aC (13a, 13aA, 13aB). It is clear) that there are multiple subband signals of the low frequency signal output from the analysis filter bank unit 10c and the high frequency signal output from the frequency wrapping adjustment unit 10i based on at least one of the time wrapping shapes received from. Correct the shape of the time wrapping of multiple subband signals in (S360-1).

周波数包絡調整部10iから出力される高周波数信号の複数のサブバンド信号の時間包絡形状の修正では、周波数包絡調整部10iより分離した形で出力された高周波数信号を構成する成分のうち少なくとも一つ以上の時間包絡形状を修正してもよい。 In the correction of the time envelope shape of a plurality of subband signals of the high frequency signal output from the frequency envelope adjustment unit 10i, at least one of the components constituting the high frequency signal output in a form separated from the frequency envelope adjustment unit 10i is used. One or more time envelope shapes may be modified.

低周波数時間包絡形状決定部10eC（10e、10eA、10eBでも良いことは明白）から受け取る時間包絡形状と高周波数時間包絡形状決定部13aC（13a、13aA、13aBでもよいことは明白）から受け取る時間包絡形状は同一であってもよく、異なってもよい。 Time Envelope Shape Received from Low Frequency Time Envelope Shaper 10eC (Clearly 10e, 10eA, 10eB) and Time Envelope Received from High Frequency Time Envelope Shaper 13aC (Clearly 13a, 13aA, 13aB) The shapes may be the same or different.

［第27の実施形態の音声復号装置の第1の変形例］
図381は、第27の実施形態に係る音声復号装置の第1の変形例360Aの構成を示す図である。 [First modification of the audio decoding device according to the 27th embodiment]
FIG. 381 is a diagram showing a configuration of a first modification 360A of the audio decoding device according to the 27th embodiment.

図382は、第27の実施形態に係る音声復号装置の第1の変形例360Aの動作を示すフローチャートである。 FIG. 382 is a flowchart showing the operation of the first modification 360A of the audio decoding device according to the 27th embodiment.

本変形例と前記第27の実施形態に係る音声復号装置360との相違点は、低周波数時間包絡形状決定部10eC（10e、10eA、10eBでも良いことは明白）及び高周波数時間包絡形状決定部13aC（13a、13aA、13aBでもよいことは明白）にかえて時間包絡形状決定部360bを具備する点である。 The difference between this modification and the audio decoding device 360 according to the 27th embodiment is the low frequency time envelope shape determination unit 10eC (clearly, 10e, 10eA, 10eB may be used) and the high frequency time envelope shape determination unit. Instead of 13aC (it is clear that 13a, 13aA, 13aB may be used), a time envelope shape determining unit 360b is provided.

時間包絡決定部360bは、符号化系列逆多重化部10aからの低周波時間包絡形状に関する情報、コア復号部10bからの低周波数信号、分析フィルタバンク部10cからの低周波数信号の複数のサブバンド信号、符号化系列解析部13cからの高周波時間包絡形状に関する情報のうち少なくとも一つに基づいて時間包絡形状を決定する(S360-2)。 The time entrapment determination unit 360b contains information on the low frequency time encapsulation shape from the coded sequence demultiplexing unit 10a, a low frequency signal from the core decoding unit 10b, and a plurality of subbands of the low frequency signal from the analysis filter bank unit 10c. The time entrainment shape is determined based on at least one of the information on the high frequency time entrainment shape from the signal and coded sequence analysis unit 13c (S360-2).

決定される時間包絡形状は、低周波数信号と高周波数信号のそれぞれに対して異なってもよく、また低周波数信号と高周波数信号に対して同一で単一の時間包絡形状であってもよい。 The determined time envelope shape may be different for each of the low frequency signal and the high frequency signal, or may be the same and a single time envelope shape for the low frequency signal and the high frequency signal.

時間包絡修正部360aAは、前記時間包絡形状決定部360bから受け取る時間包絡形状に基づいて、分析フィルタバンク部10cから出力される低周波数信号の複数のサブバンド信号と周波数包絡調整部10iから出力される高周波数信号の複数のサブバンド信号の時間包絡の形状を修正する(S360-1a)。 The time envelope correction unit 360aA is output from a plurality of subband signals of the low frequency signal output from the analysis filter bank unit 10c and the frequency envelope adjustment unit 10i based on the time envelope shape received from the time envelope shape determination unit 360b. Correct the shape of the time envelope of multiple subband signals of a high frequency signal (S360-1a).

［第28の実施形態の音声復号装置］
図383は、第28の実施形態に係る音声復号装置370の構成を示す図である。 [Audio Decoding Device of the 28th Embodiment]
FIG. 383 is a diagram showing the configuration of the audio decoding device 370 according to the 28th embodiment.

図384は、第28の実施形態に係る音声復号装置370の動作を示すフローチャートである。 FIG. 384 is a flowchart showing the operation of the voice decoding device 370 according to the 28th embodiment.

時間包絡修正部370aは、低周波数時間包絡形状決定部10eC（10e、10eA、10eBでも良いことは明白）から受け取る時間包絡形状と、高周波数時間包絡形状決定部13aC（13a、13aA、13aBでもよいことは明白）から受け取る時間包絡形状のうち少なくとも一つ以上に基づいて、分析フィルタバンク部10cから出力される低周波数信号の複数のサブバンド信号の時間包絡の形状を修正し、前記高周波数信号生成情報に基づき高周波数信号を生成すると判断された場合、周波数包絡調整部10iから出力される高周波数信号の複数のサブバンド信号の時間包絡の形状も修正する(S370-1)。 The time wrapping correction unit 370a may be a time wrapping shape received from the low frequency time wrapping shape determining unit 10eC (clearly 10e, 10eA, 10eB may be used) and a high frequency time wrapping shape determining unit 13aC (13a, 13aA, 13aB). It is clear that the time-wrapping shape of the plurality of subband signals of the low-frequency signal output from the analysis filter bank unit 10c is modified based on at least one of the time-wrapping shapes received from the high-frequency signal. When it is determined that a high frequency signal is generated based on the generated information, the shape of the time wrapping of a plurality of subband signals of the high frequency signal output from the frequency wrapping adjustment unit 10i is also corrected (S370-1).

［第28の実施形態の音声復号装置の第1の変形例］
図385は、第28の実施形態に係る音声復号装置の第1の変形例370Aの構成を示す図である。 [First modification of the audio decoding device according to the 28th embodiment]
FIG. 385 is a diagram showing a configuration of a first modification 370A of the audio decoding device according to the 28th embodiment.

図386は、第28の実施形態に係る音声復号装置の第1の変形例370Aの動作を示すフローチャートである。 FIG. 386 is a flowchart showing the operation of the first modification 370A of the audio decoding device according to the 28th embodiment.

本変形例と前記第28の実施形態に係る音声復号装置370との相違点は、低周波数時間包絡形状決定部10eC（10e、10eA、10eBでも良いことは明白）及び高周波数時間包絡形状決定部13aC（13a、13aA、13aBでもよいことは明白）にかえて時間包絡形状決定部360bを具備する点である。 The difference between this modification and the audio decoding device 370 according to the 28th embodiment is the low frequency time envelope shape determination unit 10eC (clearly, 10e, 10eA, 10eB may be used) and the high frequency time envelope shape determination unit. Instead of 13aC (it is clear that 13a, 13aA, 13aB may be used), a time envelope shape determining unit 360b is provided.

時間包絡修正部370aAは、前記時間包絡形状決定部360bから受け取る時間包絡形状に基づいて、分析フィルタバンク部10cから出力される低周波数信号の複数のサブバンド信号の時間包絡の形状を修正し、前記高周波数信号生成情報に基づき高周波数信号を生成すると判断された場合、周波数包絡調整部10iから出力される高周波数信号の複数のサブバンド信号の時間包絡の形状を修正する(S360-1a)。 The time wrapping correction unit 370aA corrects the time wrapping shape of a plurality of subband signals of the low frequency signal output from the analysis filter bank unit 10c based on the time wrapping shape received from the time wrapping shape determining unit 360b. When it is determined that a high frequency signal is generated based on the high frequency signal generation information, the shape of the time wrapping of a plurality of subband signals of the high frequency signal output from the frequency wrapping adjustment unit 10i is corrected (S360-1a). ..

［第29の実施形態の音声復号装置］
図387は、第29の実施形態に係る音声復号装置380の構成を示す図である。 [Audio Decoding Device of the 29th Embodiment]
FIG. 387 is a diagram showing the configuration of the audio decoding device 380 according to the 29th embodiment.

図388は、第29の実施形態に係る音声復号装置380の動作を示すフローチャートである。 FIG. 388 is a flowchart showing the operation of the voice decoding device 380 according to the 29th embodiment.

時間包絡修正部380aは、低周波数時間包絡形状決定部100cで決定される時間包絡形状と、高周波数時間包絡形状決定部110bで決定される時間包絡形状のうち少なくとも一つ以上に基づいて、低周波数復号部100bから出力される低周波数信号と高周波数復号部100eから出力される高周波数信号の時間包絡の形状を修正する(S380-1)。 The time envelope correction unit 380a is low based on at least one of the time envelope shape determined by the low frequency time envelope shape determination unit 100c and the time envelope shape determined by the high frequency time envelope shape determination unit 110b. The shape of the time envelope of the low frequency signal output from the frequency decoding unit 100b and the high frequency signal output from the high frequency decoding unit 100e is corrected (S380-1).

低周波数時間包絡形状決定部100cで決定される時間包絡形状と高周波数時間包絡形状決定部110bで決定される時間包絡形状は同一であってもよく、異なってもよい。 The time envelope shape determined by the low frequency time envelope shape determination unit 100c and the time envelope shape determined by the high frequency time envelope shape determination unit 110b may be the same or different.

［第29の実施形態の音声復号装置の第1の変形例］
図389は、第29の実施形態に係る音声復号装置の第1の変形例380Aの構成を示す図である。 [First variant of the audio decoding device of the 29th embodiment]
FIG. 389 is a diagram showing a configuration of a first modification 380A of the audio decoding device according to the 29th embodiment.

図390は、第29の実施形態に係る音声復号装置の第1の変形例380Aの動作を示すフローチャートである。 FIG. 390 is a flowchart showing the operation of the first modification 380A of the audio decoding device according to the 29th embodiment.

本変形例と前記第29の実施形態に係る音声復号装置380との相違点は、低周波数時間包絡形状決定部100c及び高周波数時間包絡形状決定部110bにかえて時間包絡形状決定部120fを、時間包絡修正部380aにかえて時間包絡修正部380aAを具備する点である。 The difference between this modification and the voice decoding device 380 according to the 29th embodiment is that the time envelope shape determination unit 120f is replaced with the low frequency time envelope shape determination unit 100c and the high frequency time envelope shape determination unit 110b. The point is that the time envelope correction unit 380aA is provided instead of the time envelope correction unit 380a.

時間包絡修正部380aAは、前記時間包絡形状決定部120fにて決定される時間包絡形状に基づいて、低周波数復号部100bから出力される低周波数信号と高周波数復号部100eから出力される高周波数信号の時間包絡の形状を修正する(S380-1a)。 The time envelope correction unit 380aA has a low frequency signal output from the low frequency decoding unit 100b and a high frequency output from the high frequency decoding unit 100e based on the time envelope shape determined by the time envelope shape determination unit 120f. Correct the shape of the time envelope of the signal (S380-1a).

［第30の実施形態の音声復号装置］
図391は、第30の実施形態に係る音声復号装置390の構成を示す図である。 [Audio Decoding Device of the 30th Embodiment]
FIG. 391 is a diagram showing the configuration of the audio decoding device 390 according to the thirtieth embodiment.

図392は、第30の実施形態に係る音声復号装置390の動作を示すフローチャートである。 FIG. 392 is a flowchart showing the operation of the voice decoding device 390 according to the thirtieth embodiment.

本変形例においては、時間包絡修正部380aAは、時間包絡形状決定部120fにて決定される時間包絡形状に基づいて、低周波数復号部100bから出力される低周波数信号の時間包絡の形状を修正し、前記高周波数信号生成情報に基づき高周波数信号を生成すると判断された場合、高周波数復号部100eから出力される高周波数信号の時間包絡の形状も修正する(S380-1a)。 In this modification, the time wrapping correction unit 380aA corrects the time wrapping shape of the low frequency signal output from the low frequency decoding unit 100b based on the time wrapping shape determined by the time wrapping shape determining unit 120f. Then, when it is determined that the high frequency signal is generated based on the high frequency signal generation information, the shape of the time wrapping of the high frequency signal output from the high frequency decoding unit 100e is also corrected (S380-1a).

出願人は、上記の目的を達成するために、以下の第１〜第４の態様に係る音声復号装置を発明した。 The applicant has invented the audio decoding device according to the following first to fourth aspects in order to achieve the above object.

第１の態様に係る音声復号装置は、符号化された音声信号を復号して音声信号を出力する音声復号装置であって、前記符号化された音声信号を含む符号化系列を解析する符号化系列解析部と、前記符号化系列解析部から前記符号化された音声信号を含む符号化系列を受け取り、復号して音声信号を得る音声復号部と、前記符号化系列解析部及び前記音声復号部のうち少なくとも一つより情報を受け取り、当該情報に基づいて、復号された音声信号の時間包絡形状を決定する時間包絡形状決定部と、前記時間包絡形状決定部にて決定された時間包絡形状に基づき前記復号された音声信号の時間包絡形状を修正し出力する時間包絡修正部と、を備えることを特徴とする。 The voice decoding device according to the first aspect is a voice decoding device that decodes a coded voice signal and outputs a voice signal, and is a coding device that analyzes a coding sequence including the coded voice signal. A sequence analysis unit, a voice decoding unit that receives a coded sequence including the coded audio signal from the coded sequence analysis unit and decodes the coded sequence to obtain an audio signal, and the coded sequence analysis unit and the voice decoding unit. A time-wrapping shape determining unit that receives information from at least one of them and determines the time-wrapping shape of the decoded audio signal based on the information, and a time-wrapping shape determined by the time-wrapping shape determining unit. Based on this, it is characterized by including a time-wrapping correction unit that corrects and outputs the time-wrapping shape of the decoded audio signal.

第２の態様に係る音声復号装置は、符号化された音声信号を復号して音声信号を出力する音声復号装置であって、前記符号化された音声信号を含む符号化系列を、少なくとも、符号化された前記音声信号の低周波数信号の情報を含む符号化系列と、符号化された前記音声信号の高周波数信号の情報を含む符号化系列とに分割する符号化系列逆多重化部と、前記符号化系列逆多重化部から前記符号化された低周波数信号の情報を含む符号化系列を受け取り、復号して低周波数信号を得る低周波数復号部と、前記符号化系列逆多重化部及び前記低周波数復号部のうち少なくとも一つより第１の情報を受け取り、当該第１の情報に基づいて高周波数信号を生成する高周波数復号部と、前記符号化系列逆多重化部及び前記低周波数復号部のうち少なくとも一つより第２の情報を受け取り、当該第２の情報に基づいて、復号された低周波数信号の時間包絡形状を決定する低周波数時間包絡形状決定部と、前記低周波数時間包絡形状決定部にて決定された時間包絡形状に基づき前記復号された低周波数信号の時間包絡形状を修正し出力する低周波数時間包絡修正部と、前記低周波数時間包絡修正部から時間包絡形状を修正された低周波数信号を受け取り、前記高周波数復号部から高周波数信号を受け取り、前記時間包絡形状を修正された低周波数信号と前記高周波数信号とを合成することで、出力する音声信号を得る低周波数／高周波数信号合成部と、を備えることを特徴とする。 The voice decoding device according to the second aspect is a voice decoding device that decodes a coded voice signal and outputs a voice signal, and at least encodes a coding sequence including the coded voice signal. A coded sequence demultiplexing unit that divides the coded sequence into a coded sequence containing information on the low frequency signal of the voiced voice signal and a coded series containing information on the high frequency signal of the coded voice signal. A low-frequency decoding unit that receives a coded sequence containing information on the coded low-frequency signal from the coded-series demultiplexing unit and decodes it to obtain a low-frequency signal, and the coded-series demultiplexing unit and A high frequency decoding unit that receives first information from at least one of the low frequency decoding units and generates a high frequency signal based on the first information, a coded sequence demultiplexing unit, and the low frequency. A low frequency time entrainment shape determining unit that receives second information from at least one of the decoding units and determines the time envelopment shape of the decoded low frequency signal based on the second information, and the low frequency time The low frequency time wrapping correction unit that corrects and outputs the time wrapping shape of the decoded low frequency signal based on the time wrapping shape determined by the wrapping shape determination unit, and the time wrapping shape from the low frequency time wrapping correction unit. The corrected low frequency signal is received, the high frequency signal is received from the high frequency decoding unit, and the low frequency signal having the corrected time wrapping shape is combined with the high frequency signal to obtain an output audio signal. It is characterized by including a low frequency / high frequency signal synthesizer.

第３の態様に係る音声復号装置は、符号化された音声信号を復号して音声信号を出力する音声復号装置であって、前記符号化された音声信号を含む符号化系列を、少なくとも、符号化された前記音声信号の低周波数信号の情報を含む符号化系列と、符号化された前記音声信号の高周波数信号の情報を含む符号化系列とに分割する符号化系列逆多重化部と、前記符号化系列逆多重化部から前記符号化された低周波数信号の情報を含む符号化系列を受け取り、復号して低周波数信号を得る低周波数復号部と、前記符号化系列逆多重化部及び前記低周波数復号部のうち少なくとも一つより第１の情報を受け取り、当該第１の情報に基づいて高周波数信号を生成する高周波数復号部と、前記符号化系列逆多重化部、前記低周波数復号部、及び前記高周波数復号部のうち少なくとも一つより第２の情報を受け取り、当該第２の情報に基づいて、生成された高周波数信号の時間包絡形状を決定する高周波数時間包絡形状決定部と、前記高周波数時間包絡形状決定部にて決定された時間包絡形状に基づき前記生成された高周波数信号の時間包絡形状を修正し出力する高周波数時間包絡修正部と、前記低周波数復号部から低周波数信号を受け取り、前記高周波数時間包絡修正部から時間包絡形状を修正された高周波数信号を受け取り、前記低周波数信号と前記時間包絡形状を修正された高周波数信号とを合成することで、出力する音声信号を得る低周波数／高周波数信号合成部と、を備えることを特徴とする。 The voice decoding device according to the third aspect is a voice decoding device that decodes a coded voice signal and outputs a voice signal, and at least encodes a coding sequence including the coded voice signal. A coded sequence demultiplexing unit that divides the coded sequence into a coded sequence containing information on the low frequency signal of the voiced voice signal and a coded series containing information on the high frequency signal of the coded voice signal. A low-frequency decoding unit that receives a coded sequence containing information on the coded low-frequency signal from the coded-series demultiplexing unit and decodes it to obtain a low-frequency signal, and the coded-series demultiplexing unit and A high frequency decoding unit that receives first information from at least one of the low frequency decoding units and generates a high frequency signal based on the first information, a coded sequence demultiplexing unit, and the low frequency. High frequency time wrapping shape determination that receives second information from at least one of the decoding unit and the high frequency decoding unit and determines the time wrapping shape of the generated high frequency signal based on the second information. A high frequency time wrapping correction unit that corrects and outputs the time wrapping shape of the generated high frequency signal based on the time wrapping shape determined by the high frequency time wrapping shape determination unit, and the low frequency decoding unit. By receiving a low frequency signal from the high frequency signal, receiving a high frequency signal having a corrected time wrapping shape from the high frequency time wrapping correction unit, and synthesizing the low frequency signal and the high frequency signal having the corrected time wrapping shape. It is characterized by including a low frequency / high frequency signal synthesizer for obtaining an output audio signal.

第４の態様に係る音声復号装置は、符号化された音声信号を復号して音声信号を出力する音声復号装置であって、前記符号化された音声信号を含む符号化系列を、少なくとも、符号化された前記音声信号の低周波数信号の情報を含む符号化系列と、符号化された前記音声信号の高周波数信号の情報を含む符号化系列とに分割する符号化系列逆多重化部と、前記符号化系列逆多重化部から前記符号化された低周波数信号の情報を含む符号化系列を受け取り、復号して低周波数信号を得る低周波数復号部と、前記符号化系列逆多重化部及び前記低周波数復号部のうち少なくとも一つより第１の情報を受け取り、当該第１の情報に基づいて高周波数信号を生成する高周波数復号部と、前記符号化系列逆多重化部及び前記低周波数復号部のうち少なくとも一つより第２の情報を受け取り、当該第２の情報に基づいて、復号された低周波数信号の時間包絡形状を決定する低周波数時間包絡形状決定部と、前記低周波数時間包絡形状決定部にて決定された時間包絡形状に基づき前記復号された低周波数信号の時間包絡形状を修正し出力する低周波数時間包絡修正部と、前記符号化系列逆多重化部、前記低周波数復号部、及び前記高周波数復号部のうち少なくとも一つより第３の情報を受け取り、当該第３の情報に基づいて、生成された高周波数信号の時間包絡形状を決定する高周波数時間包絡形状決定部と、前記高周波数時間包絡形状決定部にて決定された時間包絡形状に基づき前記生成された高周波数信号の時間包絡形状を修正し出力する高周波数時間包絡修正部と、前記低周波数時間包絡修正部から時間包絡形状を修正された低周波数信号を受け取り、前記高周波数時間包絡修正部から時間包絡形状を修正された高周波数信号を受け取り、前記時間包絡形状を修正された低周波数信号と前記時間包絡形状を修正された高周波数信号とを合成することで、出力する音声信号を得る低周波数／高周波数信号合成部と、を備えることを特徴とする。 The voice decoding device according to the fourth aspect is a voice decoding device that decodes a coded voice signal and outputs a voice signal, and at least encodes a coding sequence including the coded voice signal. A coded sequence demultiplexing unit that divides the coded sequence into a coded sequence containing information on the low frequency signal of the voiced voice signal and a coded series containing information on the high frequency signal of the coded voice signal. A low-frequency decoding unit that receives a coded sequence containing information on the coded low-frequency signal from the coded-series demultiplexing unit and decodes it to obtain a low-frequency signal, and the coded-series demultiplexing unit and A high frequency decoding unit that receives first information from at least one of the low frequency decoding units and generates a high frequency signal based on the first information, a coded sequence demultiplexing unit, and the low frequency. A low frequency time entrainment shape determining unit that receives second information from at least one of the decoding units and determines the time envelopment shape of the decoded low frequency signal based on the second information, and the low frequency time The low frequency time entrainment correction unit that corrects and outputs the time entrapment shape of the decoded low frequency signal based on the time entrapment shape determined by the envelopment shape determination unit, the coded sequence demultiplexing unit, and the low frequency High frequency time entrainment shape determination that receives third information from at least one of the decoding unit and the high frequency decoding unit and determines the time envelopment shape of the generated high frequency signal based on the third information. A high frequency time wrapping correction unit that corrects and outputs the time wrapping shape of the generated high frequency signal based on the time wrapping shape determined by the high frequency time wrapping shape determination unit, and the low frequency time wrapping unit. The low frequency signal with the corrected time wrapping shape is received from the correction unit, the high frequency signal with the corrected time wrapping shape is received from the high frequency time wrapping correction part, and the low frequency signal with the corrected time wrapping shape and the above It is characterized by including a low frequency / high frequency signal synthesizing unit that obtains an output audio signal by synthesizing a high frequency signal having a modified time-wrapping shape.

なお、第２又は第４の態様に係る音声復号装置において、前記高周波数復号部は、前記符号化系列逆多重化部、前記低周波数復号部及び前記低周波数時間包絡修正部のうち少なくとも一つより情報を受け取り、当該情報に基づいて高周波数信号を生成してもよい。 In the voice decoding device according to the second or fourth aspect, the high frequency decoding unit is at least one of the coding sequence demultiplexing unit, the low frequency decoding unit, and the low frequency time envelope correction unit. More information may be received and a high frequency signal may be generated based on the information.

また、第１〜第４の態様に係る音声復号装置において、前記高周波数時間包絡修正部は、前記高周波数時間包絡形状決定部にて決定された時間包絡形状に基づいて、前記高周波数復号部にて高周波数信号を生成する際の中間信号の時間包絡形状を修正し、前記高周波数復号部は、前記時間包絡形状を修正された前記中間信号を用いて、残存する高周波数信号を生成する処理を実施してもよい。 Further, in the voice decoding apparatus according to the first to fourth aspects, the high frequency time wrapping correction unit is based on the time wrapping shape determined by the high frequency time wrapping shape determining unit. The time-wrapping shape of the intermediate signal when generating the high-frequency signal is corrected, and the high-frequency decoding unit generates the remaining high-frequency signal by using the intermediate signal whose time-wrapping shape is corrected. The process may be carried out.

ここで、前記高周波数復号部は、前記低周波数復号部にて復号された低周波数信号を受け取り、当該信号をサブバンド信号に分割する分析フィルタ部と、少なくとも前記分析フィルタ部で分割されたサブバンド信号を用いて高周波数信号を生成する高周波数信号生成部と、前記高周波数信号生成部で生成された高周波数信号の周波数包絡を調整する周波数包絡調整部と、を備え、前記中間信号は、前記高周波数信号生成部で生成された高周波数信号であってもよい。 Here, the high frequency decoding unit receives a low frequency signal decoded by the low frequency decoding unit and divides the signal into subband signals, and at least a sub divided by the analysis filter unit. The intermediate signal includes a high frequency signal generation unit that generates a high frequency signal using a band signal and a frequency wrapping adjustment unit that adjusts the frequency wrapping of the high frequency signal generated by the high frequency signal generation unit. , The high frequency signal generated by the high frequency signal generation unit may be used.

上述した第１〜第４の態様に係る音声復号装置の発明は、音声復号方法の発明として捉えることができ、以下のように記述することができる。 The invention of the audio decoding device according to the first to fourth aspects described above can be regarded as an invention of an audio decoding method, and can be described as follows.

第１の態様に係る音声復号方法は、符号化された音声信号を復号して音声信号を出力する音声復号装置、により実行される音声復号方法であって、前記符号化された音声信号を含む符号化系列を解析する符号化系列解析ステップと、解析後の前記符号化された音声信号を含む符号化系列を受け取り、復号して音声信号を得る音声復号ステップと、前記符号化系列解析ステップ及び前記音声復号ステップのうち少なくとも一つで得られた情報を受け取り、当該情報に基づいて、復号された音声信号の時間包絡形状を決定する時間包絡形状決定ステップと、前記時間包絡形状決定ステップにて決定された時間包絡形状に基づき前記復号された音声信号の時間包絡形状を修正し出力する時間包絡修正ステップと、を備えることを特徴とする。 The voice decoding method according to the first aspect is a voice decoding method executed by a voice decoding device that decodes a coded voice signal and outputs a voice signal, and includes the coded voice signal. A coded sequence analysis step for analyzing the coded sequence, a voice decoding step for receiving the coded sequence including the coded voice signal after analysis and decoding the coded sequence to obtain an audio signal, the coded sequence analysis step, and the coded sequence analysis step. In the time entrainment shape determination step of receiving the information obtained in at least one of the audio decoding steps and determining the time entrapment shape of the decoded audio signal based on the information, and the time entrapment shape determination step. It is characterized by comprising a time-wrapping correction step of correcting and outputting the time-wrapping shape of the decoded audio signal based on the determined time-wrapping shape.

第２の態様に係る音声復号方法は、符号化された音声信号を復号して音声信号を出力する音声復号装置、により実行される音声復号方法であって、前記符号化された音声信号を含む符号化系列を、少なくとも、符号化された前記音声信号の低周波数信号の情報を含む符号化系列と、符号化された前記音声信号の高周波数信号の情報を含む符号化系列とに分割する符号化系列逆多重化ステップと、分割により得られた前記符号化された低周波数信号の情報を含む符号化系列を受け取り、復号して低周波数信号を得る低周波数復号ステップと、前記符号化系列逆多重化ステップ及び前記低周波数復号ステップのうち少なくとも一つで得られた第１の情報を受け取り、当該第１の情報に基づいて高周波数信号を生成する高周波数復号ステップと、前記符号化系列逆多重化ステップ及び前記低周波数復号ステップのうち少なくとも一つで得られた第２の情報を受け取り、当該第２の情報に基づいて、復号された低周波数信号の時間包絡形状を決定する低周波数時間包絡形状決定ステップと、前記低周波数時間包絡形状決定ステップにて決定された時間包絡形状に基づき前記復号された低周波数信号の時間包絡形状を修正し出力する低周波数時間包絡修正ステップと、前記低周波数時間包絡修正ステップで得られた前記時間包絡形状を修正された低周波数信号を受け取り、前記高周波数復号ステップで得られた高周波数信号を受け取り、前記時間包絡形状を修正された低周波数信号と前記高周波数信号とを合成することで、出力する音声信号を得る低周波数／高周波数信号合成ステップと、を備えることを特徴とする。 The voice decoding method according to the second aspect is a voice decoding method executed by a voice decoding device that decodes a coded voice signal and outputs a voice signal, and includes the coded voice signal. A code that divides the coded sequence into at least a coded sequence containing information on the low frequency signal of the encoded audio signal and a coded sequence containing information on the high frequency signal of the encoded audio signal. The coded sequence demultiplexing step, the low frequency decoding step of receiving the coded sequence containing the information of the coded low frequency signal obtained by the division and decoding the coded sequence to obtain the low frequency signal, and the coded sequence inverse. A high frequency decoding step that receives the first information obtained in at least one of the multiplexing step and the low frequency decoding step and generates a high frequency signal based on the first information, and the coding sequence inverse. Low frequency time that receives the second information obtained in at least one of the multiplexing step and the low frequency decoding step, and determines the time entrainment shape of the decoded low frequency signal based on the second information. The low frequency time wrapping correction step, the low frequency time wrapping correction step, which corrects and outputs the time wrapping shape of the decoded low frequency signal based on the time wrapping shape determined in the low frequency time wrapping shape determination step, and the low frequency time wrapping shape determination step. The low frequency signal obtained in the frequency time wrapping correction step with the corrected time wrapping shape is received, the high frequency signal obtained in the high frequency decoding step is received, and the low frequency signal with the corrected time wrapping shape is received. It is characterized by comprising a low frequency / high frequency signal synthesis step of obtaining an output audio signal by synthesizing the high frequency signal.

第３の態様に係る音声復号方法は、符号化された音声信号を復号して音声信号を出力する音声復号装置、により実行される音声復号方法であって、前記符号化された音声信号を含む符号化系列を、少なくとも、符号化された前記音声信号の低周波数信号の情報を含む符号化系列と、符号化された前記音声信号の高周波数信号の情報を含む符号化系列とに分割する符号化系列逆多重化ステップと、分割により得られた前記符号化された低周波数信号の情報を含む符号化系列を受け取り、復号して低周波数信号を得る低周波数復号ステップと、前記符号化系列逆多重化ステップ及び前記低周波数復号ステップのうち少なくとも一つで得られた第１の情報を受け取り、当該第１の情報に基づいて高周波数信号を生成する高周波数復号ステップと、前記符号化系列逆多重化ステップ、前記低周波数復号ステップ、及び前記高周波数復号ステップのうち少なくとも一つで得られた第２の情報を受け取り、当該第２の情報に基づいて、生成された高周波数信号の時間包絡形状を決定する高周波数時間包絡形状決定ステップと、前記高周波数時間包絡形状決定ステップにて決定された時間包絡形状に基づき前記生成された高周波数信号の時間包絡形状を修正し出力する高周波数時間包絡修正ステップと、前記低周波数復号ステップで得られた低周波数信号を受け取り、前記高周波数時間包絡修正ステップで得られた前記時間包絡形状を修正された高周波数信号を受け取り、前記低周波数信号と前記時間包絡形状を修正された高周波数信号とを合成することで、出力する音声信号を得る低周波数／高周波数信号合成ステップと、を備えることを特徴とする。 The voice decoding method according to the third aspect is a voice decoding method executed by a voice decoding device that decodes a coded voice signal and outputs a voice signal, and includes the coded voice signal. A code that divides the coded sequence into at least a coded sequence containing information on the low frequency signal of the encoded audio signal and a coded sequence containing information on the high frequency signal of the encoded audio signal. The coded sequence demultiplexing step, the low frequency decoding step of receiving the coded sequence containing the information of the coded low frequency signal obtained by the division and decoding the coded sequence to obtain the low frequency signal, and the coded sequence inverse. A high frequency decoding step that receives the first information obtained in at least one of the multiplexing step and the low frequency decoding step and generates a high frequency signal based on the first information, and the coding sequence inverse. The second information obtained in at least one of the multiplexing step, the low frequency decoding step, and the high frequency decoding step is received, and the time wrapping of the generated high frequency signal based on the second information is received. The high frequency time entrainment shape determination step that determines the shape and the high frequency time that corrects and outputs the time entrapment shape of the generated high frequency signal based on the time entrapment shape determined in the high frequency time entrapment shape determination step. The low frequency signal obtained in the wrapping correction step and the low frequency decoding step is received, and the high frequency signal obtained in the high frequency time wrapping correction step is received and the time wrapping shape is corrected. It is characterized by comprising a low frequency / high frequency signal synthesis step of obtaining an output audio signal by synthesizing the high frequency signal having the modified time-wrapping shape.

第４の態様に係る音声復号方法は、符号化された音声信号を復号して音声信号を出力する音声復号装置、により実行される音声復号方法であって、前記符号化された音声信号を含む符号化系列を、少なくとも、符号化された前記音声信号の低周波数信号の情報を含む符号化系列と、符号化された前記音声信号の高周波数信号の情報を含む符号化系列とに分割する符号化系列逆多重化ステップと、前記符号化系列逆多重化ステップで得られた前記符号化された低周波数信号の情報を含む符号化系列を受け取り、復号して低周波数信号を得る低周波数復号ステップと、前記符号化系列逆多重化ステップ及び前記低周波数復号ステップのうち少なくとも一つで得られた第１の情報を受け取り、当該第１の情報に基づいて高周波数信号を生成する高周波数復号ステップと、前記符号化系列逆多重化ステップ及び前記低周波数復号ステップのうち少なくとも一つで得られた第２の情報を受け取り、当該第２の情報に基づいて、復号された低周波数信号の時間包絡形状を決定する低周波数時間包絡形状決定ステップと、前記低周波数時間包絡形状決定ステップにて決定された時間包絡形状に基づき前記復号された低周波数信号の時間包絡形状を修正し出力する低周波数時間包絡修正ステップと、前記符号化系列逆多重化ステップ、前記低周波数復号ステップ、及び前記高周波数復号ステップのうち少なくとも一つより第３の情報を受け取り、当該第３の情報に基づいて、生成された高周波数信号の時間包絡形状を決定する高周波数時間包絡形状決定ステップと、前記高周波数時間包絡形状決定ステップにて決定された時間包絡形状に基づき前記生成された高周波数信号の時間包絡形状を修正し出力する高周波数時間包絡修正ステップと、前記低周波数時間包絡修正ステップで得られた前記時間包絡形状を修正された低周波数信号を受け取り、前記高周波数時間包絡修正ステップで得られた前記時間包絡形状を修正された高周波数信号を受け取り、前記時間包絡形状を修正された低周波数信号と前記時間包絡形状を修正された高周波数信号とを合成することで、出力する音声信号を得る低周波数／高周波数信号合成ステップと、を備えることを特徴とする。 The voice decoding method according to the fourth aspect is a voice decoding method executed by a voice decoding device that decodes a coded voice signal and outputs a voice signal, and includes the coded voice signal. A code that divides the coded sequence into at least a coded sequence containing information on the low frequency signal of the encoded audio signal and a coded sequence containing information on the high frequency signal of the encoded audio signal. A low frequency decoding step in which a coded sequence demultiplexing step and a coded sequence including information on the encoded low frequency signal obtained in the coded sequence demultiplexing step are received and decoded to obtain a low frequency signal. A high frequency decoding step that receives the first information obtained in at least one of the coded sequence demultiplexing step and the low frequency decoding step and generates a high frequency signal based on the first information. And the second information obtained in at least one of the coded sequence demultiplexing step and the low frequency decoding step is received, and the time wrapping of the decoded low frequency signal based on the second information is received. A low frequency time entrainment shape determination step that determines the shape, and a low frequency time that corrects and outputs the time entrapment shape of the decoded low frequency signal based on the time entrapment shape determined in the low frequency time entrapment shape determination step. A third piece of information is received from at least one of the encapsulation correction step, the coding sequence demultiplexing step, the low frequency decoding step, and the high frequency decoding step, and is generated based on the third information. The high frequency time entrainment shape determination step for determining the time entrapment shape of the high frequency signal and the time entrapment shape of the generated high frequency signal based on the time entrapment shape determined in the high frequency time entrapment shape determination step. The time obtained in the high frequency time wrapping correction step and the low frequency signal obtained in the low frequency time wrapping correction step and the low frequency signal in which the time wrapping shape is corrected are received and output. A low frequency signal obtained by receiving a high frequency signal having a modified envelope shape and synthesizing the low frequency signal having the modified time envelope shape and the high frequency signal having the modified time envelope shape to output an audio signal. / It is characterized by including a high frequency signal synthesis step.

また、上述した第１〜第４の態様に係る音声復号装置の発明は、音声復号プログラムの発明として捉えることができ、以下のように記述することができる。 Further, the invention of the voice decoding device according to the first to fourth aspects described above can be regarded as an invention of a voice decoding program, and can be described as follows.

第１の態様に係る音声復号プログラムは、符号化された音声信号を復号して音声信号を出力する音声復号装置、に設けられたコンピュータを、前記符号化された音声信号を含む符号化系列を解析する符号化系列解析部と、前記符号化系列解析部から前記符号化された音声信号を含む符号化系列を受け取り、復号して音声信号を得る音声復号部と、前記符号化系列解析部及び前記音声復号部のうち少なくとも一つより情報を受け取り、当該情報に基づいて、復号された音声信号の時間包絡形状を決定する時間包絡形状決定部と、前記時間包絡形状決定部にて決定された時間包絡形状に基づき前記復号された音声信号の時間包絡形状を修正し出力する時間包絡修正部、として機能させることを特徴とする。 The audio decoding program according to the first aspect uses a computer provided in an audio decoding device that decodes an encoded audio signal and outputs an audio signal, and uses a coding sequence including the encoded audio signal. A coded sequence analysis unit to be analyzed, a voice decoding unit that receives a coded sequence including the coded audio signal from the coded sequence analysis unit and decodes the coded sequence to obtain an audio signal, the coded sequence analysis unit, and the coded sequence analysis unit. The time-enclosed shape determining unit that receives information from at least one of the audio decoding units and determines the time-enclosed shape of the decoded audio signal based on the information, and the time-enclosed shape determining unit determine the time-enclosed shape. It is characterized in that it functions as a time-wrapping correction unit that corrects and outputs the time-wrapping shape of the decoded audio signal based on the time-wrapping shape.

第２の態様に係る音声復号プログラムは、符号化された音声信号を復号して音声信号を出力する音声復号装置、に設けられたコンピュータを、前記符号化された音声信号を含む符号化系列を、少なくとも、符号化された前記音声信号の低周波数信号の情報を含む符号化系列と、符号化された前記音声信号の高周波数信号の情報を含む符号化系列とに分割する符号化系列逆多重化部と、前記符号化系列逆多重化部から前記符号化された低周波数信号の情報を含む符号化系列を受け取り、復号して低周波数信号を得る低周波数復号部と、前記符号化系列逆多重化部及び前記低周波数復号部のうち少なくとも一つより第１の情報を受け取り、当該第１の情報に基づいて高周波数信号を生成する高周波数復号部と、前記符号化系列逆多重化部及び前記低周波数復号部のうち少なくとも一つより第２の情報を受け取り、当該第２の情報に基づいて、復号された低周波数信号の時間包絡形状を決定する低周波数時間包絡形状決定部と、前記低周波数時間包絡形状決定部にて決定された時間包絡形状に基づき前記復号された低周波数信号の時間包絡形状を修正し出力する低周波数時間包絡修正部と、前記低周波数時間包絡修正部から時間包絡形状を修正された低周波数信号を受け取り、前記高周波数復号部から高周波数信号を受け取り、前記時間包絡形状を修正された低周波数信号と前記高周波数信号とを合成することで、出力する音声信号を得る低周波数／高周波数信号合成部、として機能させることを特徴とする。 The voice decoding program according to the second aspect uses a computer provided in a voice decoding device that decodes a coded voice signal and outputs a voice signal, and uses a coding sequence including the coded voice signal. , At least, a coded sequence demultiplexing that divides into a coded sequence containing information on the low frequency signal of the encoded voice signal and a coded series containing information on the high frequency signal of the coded voice signal. A low-frequency decoding unit that receives a coded sequence containing information on the coded low-frequency signal from the coded unit and the coded-series demultiplexing unit and decodes the coded sequence to obtain a low-frequency signal, and the coded sequence inverse. A high frequency decoding unit that receives first information from at least one of the multiplexing unit and the low frequency decoding unit and generates a high frequency signal based on the first information, and the coded sequence demultiplexing unit. And the low frequency time wrapping shape determining unit that receives the second information from at least one of the low frequency decoding units and determines the time wrapping shape of the decoded low frequency signal based on the second information. From the low frequency time wrapping correction unit that corrects and outputs the time wrapping shape of the decoded low frequency signal based on the time wrapping shape determined by the low frequency time wrapping shape determination unit, and the low frequency time wrapping correction unit. It receives a low-frequency signal with a modified time-wrapping shape, receives a high-frequency signal from the high-frequency decoding unit, and outputs it by synthesizing the low-frequency signal with a corrected time-wrapping shape and the high-frequency signal. It is characterized in that it functions as a low-frequency / high-frequency signal synthesizer that obtains an audio signal.

第３の態様に係る音声復号プログラムは、符号化された音声信号を復号して音声信号を出力する音声復号装置、に設けられたコンピュータを、前記符号化された音声信号を含む符号化系列を、少なくとも、符号化された前記音声信号の低周波数信号の情報を含む符号化系列と、符号化された前記音声信号の高周波数信号の情報を含む符号化系列とに分割する符号化系列逆多重化部と、前記符号化系列逆多重化部から前記符号化された低周波数信号の情報を含む符号化系列を受け取り、復号して低周波数信号を得る低周波数復号部と、前記符号化系列逆多重化部及び前記低周波数復号部のうち少なくとも一つより第１の情報を受け取り、当該第１の情報に基づいて高周波数信号を生成する高周波数復号部と、前記符号化系列逆多重化部、前記低周波数復号部、及び前記高周波数復号部のうち少なくとも一つより第２の情報を受け取り、当該第２の情報に基づいて、生成された高周波数信号の時間包絡形状を決定する高周波数時間包絡形状決定部と、前記高周波数時間包絡形状決定部にて決定された時間包絡形状に基づき前記生成された高周波数信号の時間包絡形状を修正し出力する高周波数時間包絡修正部と、前記低周波数復号部から低周波数信号を受け取り、前記高周波数時間包絡修正部から時間包絡形状を修正された高周波数信号を受け取り、前記低周波数信号と前記時間包絡形状を修正された高周波数信号とを合成することで、出力する音声信号を得る低周波数／高周波数信号合成部、として機能させることを特徴とする。 The voice decoding program according to the third aspect uses a computer provided in a voice decoding device that decodes a coded voice signal and outputs a voice signal, and uses a coding sequence including the coded voice signal. , At least, a coded sequence demultiplexing that divides into a coded sequence containing information on the low frequency signal of the encoded voice signal and a coded series containing information on the high frequency signal of the coded voice signal. A low-frequency decoding unit that receives a coded sequence containing information on the coded low-frequency signal from the coded unit and the coded-series demultiplexing unit and decodes the coded sequence to obtain a low-frequency signal, and the coded sequence inverse. A high frequency decoding unit that receives first information from at least one of the multiplexing unit and the low frequency decoding unit and generates a high frequency signal based on the first information, and the coded sequence demultiplexing unit. , The high frequency that receives the second information from at least one of the low frequency decoding unit and the high frequency decoding unit and determines the time-enclosed shape of the generated high frequency signal based on the second information. The time wrapping shape determining unit, the high frequency time wrapping correction unit that corrects and outputs the time wrapping shape of the generated high frequency signal based on the time wrapping shape determined by the high frequency time wrapping shape determining unit, and the above. A low frequency signal is received from the low frequency decoding unit, a high frequency signal having a corrected time wrapping shape is received from the high frequency time wrapping correction unit, and the low frequency signal and the high frequency signal having the corrected time wrapping shape are input. It is characterized in that it functions as a low-frequency / high-frequency signal synthesizer that obtains an output audio signal by synthesizing.

第４の態様に係る音声復号プログラムは、符号化された音声信号を復号して音声信号を出力する音声復号装置、に設けられたコンピュータを、前記符号化された音声信号を含む符号化系列を、少なくとも、符号化された前記音声信号の低周波数信号の情報を含む符号化系列と、符号化された前記音声信号の高周波数信号の情報を含む符号化系列とに分割する符号化系列逆多重化部と、前記符号化系列逆多重化部から前記符号化された低周波数信号の情報を含む符号化系列を受け取り、復号して低周波数信号を得る低周波数復号部と、前記符号化系列逆多重化部及び前記低周波数復号部のうち少なくとも一つより第１の情報を受け取り、当該第１の情報に基づいて高周波数信号を生成する高周波数復号部と、前記符号化系列逆多重化部及び前記低周波数復号部のうち少なくとも一つより第２の情報を受け取り、当該第２の情報に基づいて、復号された低周波数信号の時間包絡形状を決定する低周波数時間包絡形状決定部と、前記低周波数時間包絡形状決定部にて決定された時間包絡形状に基づき前記復号された低周波数信号の時間包絡形状を修正し出力する低周波数時間包絡修正部と、前記符号化系列逆多重化部、前記低周波数復号部、及び前記高周波数復号部のうち少なくとも一つより第３の情報を受け取り、当該第３の情報に基づいて、生成された高周波数信号の時間包絡形状を決定する高周波数時間包絡形状決定部と、前記高周波数時間包絡形状決定部にて決定された時間包絡形状に基づき前記生成された高周波数信号の時間包絡形状を修正し出力する高周波数時間包絡修正部と、前記低周波数時間包絡修正部から時間包絡形状を修正された低周波数信号を受け取り、前記高周波数時間包絡修正部から時間包絡形状を修正された高周波数信号を受け取り、前記時間包絡形状を修正された低周波数信号と前記時間包絡形状を修正された高周波数信号とを合成することで、出力する音声信号を得る低周波数／高周波数信号合成部、として機能させることを特徴とする。 The voice decoding program according to the fourth aspect uses a computer provided in a voice decoding device that decodes a coded voice signal and outputs a voice signal, and uses a coding sequence including the coded voice signal. , At least, a coded sequence demultiplexing that divides into a coded sequence containing information on the low frequency signal of the encoded voice signal and a coded series containing information on the high frequency signal of the coded voice signal. A low-frequency decoding unit that receives a coded sequence containing information on the coded low-frequency signal from the coded unit and the coded-series demultiplexing unit and decodes the coded sequence to obtain a low-frequency signal, and the coded sequence inverse. A high frequency decoding unit that receives first information from at least one of the multiplexing unit and the low frequency decoding unit and generates a high frequency signal based on the first information, and the coded sequence demultiplexing unit. And the low frequency time wrapping shape determining unit that receives the second information from at least one of the low frequency decoding units and determines the time wrapping shape of the decoded low frequency signal based on the second information. A low frequency time wrapping correction unit that corrects and outputs the time wrapping shape of the decoded low frequency signal based on the time wrapping shape determined by the low frequency time wrapping shape determination unit, and the coded sequence demultiplexing unit. , The high frequency that receives the third information from at least one of the low frequency decoding unit and the high frequency decoding unit, and determines the time-enclosed shape of the generated high frequency signal based on the third information. The time wrapping shape determining unit, the high frequency time wrapping correction unit that corrects and outputs the time wrapping shape of the generated high frequency signal based on the time wrapping shape determined by the high frequency time wrapping shape determining unit, and the above. The low frequency signal with the corrected time wrapping shape is received from the low frequency time wrapping correction part, the high frequency signal with the corrected time wrapping shape is received from the high frequency time wrapping correction part, and the low frequency signal with the corrected time wrapping shape is received. It is characterized in that it functions as a low-frequency / high-frequency signal synthesizer that obtains an output audio signal by synthesizing a frequency signal and a high-frequency signal having a modified time-wrapping shape.

出願人は、上記の目的を達成するために、以下の第１〜第４の態様に係る音声符号化装置を発明した。 In order to achieve the above object, the applicant has invented a voice coding device according to the following first to fourth aspects.

第１の態様に係る音声符号化装置は、入力される音声信号を符号化して符号化系列を出力する音声符号化装置であって、前記音声信号を符号化する音声符号化部と、前記音声信号の時間包絡情報を算出し符号化する時間包絡情報符号化部と、前記音声符号化部で得られる前記音声信号を含む符号化系列と、前記時間包絡情報符号化部で得られる時間包絡情報の符号化系列とを多重化する符号化系列多重化部と、を備えることを特徴とする。 The voice coding device according to the first aspect is a voice coding device that encodes an input voice signal and outputs a coded sequence, and includes a voice coding unit that encodes the voice signal and the voice. The time-wrapping information coding unit that calculates and encodes the time-wrapping information of the signal, the coding series including the voice signal obtained by the voice coding unit, and the time-wrapping information obtained by the time-wrapping information coding unit. It is characterized by including a coded sequence multiplexing unit for multiplexing the coded sequence of the above.

第２の態様に係る音声符号化装置は、入力される音声信号を符号化して符号化系列を出力する音声符号化装置であって、前記音声信号の低周波数成分を符号化する低周波数符号化部と、前記音声信号の高周波数成分を符号化する高周波数符号化部と、前記音声信号、前記低周波数符号化部の符号化結果、及び当該低周波数符号化過程で得られる情報のうち少なくとも一つ以上に基づいて、低周波数成分の時間包絡情報を算出し符号化する低周波数時間包絡情報符号化部と、前記低周波数符号化部で得られる前記低周波数成分を含む符号化系列と、前記高周波数符号化部で得られる前記高周波数成分を含む符号化系列と、前記低周波数時間包絡情報符号化部で得られる低周波数成分の時間包絡情報の符号化系列とを多重化する符号化系列多重化部と、を備えることを特徴とする。 The voice coding device according to the second aspect is a voice coding device that encodes an input voice signal and outputs a coded sequence, and is a low frequency coding device that encodes a low frequency component of the voice signal. At least of the unit, the high frequency coding unit that encodes the high frequency component of the audio signal, the audio signal, the coding result of the low frequency coding unit, and the information obtained in the low frequency coding process. A low frequency time wrapping information coding unit that calculates and encodes the time wrapping information of the low frequency component based on one or more, and a coding series including the low frequency component obtained by the low frequency coding unit. Coding that multiplexes the coding sequence including the high frequency component obtained by the high frequency coding unit and the coding sequence of the time inclusion information of the low frequency component obtained by the low frequency time inclusion information coding unit. It is characterized by including a sequence multiplexing unit.

第３の態様に係る音声符号化装置は、入力される音声信号を符号化して符号化系列を出力する音声符号化装置であって、前記音声信号の低周波数成分を符号化する低周波数符号化部と、前記音声信号の高周波数成分を符号化する高周波数符号化部と、前記音声信号、前記低周波数符号化部の符号化結果、当該低周波数符号化過程で得られる情報、前記高周波数符号化部の符号化結果、及び当該高周波数符号化過程で得られる情報のうち少なくとも一つ以上に基づいて、高周波数成分の時間包絡情報を算出し符号化する高周波数時間包絡情報符号化部と、前記低周波数符号化部で得られる前記低周波数成分を含む符号化系列と、前記高周波数符号化部で得られる前記高周波数成分を含む符号化系列と、前記高周波数時間包絡情報符号化部で得られる高周波数成分の時間包絡情報の符号化系列とを多重化する符号化系列多重化部と、を備えることを特徴とする。 The voice coding device according to the third aspect is a voice coding device that encodes an input voice signal and outputs a coded sequence, and is a low frequency coding device that encodes a low frequency component of the voice signal. The unit, the high frequency coding unit that encodes the high frequency component of the audio signal, the audio signal, the coding result of the low frequency coding unit, the information obtained in the low frequency coding process, and the high frequency. High frequency time wrapping information coding unit that calculates and encodes the time wrapping information of high frequency components based on the coding result of the coding unit and at least one of the information obtained in the high frequency coding process. And the coding series including the low frequency component obtained by the low frequency coding unit, the coding series including the high frequency component obtained by the high frequency coding unit, and the high frequency time wrapping information coding. It is characterized by including a coded sequence multiplexing section for multiplexing the coded sequence of the time wrapping information of the high frequency component obtained in the section.

第４の態様に係る音声符号化装置は、入力される音声信号を符号化して符号化系列を出力する音声符号化装置であって、前記音声信号の低周波数成分を符号化する低周波数符号化部と、前記音声信号の高周波数成分を符号化する高周波数符号化部と、前記音声信号、前記低周波数符号化部の符号化結果、及び当該低周波数符号化過程で得られる情報のうち少なくとも一つ以上に基づいて、低周波数成分の時間包絡情報を算出し符号化する低周波数時間包絡情報符号化部と、前記音声信号、前記低周波数符号化部の符号化結果、当該低周波数符号化過程で得られる情報、前記高周波数符号化部の符号化結果、及び当該高周波数符号化過程で得られる情報のうち少なくとも一つ以上に基づいて、高周波数成分の時間包絡情報を算出し符号化する高周波数時間包絡情報符号化部と、前記低周波数符号化部で得られる前記低周波数成分を含む符号化系列と、前記高周波数符号化部で得られる前記高周波数成分を含む符号化系列と、前記低周波数時間包絡情報符号化部で得られる低周波数成分の時間包絡情報の符号化系列と、前記高周波数時間包絡情報符号化部で得られる高周波数成分の時間包絡情報の符号化系列とを多重化する符号化系列多重化部と、を備えることを特徴とする。 The voice coding device according to the fourth aspect is a voice coding device that encodes an input voice signal and outputs a coded sequence, and is a low frequency coding device that encodes a low frequency component of the voice signal. At least of the unit, the high frequency coding unit that encodes the high frequency component of the audio signal, the audio signal, the coding result of the low frequency coding unit, and the information obtained in the low frequency coding process. The low frequency time wrapping information coding unit that calculates and encodes the time wrapping information of the low frequency component based on one or more, the voice signal, the coding result of the low frequency coding unit, and the low frequency coding Based on at least one of the information obtained in the process, the coding result of the high frequency coding unit, and the information obtained in the high frequency coding process, the time-related information of the high frequency component is calculated and encoded. A high frequency time wrapping information coding unit, a coding series including the low frequency component obtained by the low frequency coding unit, and a coding series including the high frequency component obtained by the high frequency coding unit. , The coding sequence of the time inclusion information of the low frequency component obtained by the low frequency time inclusion information coding unit and the coding sequence of the time inclusion information of the high frequency component obtained by the high frequency time inclusion information coding unit. It is characterized by including a coding sequence multiplexing unit for multiplexing.

上述した第１〜第４の態様に係る音声符号化装置の発明は、音声符号化方法の発明として捉えることができ、以下のように記述することができる。 The invention of the voice coding device according to the first to fourth aspects described above can be regarded as an invention of a voice coding method, and can be described as follows.

第１の態様に係る音声符号化方法は、入力される音声信号を符号化して符号化系列を出力する音声符号化装置、により実行される音声符号化方法であって、前記音声信号を符号化する音声符号化ステップと、前記音声信号の時間包絡情報を算出し符号化する時間包絡情報符号化ステップと、前記音声符号化ステップで得られる前記音声信号を含む符号化系列と、前記時間包絡情報符号化ステップで得られる時間包絡情報の符号化系列とを多重化する符号化系列多重化ステップと、を備えることを特徴とする。 The voice coding method according to the first aspect is a voice coding method executed by a voice coding device that encodes an input voice signal and outputs a coded sequence, and encodes the voice signal. A voice coding step to be performed, a time-wrapping information coding step for calculating and encoding the time-wrapping information of the voice signal, a coding sequence including the voice signal obtained in the voice-coding step, and the time-wrapping information. It is characterized by comprising a coding sequence multiplexing step for multiplexing the coding sequence of the time-related information obtained in the coding step.

第２の態様に係る音声符号化方法は、入力される音声信号を符号化して符号化系列を出力する音声符号化装置、により実行される音声符号化方法であって、前記音声信号の低周波数成分を符号化する低周波数符号化ステップと、前記音声信号の高周波数成分を符号化する高周波数符号化ステップと、前記音声信号、前記低周波数符号化ステップの符号化結果、及び当該低周波数符号化過程で得られる情報のうち少なくとも一つ以上に基づいて、低周波数成分の時間包絡情報を算出し符号化する低周波数時間包絡情報符号化ステップと、前記低周波数符号化ステップで得られる前記低周波数成分を含む符号化系列と、前記高周波数符号化ステップで得られる前記高周波数成分を含む符号化系列と、前記低周波数時間包絡情報符号化ステップで得られる低周波数成分の時間包絡情報の符号化系列とを多重化する符号化系列多重化ステップと、を備えることを特徴とする。 The voice coding method according to the second aspect is a voice coding method executed by a voice coding device that encodes an input voice signal and outputs a coded sequence, and is a low frequency of the voice signal. A low-frequency coding step for coding a component, a high-frequency coding step for coding a high-frequency component of the voice signal, a coding result of the voice signal, the low-frequency coding step, and the low-frequency code. A low frequency time wrapping information coding step that calculates and encodes the time wrapping information of a low frequency component based on at least one or more of the information obtained in the conversion process, and the low frequency coding step obtained in the low frequency coding step. The code of the coded sequence including the frequency component, the coded sequence including the high frequency component obtained in the high frequency coding step, and the code of the time wrapping information of the low frequency component obtained in the low frequency time wrapping information coding step. It is characterized by comprising a coded sequence multiplexing step for multiplexing the compounded sequence.

第３の態様に係る音声符号化方法は、入力される音声信号を符号化して符号化系列を出力する音声符号化装置、により実行される音声符号化方法であって、前記音声信号の低周波数成分を符号化する低周波数符号化ステップと、前記音声信号の高周波数成分を符号化する高周波数符号化ステップと、前記音声信号、前記低周波数符号化ステップの符号化結果、当該低周波数符号化過程で得られる情報、前記高周波数符号化ステップの符号化結果、及び当該高周波数符号化過程で得られる情報のうち少なくとも一つ以上に基づいて、高周波数成分の時間包絡情報を算出し符号化する高周波数時間包絡情報符号化ステップと、前記低周波数符号化ステップで得られる前記低周波数成分を含む符号化系列と、前記高周波数符号化ステップで得られる前記高周波数成分を含む符号化系列と、前記高周波数時間包絡情報符号化ステップで得られる高周波数成分の時間包絡情報の符号化系列とを多重化する符号化系列多重化ステップと、を備えることを特徴とする。 The voice coding method according to the third aspect is a voice coding method executed by a voice coding device that encodes an input voice signal and outputs a coded sequence, and is a low frequency of the voice signal. The low frequency coding step for coding the components, the high frequency coding step for coding the high frequency components of the voice signal, and the coding result of the voice signal and the low frequency coding step, the low frequency coding. Based on at least one of the information obtained in the process, the coding result of the high frequency coding step, and the information obtained in the high frequency coding process, the time-related information of the high frequency component is calculated and encoded. A high-frequency time-enclosed information coding step, a coding sequence containing the low-frequency component obtained in the low-frequency coding step, and a coding sequence containing the high-frequency component obtained in the high-frequency coding step. It is characterized by comprising a coding sequence multiplexing step for multiplexing the coding sequence of the time inclusion information of the high frequency component obtained in the high frequency time inclusion information coding step.

第４の態様に係る音声符号化方法は、入力される音声信号を符号化して符号化系列を出力する音声符号化装置、により実行される音声符号化方法であって、前記音声信号の低周波数成分を符号化する低周波数符号化ステップと、前記音声信号の高周波数成分を符号化する高周波数符号化ステップと、前記音声信号、前記低周波数符号化ステップの符号化結果、及び当該低周波数符号化過程で得られる情報のうち少なくとも一つ以上に基づいて、低周波数成分の時間包絡情報を算出し符号化する低周波数時間包絡情報符号化ステップと、前記音声信号、前記低周波数符号化ステップの符号化結果、当該低周波数符号化過程で得られる情報、前記高周波数符号化ステップの符号化結果、及び当該高周波数符号化過程で得られる情報のうち少なくとも一つ以上に基づいて、高周波数成分の時間包絡情報を算出し符号化する高周波数時間包絡情報符号化ステップと、前記低周波数符号化ステップで得られる前記低周波数成分を含む符号化系列と、前記高周波数符号化ステップで得られる前記高周波数成分を含む符号化系列と、前記低周波数時間包絡情報符号化ステップで得られる低周波数成分の時間包絡情報の符号化系列と、前記高周波数時間包絡情報符号化ステップで得られる高周波数成分の時間包絡情報の符号化系列とを多重化する符号化系列多重化ステップと、を備えることを特徴とする。 The voice coding method according to the fourth aspect is a voice coding method executed by a voice coding device that encodes an input voice signal and outputs a coded sequence, and is a low frequency of the voice signal. A low-frequency coding step for coding a component, a high-frequency coding step for coding a high-frequency component of the voice signal, a coding result of the voice signal, the low-frequency coding step, and the low-frequency code. A low frequency time wrapping information coding step that calculates and encodes time wrapping information of a low frequency component based on at least one or more of the information obtained in the conversion process, and a voice signal and the low frequency coding step. A high frequency component based on at least one of the coding result, the information obtained in the low frequency coding process, the coding result of the high frequency coding step, and the information obtained in the high frequency coding process. The high frequency time inclusion information coding step that calculates and encodes the time inclusion information of the above, the coding series including the low frequency component obtained in the low frequency coding step, and the said high frequency coding step. A coding sequence containing a high frequency component, a coding sequence of time inclusion information of a low frequency component obtained in the low frequency time inclusion information coding step, and a high frequency component obtained in the high frequency time inclusion information coding step. It is characterized by comprising a coded sequence multiplexing step for multiplexing the coded sequence of the time-related information of.

また、上述した第１〜第４の態様に係る音声符号化装置の発明は、音声符号化プログラムの発明として捉えることができ、以下のように記述することができる。 Further, the invention of the voice coding device according to the first to fourth aspects described above can be regarded as an invention of a voice coding program, and can be described as follows.

第１の態様に係る音声符号化プログラムは、入力される音声信号を符号化して符号化系列を出力する音声符号化装置、に設けられたコンピュータを、前記音声信号を符号化する音声符号化部と、前記音声信号の時間包絡情報を算出し符号化する時間包絡情報符号化部と、前記音声符号化部で得られる前記音声信号を含む符号化系列と、前記時間包絡情報符号化部で得られる時間包絡情報の符号化系列とを多重化する符号化系列多重化部、として機能させることを特徴とする。 The voice coding program according to the first aspect is a voice coding unit that encodes a computer provided in a voice coding device that encodes an input voice signal and outputs a coded sequence. A time-wrapping information coding unit that calculates and encodes the time-wrapping information of the voice signal, a coding series including the voice signal obtained by the voice coding unit, and a time-wrapping information coding unit obtained by the time-wrapping information coding unit. It is characterized in that it functions as a coded sequence multiplexing unit that multiplexes the coded sequence of the time-related information.

第２の態様に係る音声符号化プログラムは、入力される音声信号を符号化して符号化系列を出力する音声符号化装置、に設けられたコンピュータを、前記音声信号の低周波数成分を符号化する低周波数符号化部と、前記音声信号の高周波数成分を符号化する高周波数符号化部と、前記音声信号、前記低周波数符号化部の符号化結果、及び当該低周波数符号化過程で得られる情報のうち少なくとも一つ以上に基づいて、低周波数成分の時間包絡情報を算出し符号化する低周波数時間包絡情報符号化部と、前記低周波数符号化部で得られる前記低周波数成分を含む符号化系列と、前記高周波数符号化部で得られる前記高周波数成分を含む符号化系列と、前記低周波数時間包絡情報符号化部で得られる低周波数成分の時間包絡情報の符号化系列とを多重化する符号化系列多重化部、として機能させることを特徴とする。 The voice coding program according to the second aspect encodes a low frequency component of the voice signal by a computer provided in a voice coding device that encodes an input voice signal and outputs a coded sequence. Obtained in the low frequency coding unit, the high frequency coding unit that encodes the high frequency component of the voice signal, the voice signal, the coding result of the low frequency coding unit, and the low frequency coding process. A low frequency time wrapping information coding unit that calculates and encodes the time wrapping information of the low frequency component based on at least one of the information, and a code containing the low frequency component obtained by the low frequency coding unit. The conversion sequence, the coding sequence including the high frequency component obtained by the high frequency coding unit, and the coding series of the time inclusion information of the low frequency component obtained by the low frequency time inclusion information coding unit are multiplexed. It is characterized in that it functions as a coded sequence multiplexing unit to be converted.

第３の態様に係る音声符号化プログラムは、入力される音声信号を符号化して符号化系列を出力する音声符号化装置、に設けられたコンピュータを、前記音声信号の低周波数成分を符号化する低周波数符号化部と、前記音声信号の高周波数成分を符号化する高周波数符号化部と、前記音声信号、前記低周波数符号化部の符号化結果、当該低周波数符号化過程で得られる情報、前記高周波数符号化部の符号化結果、及び当該高周波数符号化過程で得られる情報のうち少なくとも一つ以上に基づいて、高周波数成分の時間包絡情報を算出し符号化する高周波数時間包絡情報符号化部と、前記低周波数符号化部で得られる前記低周波数成分を含む符号化系列と、前記高周波数符号化部で得られる前記高周波数成分を含む符号化系列と、前記高周波数時間包絡情報符号化部で得られる高周波数成分の時間包絡情報の符号化系列とを多重化する符号化系列多重化部、として機能させることを特徴とする。 The voice coding program according to the third aspect encodes a low frequency component of the voice signal by a computer provided in a voice coding device that encodes an input voice signal and outputs a coded sequence. The low frequency coding unit, the high frequency coding unit that encodes the high frequency component of the voice signal, the coding result of the voice signal and the low frequency coding unit, and the information obtained in the low frequency coding process. , High frequency time encapsulation that calculates and encodes the time encapsulation information of the high frequency component based on the coding result of the high frequency coding unit and at least one of the information obtained in the high frequency coding process. An information coding unit, a coding series including the low frequency component obtained by the low frequency coding unit, a coding series including the high frequency component obtained by the high frequency coding unit, and the high frequency time. It is characterized in that it functions as a coding sequence multiplexing unit that multiplexes the coding sequence of the time inclusion information of the high frequency component obtained by the inclusion information coding unit.

第４の態様に係る音声符号化プログラムは、入力される音声信号を符号化して符号化系列を出力する音声符号化装置、に設けられたコンピュータを、前記音声信号の低周波数成分を符号化する低周波数符号化部と、前記音声信号の高周波数成分を符号化する高周波数符号化部と、前記音声信号、前記低周波数符号化部の符号化結果、及び当該低周波数符号化過程で得られる情報のうち少なくとも一つ以上に基づいて、低周波数成分の時間包絡情報を算出し符号化する低周波数時間包絡情報符号化部と、前記音声信号、前記低周波数符号化部の符号化結果、当該低周波数符号化過程で得られる情報、前記高周波数符号化部の符号化結果、及び当該高周波数符号化過程で得られる情報のうち少なくとも一つ以上に基づいて、高周波数成分の時間包絡情報を算出し符号化する高周波数時間包絡情報符号化部と、前記低周波数符号化部で得られる前記低周波数成分を含む符号化系列と、前記高周波数符号化部で得られる前記高周波数成分を含む符号化系列と、前記低周波数時間包絡情報符号化部で得られる低周波数成分の時間包絡情報の符号化系列と、前記高周波数時間包絡情報符号化部で得られる高周波数成分の時間包絡情報の符号化系列とを多重化する符号化系列多重化部、として機能させることを特徴とする。 The voice coding program according to the fourth aspect encodes a low frequency component of the voice signal by a computer provided in a voice coding device that encodes an input voice signal and outputs a coded sequence. Obtained in the low frequency coding unit, the high frequency coding unit that encodes the high frequency component of the voice signal, the voice signal, the coding result of the low frequency coding unit, and the low frequency coding process. The low frequency time wrapping information coding unit that calculates and encodes the time wrapping information of the low frequency component based on at least one of the information, the voice signal, and the coding result of the low frequency coding unit. Based on at least one of the information obtained in the low frequency coding process, the coding result of the high frequency coding unit, and the information obtained in the high frequency coding process, the time wrapping information of the high frequency component is obtained. The high frequency time wrapping information coding unit to be calculated and encoded, the coding series including the low frequency component obtained by the low frequency coding unit, and the high frequency component obtained by the high frequency coding unit are included. The coding series, the coding series of the time-wrapping information of the low-frequency component obtained by the low-frequency time-wrapping information coding unit, and the time-wrapping information of the high-frequency component obtained by the high-frequency time-wrapping information coding unit. It is characterized in that it functions as a coded sequence multiplexing unit that multiplexes the coded sequence.

出願人は、上記の目的を達成するために、さらに以下の第５及び第６の態様に係る音声復号装置を発明した。 In order to achieve the above object, the applicant further invented the audio decoding device according to the following fifth and sixth aspects.

第５の態様に係る音声復号装置は、符号化された音声信号を復号して音声信号を出力する音声復号装置であって、前記符号化された音声信号を含む符号化系列を、少なくとも符号化された前記音声信号の低周波数信号の情報を含む符号化系列と、符号化された前記音声信号の高周波数信号の情報を含む符号化系列に分割する符号化系列逆多重化部と、前記符号化系列逆多重化部から前記符号化された低周波数信号の情報を含む符号化系列を受け取り、復号して低周波数信号を得る低周波数復号部と、前記符号化系列逆多重化部及び前記低周波数復号部のうち少なくとも一つより情報を受け取り、当該情報に基づいて高周波数信号を生成する高周波数復号部と、前記符号化系列逆多重化部、前記低周波数復号部、及び前記高周波数復号部のうち少なくとも一つより情報を受け取り、復号された低周波数信号及び生成された高周波数信号の時間包絡形状を決定する時間包絡形状決定部と、前記時間包絡形状決定部にて決定された時間包絡形状に基づき前記復号された低周波数信号の時間包絡形状を修正し出力する低周波数時間包絡修正部と、前記時間包絡形状決定部にて決定された時間包絡形状に基づき前記生成された高周波数信号の時間包絡形状を修正し出力する高周波数時間包絡修正部と、前記低周波数時間包絡修正部から時間包絡を修正された低周波数信号を受け取り、前記高周波数時間包絡修正部から時間包絡を修正された高周波数信号を受け取り、出力する音声信号を合成する低周波数／高周波数信号合成部と、を備えることを特徴とする。 The voice decoding device according to the fifth aspect is a voice decoding device that decodes a coded voice signal and outputs a voice signal, and at least encodes a coding sequence including the coded voice signal. A coded sequence demultiplexing unit that divides the coded sequence into a coded sequence containing information on the low frequency signal of the voice signal and a coded series containing information on the high frequency signal of the coded voice signal, and the code. A low-frequency decoding unit that receives a coded sequence containing information on the coded low-frequency signal from the coded-series demultiplexing unit and decodes it to obtain a low-frequency signal, a coded-series demultiplexing unit, and the low frequency section. A high frequency decoding unit that receives information from at least one of the frequency decoding units and generates a high frequency signal based on the information, the coded sequence demultiplexing unit, the low frequency decoding unit, and the high frequency decoding unit. A time-environment shape determining unit that receives information from at least one of the units and determines the time-environment shape of the decoded low-frequency signal and the generated high-frequency signal, and a time determined by the time-environment shape determining unit. The low frequency time wrapping correction unit that corrects and outputs the time wrapping shape of the decoded low frequency signal based on the wrapping shape, and the generated high frequency based on the time wrapping shape determined by the time wrapping shape determining unit. The high frequency time wrapping correction unit that corrects and outputs the time wrapping shape of the signal and the low frequency signal with the corrected time wrapping received from the low frequency time wrapping correction unit are received, and the time wrapping is corrected from the high frequency time wrapping correction unit. It is characterized by including a low frequency / high frequency signal synthesizing unit that receives the generated high frequency signal and synthesizes an output audio signal.

第６の態様に係る音声復号装置は、符号化された音声信号を復号して音声信号を出力する音声復号装置であって、前記符号化された音声信号を含む符号化系列を、少なくとも符号化された前記音声信号の低周波数信号の情報を含む符号化系列と、符号化された前記音声信号の高周波数信号の情報を含む符号化系列に分割する符号化系列逆多重化部と、前記符号化系列逆多重化部から前記符号化された低周波数信号の情報を含む符号化系列を受け取り、復号して低周波数信号を得る低周波数復号部と、前記符号化系列逆多重化部及び前記低周波数復号部のうち少なくとも一つより情報を受け取り、当該情報に基づいて高周波数信号を生成する高周波数復号部と、前記符号化系列逆多重化部、前記低周波数復号部、及び前記高周波数復号部のうち少なくとも一つより情報を受け取り、復号された低周波数信号及び生成された高周波数信号の時間包絡形状を決定する時間包絡形状決定部と、前記低周波数復号部から復号された低周波数信号を受け取り、前記高周波数復号部から生成された高周波数信号を受け取り、前記時間包絡形状決定部にて決定された時間包絡形状に基づき、前記復号された低周波数信号及び前記生成された高周波数信号の時間包絡形状を修正し出力する時間包絡修正部と、前記時間包絡修正部から時間包絡を修正された低周波数信号及び高周波数信号を受け取り、出力する音声信号を合成する低周波数／高周波数信号合成部と、を備えることを特徴とする。 The voice decoding device according to the sixth aspect is a voice decoding device that decodes a coded voice signal and outputs a voice signal, and at least encodes a coding sequence including the coded voice signal. A coded sequence demultiplexing unit that divides the coded sequence into a coded sequence containing information on the low frequency signal of the voice signal and a coded series containing information on the high frequency signal of the coded voice signal, and the code. A low-frequency decoding unit that receives a coded sequence containing information on the coded low-frequency signal from the coded-series demultiplexing unit and decodes it to obtain a low-frequency signal, a coded-series demultiplexing unit, and the low frequency section. A high frequency decoding unit that receives information from at least one of the frequency decoding units and generates a high frequency signal based on the information, the coded sequence demultiplexing unit, the low frequency decoding unit, and the high frequency decoding unit. A time-enclosed shape-determining unit that receives information from at least one of the units and determines the time-enclosed shape of the decoded low-frequency signal and the generated high-frequency signal, and a low-frequency signal decoded from the low-frequency decoding unit. Is received, the high frequency signal generated from the high frequency decoding unit is received, and the decoded low frequency signal and the generated high frequency signal are received based on the time inclusion shape determined by the time inclusion shape determination unit. A low-frequency / high-frequency signal that receives a low-frequency signal and a high-frequency signal whose time-wrapping has been corrected and outputs an audio signal from the time-wrapping correction section that corrects and outputs the time-wrapping shape of It is characterized by including a synthesis unit.

なお、第５の態様に係る音声復号装置において、前記高周波数復号部は、前記符号化系列逆多重化部、前記低周波数復号部及び前記低周波数時間包絡修正部のうち少なくとも一つより情報を受け取り、当該情報に基づいて高周波数信号を生成してもよい。 In the voice decoding device according to the fifth aspect, the high frequency decoding unit receives information from at least one of the coding sequence demultiplexing unit, the low frequency decoding unit, and the low frequency time envelope correction unit. It may be received and a high frequency signal may be generated based on the information.

また、第５の態様に係る音声復号装置において、前記高周波数時間包絡修正部は、前記時間包絡形状決定部にて決定された時間包絡形状に基づいて、前記高周波数復号部にて高周波数信号を生成する際の中間信号の時間包絡形状を修正し、前記高周波数復号部は、前記時間包絡形状を修正された前記中間信号を用いて、残存する高周波数信号を生成する処理を実施してもよい。 Further, in the voice decoding device according to the fifth aspect, the high frequency time wrapping correction unit is a high frequency signal in the high frequency decoding unit based on the time wrapping shape determined by the time wrapping shape determining unit. The time-wrapping shape of the intermediate signal at the time of generating is corrected, and the high-frequency decoding unit performs a process of generating a remaining high-frequency signal using the intermediate signal whose time-wrapping shape is corrected. May be good.

また、第６の態様に係る音声復号装置において、前記高周波数復号部は、前記符号化系列逆多重化部及び前記低周波数復号部のうち少なくとも一つより情報を受け取り、当該情報に基づいて高周波数信号を生成してもよい。 Further, in the voice decoding device according to the sixth aspect, the high frequency decoding unit receives information from at least one of the coded sequence demultiplexing unit and the low frequency decoding unit, and is high based on the information. A frequency signal may be generated.

また、第６の態様に係る音声復号装置において、前記時間包絡修正部は、前記時間包絡形状決定部にて決定された時間包絡形状に基づいて、前記高周波数復号部にて高周波数信号を生成する際の中間信号の時間包絡形状を修正し、前記高周波数復号部は、前記時間包絡形状を修正された前記中間信号を用いて、残存する高周波数信号を生成する処理を実施してもよい。 Further, in the voice decoding device according to the sixth aspect, the time wrapping correction unit generates a high frequency signal in the high frequency decoding unit based on the time wrapping shape determined by the time wrapping shape determining unit. The time-wrapping shape of the intermediate signal may be modified, and the high-frequency decoding unit may perform a process of generating a remaining high-frequency signal using the intermediate signal having the modified time-wrapping shape. ..

上述した第５及び第６の態様に係る音声復号装置の発明は、音声復号方法の発明として捉えることができ、以下のように記述することができる。 The invention of the audio decoding device according to the fifth and sixth aspects described above can be regarded as an invention of the audio decoding method, and can be described as follows.

第５の態様に係る音声復号方法は、符号化された音声信号を復号して音声信号を出力する音声復号装置、により実行される音声復号方法であって、前記符号化された音声信号を含む符号化系列を、少なくとも符号化された前記音声信号の低周波数信号の情報を含む符号化系列と、符号化された前記音声信号の高周波数信号の情報を含む符号化系列に分割する符号化系列逆多重化ステップと、分割により得られた前記符号化された低周波数信号の情報を含む符号化系列を受け取り、復号して低周波数信号を得る低周波数復号ステップと、前記符号化系列逆多重化ステップ及び前記低周波数復号ステップのうち少なくとも一つで得られた情報を受け取り、当該情報に基づいて高周波数信号を生成する高周波数復号ステップと、前記符号化系列逆多重化ステップ、前記低周波数復号ステップ、及び前記高周波数復号ステップのうち少なくとも一つで得られた情報を受け取り、復号された低周波数信号及び生成された高周波数信号の時間包絡形状を決定する時間包絡形状決定ステップと、前記時間包絡形状決定ステップにて決定された時間包絡形状に基づき前記復号された低周波数信号の時間包絡形状を修正し出力する低周波数時間包絡修正ステップと、前記時間包絡形状決定ステップにて決定された時間包絡形状に基づき前記生成された高周波数信号の時間包絡形状を修正し出力する高周波数時間包絡修正ステップと、前記低周波数時間包絡修正ステップで得られた時間包絡を修正された低周波数信号を受け取り、前記高周波数時間包絡修正ステップで得られた時間包絡を修正された高周波数信号を受け取り、出力する音声信号を合成する低周波数／高周波数信号合成ステップと、を備えることを特徴とする。 The voice decoding method according to the fifth aspect is a voice decoding method executed by a voice decoding device that decodes a coded voice signal and outputs a voice signal, and includes the coded voice signal. A coding sequence that divides the coding sequence into a coding sequence that includes at least information on the low frequency signal of the voice signal that has been encoded and a coding series that contains information on the high frequency signal of the encoded voice signal. The demultiplexing step, the low frequency decoding step of receiving the coded sequence including the information of the coded low frequency signal obtained by the division and decoding it to obtain the low frequency signal, and the demultiplexing of the coded sequence. A high frequency decoding step that receives information obtained in at least one of the steps and the low frequency decoding step and generates a high frequency signal based on the information, the coding sequence demultiplexing step, and the low frequency decoding. The time entrainment shape determination step, which receives the information obtained in at least one of the steps and the high frequency decoding step, and determines the time entrapment shape of the decoded low frequency signal and the generated high frequency signal, and the time. Time determined in the entrapment shape determination step The low frequency time entrapment correction step that corrects and outputs the time entrapment shape of the decoded low frequency signal based on the entrapment shape, and the time determined in the time encapsulation shape determination step. Receives the high frequency time wrapping correction step that corrects and outputs the time wrapping shape of the generated high frequency signal based on the wrapping shape, and the low frequency signal that corrects the time wrapping obtained in the low frequency time wrapping correction step. It is characterized by including a low frequency / high frequency signal synthesis step of receiving a high frequency signal obtained by correcting the time inclusion obtained in the high frequency time inclusion correction step and synthesizing an audio signal to be output.

第６の態様に係る音声復号方法は、符号化された音声信号を復号して音声信号を出力する音声復号装置、により実行される音声復号方法であって、前記符号化された音声信号を含む符号化系列を、少なくとも符号化された前記音声信号の低周波数信号の情報を含む符号化系列と、符号化された前記音声信号の高周波数信号の情報を含む符号化系列に分割する符号化系列逆多重化ステップと、分割により得られた前記符号化された低周波数信号の情報を含む符号化系列を受け取り、復号して低周波数信号を得る低周波数復号ステップと、前記符号化系列逆多重化ステップ及び前記低周波数復号ステップのうち少なくとも一つで得られた情報を受け取り、当該情報に基づいて高周波数信号を生成する高周波数復号ステップと、前記符号化系列逆多重化ステップ、前記低周波数復号ステップ、及び前記高周波数復号ステップのうち少なくとも一つで得られた情報を受け取り、復号された低周波数信号及び生成された高周波数信号の時間包絡形状を決定する時間包絡形状決定ステップと、前記低周波数復号ステップで得られた復号された低周波数信号を受け取り、前記高周波数復号ステップで得られた生成された高周波数信号を受け取り、前記時間包絡形状決定ステップにて決定された時間包絡形状に基づき、前記復号された低周波数信号及び前記生成された高周波数信号の時間包絡形状を修正し出力する時間包絡修正ステップと、前記時間包絡修正ステップで得られた時間包絡を修正された低周波数信号及び高周波数信号を受け取り、出力する音声信号を合成する低周波数／高周波数信号合成ステップと、を備えることを特徴とする。 The voice decoding method according to the sixth aspect is a voice decoding method executed by a voice decoding device that decodes a coded voice signal and outputs a voice signal, and includes the coded voice signal. A coding sequence that divides the coding sequence into a coding sequence that includes at least information on the low frequency signal of the voice signal that has been encoded and a coding series that contains information on the high frequency signal of the encoded voice signal. The demultiplexing step, the low frequency decoding step of receiving the coded sequence including the information of the coded low frequency signal obtained by the division and decoding it to obtain the low frequency signal, and the demultiplexing of the coded sequence. A high frequency decoding step that receives information obtained in at least one of the steps and the low frequency decoding step and generates a high frequency signal based on the information, the coding sequence demultiplexing step, and the low frequency decoding. The time entrainment shape determination step, which receives the information obtained in at least one of the steps and the high frequency decoding step, and determines the time entrapment shape of the decoded low frequency signal and the generated high frequency signal, and the low frequency decoding step. It receives the decoded low frequency signal obtained in the frequency decoding step, receives the generated high frequency signal obtained in the high frequency decoding step, and is based on the time entrapment shape determined in the time entrapment shape determination step. The time wrapping correction step of correcting and outputting the time wrapping shape of the decoded low frequency signal and the generated high frequency signal, and the time wrapping corrected low frequency signal obtained in the time wrapping correction step. It is characterized by including a low frequency / high frequency signal synthesis step of receiving a high frequency signal and synthesizing an output audio signal.

また、上述した第５及び第６の態様に係る音声復号装置の発明は、音声復号プログラムの発明として捉えることができ、以下のように記述することができる。 Further, the invention of the voice decoding device according to the fifth and sixth aspects described above can be regarded as the invention of the voice decoding program, and can be described as follows.

第５の態様に係る音声復号プログラムは、符号化された音声信号を復号して音声信号を出力する音声復号装置、に設けられたコンピュータを、前記符号化された音声信号を含む符号化系列を、少なくとも符号化された前記音声信号の低周波数信号の情報を含む符号化系列と、符号化された前記音声信号の高周波数信号の情報を含む符号化系列に分割する符号化系列逆多重化部と、前記符号化系列逆多重化部から前記符号化された低周波数信号の情報を含む符号化系列を受け取り、復号して低周波数信号を得る低周波数復号部と、前記符号化系列逆多重化部及び前記低周波数復号部のうち少なくとも一つより情報を受け取り、当該情報に基づいて高周波数信号を生成する高周波数復号部と、前記符号化系列逆多重化部、前記低周波数復号部、及び前記高周波数復号部のうち少なくとも一つより情報を受け取り、復号された低周波数信号及び生成された高周波数信号の時間包絡形状を決定する時間包絡形状決定部と、前記時間包絡形状決定部にて決定された時間包絡形状に基づき前記復号された低周波数信号の時間包絡形状を修正し出力する低周波数時間包絡修正部と、前記時間包絡形状決定部にて決定された時間包絡形状に基づき前記生成された高周波数信号の時間包絡形状を修正し出力する高周波数時間包絡修正部と、前記低周波数時間包絡修正部から時間包絡を修正された低周波数信号を受け取り、前記高周波数時間包絡修正部から時間包絡を修正された高周波数信号を受け取り、出力する音声信号を合成する低周波数／高周波数信号合成部、として機能させることを特徴とする。 The voice decoding program according to the fifth aspect uses a computer provided in a voice decoding device that decodes a coded voice signal and outputs a voice signal, and uses a coding sequence including the coded voice signal. , At least a coded sequence demultiplexing unit that divides into a coded sequence containing information on the low frequency signal of the voice signal and a coded series containing information on the high frequency signal of the coded voice signal. And the low frequency decoding unit that receives the coded sequence including the information of the encoded low frequency signal from the coded sequence demultiplexing unit and decodes it to obtain the low frequency signal, and the coded sequence demultiplexing unit. A high frequency decoding unit that receives information from at least one of the unit and the low frequency decoding unit and generates a high frequency signal based on the information, the coded sequence demultiplexing unit, the low frequency decoding unit, and The time wrapping shape determining unit that receives information from at least one of the high frequency decoding units and determines the time wrapping shape of the decoded low frequency signal and the generated high frequency signal, and the time wrapping shape determining unit The low frequency time wrapping correction unit that corrects and outputs the time wrapping shape of the decoded low frequency signal based on the determined time wrapping shape, and the generation based on the time wrapping shape determined by the time wrapping shape determination unit. A high frequency time wrapping correction unit that corrects and outputs the time wrapping shape of the high frequency signal, and a low frequency signal whose time wrapping is corrected from the low frequency time wrapping correction unit are received from the high frequency time wrapping correction unit. It is characterized in that it functions as a low-frequency / high-frequency signal synthesizer that receives a high-frequency signal with corrected time wrapping and synthesizes an output audio signal.

第６の態様に係る音声復号プログラムは、符号化された音声信号を復号して音声信号を出力する音声復号装置、に設けられたコンピュータを、前記符号化された音声信号を含む符号化系列を、少なくとも符号化された前記音声信号の低周波数信号の情報を含む符号化系列と、符号化された前記音声信号の高周波数信号の情報を含む符号化系列に分割する符号化系列逆多重化部と、前記符号化系列逆多重化部から前記符号化された低周波数信号の情報を含む符号化系列を受け取り、復号して低周波数信号を得る低周波数復号部と、前記符号化系列逆多重化部及び前記低周波数復号部のうち少なくとも一つより情報を受け取り、当該情報に基づいて高周波数信号を生成する高周波数復号部と、前記符号化系列逆多重化部、前記低周波数復号部、及び前記高周波数復号部のうち少なくとも一つより情報を受け取り、復号された低周波数信号及び生成された高周波数信号の時間包絡形状を決定する時間包絡形状決定部と、前記低周波数復号部から復号された低周波数信号を受け取り、前記高周波数復号部から生成された高周波数信号を受け取り、前記時間包絡形状決定部にて決定された時間包絡形状に基づき、前記復号された低周波数信号及び前記生成された高周波数信号の時間包絡形状を修正し出力する時間包絡修正部と、前記時間包絡修正部から時間包絡を修正された低周波数信号及び高周波数信号を受け取り、出力する音声信号を合成する低周波数／高周波数信号合成部、として機能させることを特徴とする。 The voice decoding program according to the sixth aspect uses a computer provided in a voice decoding device that decodes a coded voice signal and outputs a voice signal, and uses a coding sequence including the coded voice signal. , At least a coded sequence demultiplexing unit that divides into a coded sequence containing information on the low frequency signal of the voice signal and a coded series containing information on the high frequency signal of the coded voice signal. And the low frequency decoding unit that receives the coded sequence including the information of the encoded low frequency signal from the coded sequence demultiplexing unit and decodes it to obtain the low frequency signal, and the coded sequence demultiplexing unit. A high frequency decoding unit that receives information from at least one of the unit and the low frequency decoding unit and generates a high frequency signal based on the information, the coded sequence demultiplexing unit, the low frequency decoding unit, and A time-environment shape determining unit that receives information from at least one of the high-frequency decoding units and determines the time-environment shape of the decoded low-frequency signal and the generated high-frequency signal, and a low-frequency decoding unit that decodes the information. The high frequency signal generated from the high frequency decoding unit is received, and the decoded low frequency signal and the generated low frequency signal are received based on the time inclusion shape determined by the time inclusion shape determination unit. A low frequency that corrects the time wrapping shape of the high frequency signal and outputs it, and a low frequency that receives the low frequency signal and the high frequency signal with the corrected time wrapping from the time wrapping correction part and synthesizes the output audio signal. / It is characterized by functioning as a high frequency signal synthesizer.

本発明の音声復号装置は、符号化された音声信号を復号して音声信号を出力する音声復号装置であって、符号化された低周波数信号の情報を含む符号化系列を受け取り、復号して低周波数信号を得る低周波数復号部と、前記低周波数復号部より第１の情報を受け取り、当該第１の情報に基づいて高周波数信号を生成する高周波数復号部と、符号化装置から送信された第２の情報に基づいて、生成された高周波数信号の時間包絡形状を決定する高周波数時間包絡形状決定部と、前記高周波数時間包絡形状決定部にて決定された時間包絡形状に基づき前記生成された高周波数信号の時間包絡形状を修正し出力する高周波数時間包絡修正部と、前記低周波数復号部から低周波数信号を受け取り、前記高周波数時間包絡修正部から時間包絡形状を修正された高周波数信号を受け取り、前記低周波数信号と前記時間包絡形状を修正された高周波数信号とを合成することで、出力する音声信号を得る低周波数／高周波数信号合成部と、を備え、前記高周波数時間包絡修正部は、前記高周波数時間包絡形状決定部にて時間包絡形状が平坦であると決定された場合、前記生成された高周波数信号のうち、時間セグメント内の任意の前記生成された高周波数信号を使って時間包絡形状を修正し出力し、前記高周波数時間包絡修正部は、前記高周波数時間包絡形状決定部にて時間包絡形状が平坦であると決定された場合、ｘｄｅｃ（ｉ）（ｔ（ｌ）≦ｉ＜ｔ（ｌ＋１））を任意の時間セグメント内の高周波数信号としたときに、

を使って得られる信号を、時間包絡形状が修正された高周波数信号として出力する。 The voice decoding device of the present invention is a voice decoding device that decodes a coded voice signal and outputs a voice signal, and receives and decodes a coded sequence including information on a coded low frequency signal. It is transmitted from the encoding device, the low frequency decoding unit that obtains the low frequency signal, the high frequency decoding unit that receives the first information from the low frequency decoding unit and generates the high frequency signal based on the first information. Based on the second information, the high frequency time wrapping shape determining unit that determines the time wrapping shape of the generated high frequency signal, and the time wrapping shape determined by the high frequency time wrapping shape determining unit. The time wrapping shape of the generated high frequency signal is corrected and output, and the time wrapping shape is corrected by receiving the low frequency signal from the low frequency decoding unit and correcting the time wrapping shape from the high frequency time wrapping correction unit. A low-frequency / high-frequency signal synthesizer that receives a high-frequency signal and synthesizes the low-frequency signal and the high-frequency signal whose time-wrapping shape has been modified to obtain an output voice signal is provided. When the high frequency time wrapping shape determining unit determines that the time wrapping shape is flat, the frequency time wrapping correction unit may generate any of the generated high frequency signals in the time segment. When the time wrapping shape is corrected and output using the high frequency signal, and the high frequency time wrapping shape determining unit determines that the time wrapping shape is flat, the high frequency time wrapping correction unit uses xdec (i). ) (T (l) ≤ i <t (l + 1)) as a high frequency signal within an arbitrary time segment.

The signal obtained by using is output as a high frequency signal with the modified time envelope shape.

また、本発明の音声復号装置は、前記符号化された音声信号を含む符号化系列を、少なくとも、符号化された前記音声信号の低周波数信号の情報を含む符号化系列と、符号化された前記音声信号の高周波数信号の情報を含む符号化系列とに分割する符号化系列逆多重化部、をさらに備えることとしてもよい。 Further, the audio decoding device of the present invention encodes the coded sequence including the encoded audio signal with at least the coded sequence including the information of the low frequency signal of the encoded audio signal. A coded sequence demultiplexing unit that divides the audio signal into a coded sequence containing information on a high frequency signal may be further provided.

また、本発明の音声復号装置において、前記高周波数時間包絡修正部は、前記高周波数時間包絡形状決定部にて時間包絡形状が平坦であると決定された場合、ｘｄｅｃ（ｉ）（ｔ（ｌ）≦ｉ＜ｔ（ｌ＋１））を任意の時間セグメント内の高周波数信号としたときに、

を

で除した結果に基づいて得られる信号を、時間包絡形状が修正された高周波数信号として出力することとしてもよい。 Further, in the voice decoding apparatus of the present invention, when the high frequency time envelope correction unit determines that the time envelope shape is flat by the high frequency time envelope shape determination unit, xdec (i) (t (l) ) ≤i <t (l + 1)) as a high frequency signal within an arbitrary time segment

To

The signal obtained based on the result divided by may be output as a high frequency signal in which the time envelope shape is corrected.

1、10、11、12、13、14、15、15A、16、17、18、18A、100、110、120、130、140、150、160、170、180、190、190A、300、310、320、320A、330、340、350、350A、360、370、380、390…音声復号装置、1a、10d、13c…符号化系列解析部、1b…音声復号部、1c、16f、120f、360b…時間包絡形状決定部、1d、13a、13b、14a、15a、15aA、16c、17a、18a、18aA、300a、300aA、360a、360aA、370a、370aA、380a、380aA…時間包絡修正部、2、20、20A、21、22、23、24、25、26、27、28、200、210、220、230、240、250、260、270、280、290、400、410、420、430、440、450…音声符号化装置、2a…音声符号化部、2b、20g、20gA、21a、21aA、22b、22bA、22bB、23a、23aA、24c、25b、26a、26aA、27a、28a、270b、280a、290a、400a、410a、420a…時間包絡情報符号化部、2c、20h、200d、210b、220b、250b、250c、270c…符号化系列多重化部、10a、10aA、100a、110a、120a、150a、170a…符号化系列逆多重化部、10b…コア復号部、10c、20c、20c1…分析フィルタバンク部、10e、10eA、10eB、10eC、16b、100c、120c…低周波数時間包絡形状決定部、10f、12a、16e、100d、120e…低周波数時間包絡修正部、10g…高周波数信号生成部、10h…復号/逆量子化部、10i、25a…周波数包絡調整部、10j、170c…合成フィルタバンク部、13a、13aA、13aB、13aC、14b、16a、16d、110b、120b、120bA…高周波数時間包絡形状決定部、20a…ダウンサンプリング部、20b…コア符号化部、20d…制御パラメータ符号化部、20e、270d…包絡算出部、20f…量子化/符号化部、20i…コア復号信号生成部、20j、24b…サブバンド信号パワー算出部、22a、22a1、22aB…時間包絡算出部、24a、410b…擬似高周波数信号生成部、100b…低周波数復号部、100e、110e、130b…高周波数復号部、100f、150c…低周波数/高周波数信号合成部、110c、120d、130a、140a、140b…高周波数時間包絡修正部、150b、170b…スイッチ群、200a…低周波数符号化部、200b…高周波数符号化部、200c…低周波数時間包絡情報符号化部、210a、220a、230a…高周波数信号生成制御情報符号化部、250a、270a…高周波数信号生成制御情報符号化部、360b…時間包絡決定部。 1,10,11,12,13,14,15,15A,16,17,18,18A,100,110,120,130,140,150,160,170,180,190,190A,300,310, 320, 320A, 330, 340, 350, 350A, 360, 370, 380, 390 ... Voice decoding device, 1a, 10d, 13c ... Coded sequence analysis unit, 1b ... Voice decoding unit, 1c, 16f, 120f, 360b ... Time wrapping shape determination part, 1d, 13a, 13b, 14a, 15a, 15aA, 16c, 17a, 18a, 18aA, 300a, 300aA, 360a, 360aA, 370a, 370aA, 380a, 380aA ... Time wrapping correction part, 2, 20 , 20A, 21, 22, 23, 24, 25, 26, 27, 28, 200, 210, 220, 230, 240, 250, 260, 270, 280, 290, 400, 410, 420, 430, 440, 450 … Voice coding device, 2a… Voice coding unit, 2b, 20g, 20gA, 21a, 21aA, 22b, 22bA, 22bB, 23a, 23aA, 24c, 25b, 26a, 26aA, 27a, 28a, 270b, 280a, 290a , 400a, 410a, 420a ... Time-related information coding section, 2c, 20h, 200d, 210b, 220b, 250b, 250c, 270c ... Coded sequence multiplexing section, 10a, 10aA, 100a, 110a, 120a, 150a, 170a ... Coding sequence demultiplexing section, 10b ... Core decoding section, 10c, 20c, 20c1 ... Analytical filter bank section, 10e, 10eA, 10eB, 10eC, 16b, 100c, 120c ... Low frequency time entrainment shape determination section, 10f, 12a, 16e, 100d, 120e ... Low frequency time wrapping correction part, 10g ... High frequency signal generation part, 10h ... Decoding / dequantization part, 10i, 25a ... Frequency wrapping adjustment part, 10j, 170c ... Synthetic filter bank part, 13a, 13aA, 13aB, 13aC, 14b, 16a, 16d, 110b, 120b, 120bA ... High frequency time entrainment shape determination unit, 20a ... Downsampling unit, 20b ... Core coding unit, 20d ... Control parameter coding unit, 20e , 270d ... Encapsulation calculation unit, 20f ... Quantization / coding unit, 20i ... Core decoding signal generation unit, 20j, 24b ... Subband signal power calculation unit, 22a, 22a1, 22aB ... Time envelopment calculation unit, 24a, 410b ... Pseudo high frequency signal generator, 100b ... low Frequency decoding unit, 100e, 110e, 130b ... High frequency decoding unit, 100f, 150c ... Low frequency / high frequency signal synthesis unit, 110c, 120d, 130a, 140a, 140b ... High frequency time wrapping correction unit, 150b, 170b ... Switch Group, 200a ... Low frequency coding unit, 200b ... High frequency coding unit, 200c ... Low frequency time wrapping information coding unit, 210a, 220a, 230a ... High frequency signal generation control information coding unit, 250a, 270a ... High Frequency signal generation control information coding unit, 360b ... Time wrapping determination unit.

Claims

An audio coding device that encodes an input audio signal and outputs a coded sequence.
An audio coding unit that encodes the audio signal,
A time-envelope information coding unit that calculates and encodes the time-envelope information of the audio signal,
A coding sequence multiplexing unit that multiplexes a coding sequence including the voice signal obtained by the voice coding unit and a coding series of time-wrapping information obtained by the time-wrapping information coding unit.
With
The time envelope information is generated based on the ratio of the arithmetic mean to the geometric mean of the time envelope of the high frequency signal of the audio signal .
The time envelope information indicates whether or not the time envelope shape is flat.
Voice coding device.

The voice coding device according to claim 1, wherein the time envelope information is represented by 1 bit.

A voice coding method executed by a voice coding device that encodes an input voice signal and outputs a coded sequence.
A voice coding step for encoding the voice signal and
A time-envelope information coding step for calculating and encoding the time-envelope information of the audio signal,
A coding sequence multiplexing step that multiplexes a coding sequence including the voice signal obtained in the voice coding step and a coding sequence of time-wrapping information obtained in the time-wrapping information coding step.
With
The time envelope information is generated based on the ratio of the arithmetic mean to the geometric mean of the time envelope of the high frequency signal of the audio signal .
The time envelope information indicates whether or not the time envelope shape is flat.
Voice coding method.

The voice coding method according to claim 3, wherein the time envelope information is represented by 1 bit.