JP2005284162A

JP2005284162A - Signal coding device, signal decoding device, signal coding method, and signal decoding method

Info

Publication number: JP2005284162A
Application number: JP2004100895A
Authority: JP
Inventors: Kei Kikuiri; 圭菊入; Nobuhiko Naka; 信彦仲; Tomoyuki Oya; 智之大矢
Original assignee: NTT Docomo Inc
Current assignee: NTT Docomo Inc
Priority date: 2004-03-30
Filing date: 2004-03-30
Publication date: 2005-10-13

Abstract

<P>PROBLEM TO BE SOLVED: To improve a compression efficiency in the transformational coding of a signal by localizing a transformational coefficient with respect to a transformation rule such as a DCT (Discrete Cosine Transform) and an MDCT (Modified Discrete Cosine Transform) having no phase information clearly. <P>SOLUTION: The signal coding device 10 changes the timewise haracteristic of frame of an input signal divided by a frame dividing part 11 to a plurality of change values by using a timewise property changing part 13. A frame evaluation value calculating part 15 calculates evaluation values of the frames whose timewize properties are changed based on a prescribed criterion for evaluation. A signal coding part 17 transforms and codes a signal in which timewise properties of the frames are changed by optimum change quantities selected by a timewise property-change quantity selecting part 16. A coded series outputting part 18 outputs the transformed and coded signal as a signal coded series. <P>COPYRIGHT: (C)2006,JPO&NCIPI

Description

本発明は、オーディオ信号などの信号を符復号化する技術に関する。 The present invention relates to a technique for codec decoding a signal such as an audio signal.

近年、信号処理技術の発達に伴い、オーディオ信号を高効率に圧縮符号化する技術は数多く提案されている。音声・音響符号化方式は、時間領域で符号化する方式（時間符号化方式）と、所定の変換規則に則って変換した後に、その変換領域で符号化する方式（変換符号化方式）とに大別することができる。この変換規則としては、例えば、ＤＦＴ（Discrete Fourier Transform）やＤＣＴ（Discrete Cosine Transform）などのような、時間領域と周波数領域との間で変換する規則（時間周波数変換）が一般的に利用されている。 In recent years, with the development of signal processing technology, many technologies for compressing and encoding audio signals with high efficiency have been proposed. The speech / acoustic encoding method is divided into a method of encoding in the time domain (time encoding method) and a method of encoding in the conversion domain after conversion according to a predetermined conversion rule (conversion encoding method). It can be divided roughly. As this conversion rule, for example, a rule for converting between the time domain and the frequency domain (time frequency conversion) such as DFT (Discrete Fourier Transform) and DCT (Discrete Cosine Transform) is generally used. Yes.

時間周波数変換では、時間領域において、ある個数のサンプルをまとめて変換フレームを構成し、この変換フレームに対して上記変換規則を適用することで、周波数領域での変換係数を得る。この変換係数を圧縮符号化する方式が変換符号化方式である。変換符号化方式を用いた代表的なオーディオ信号符号化技術としては、ＩＳＯ／ＩＥＣで規格化されたＭＰＥＧ-２ＡｕｄｉｏＡＡＣ（Moving Picture Expert Group-2 Advanced Audio Coding）を挙げることができる（例えば、非特許文献１参照。）。ＡＡＣでは、修正離散コサイン変換（ＭＤＣＴ：Modified Discrete Cosine Transform）と呼ばれる変換規則が用いられており、この変換規則も時間周波数変換である。 In the time-frequency transform, a transform frame in the frequency domain is obtained by forming a transform frame by collecting a certain number of samples in the time domain and applying the transform rule to the transform frame. A method of compressing and encoding the transform coefficient is a transform coding method. As a typical audio signal coding technique using the transform coding system, there can be mentioned MPEG-2 Audio AAC (Moving Picture Expert Group-2 Advanced Audio Coding) standardized by ISO / IEC (for example, (Refer nonpatent literature 1.). In AAC, a conversion rule called Modified Discrete Cosine Transform (MDCT) is used, and this conversion rule is also a time-frequency conversion.

また、非特許文献２においては、時間領域の符号化であるピッチパルス駆動音源とＬＰＣ（Linear Predictive Coding）フィルタとに基づく分析・合成系の符号化処理に際して、位相等化を用いる方法が開示されている。通常、ピッチパルス駆動音源とＬＰＣフィルタとの合成では、位相特性が最小位相特性に限定されるため、原音声との整合性をとることが困難である。そこで、非特許文献２に開示の技術では、音声の位相は変形しても主観品質の変化が少ないという利点に鑑みて、原音声を最小位相に位相等化した音声と、ピッチパルス駆動音源と、ＬＰＣフィルタの合成音声との整合を図っている。 Non-Patent Document 2 discloses a method that uses phase equalization in encoding processing of an analysis / synthesis system based on a pitch pulse driving sound source and LPC (Linear Predictive Coding) filter, which are time domain encoding. ing. Usually, in the synthesis of the pitch pulse driving sound source and the LPC filter, the phase characteristic is limited to the minimum phase characteristic, and it is difficult to achieve consistency with the original voice. Therefore, in the technology disclosed in Non-Patent Document 2, in view of the advantage that there is little change in subjective quality even when the phase of the audio is deformed, the audio obtained by phase equalizing the original audio to the minimum phase, And matching with the synthesized speech of the LPC filter.

更に、特許文献１には、変換符号化に際して、フレーム内の途中に急峻な立ち上がり（以下、「アタック」と記す。）が存在する場合に発生するプリエコーを回避する方法が開示されている。このプリエコーは、アタックの前に存在する、信号レベルの比較的小さな部分に生じる歪みであり、耳障りな音として知覚されやすい。特許文献１に記載の技術では、フレーム内の途中に存在するアタックの位置が、フレームの開始位置となるように制御することで、プリエコーを回避する。 Further, Patent Document 1 discloses a method for avoiding a pre-echo that occurs when there is a steep rise (hereinafter referred to as “attack”) in the middle of a frame during transform encoding. This pre-echo is a distortion that occurs in a relatively small portion of the signal level existing before the attack, and is easily perceived as an annoying sound. In the technique described in Patent Document 1, pre-echo is avoided by controlling the position of an attack existing in the middle of a frame to be the start position of the frame.

特開２０００−１９６４５２号公報JP 2000-196252 A ISO/IEC JTC1/SC29/WG11 MPEG. International Standard IS 13818-7 Information Technology -Generic Coding of Moving Pictures and Associated Audio, Part 7: MPEG-2 AAC, 1997.ISO / IEC JTC1 / SC29 / WG11 MPEG.International Standard IS 13818-7 Information Technology -Generic Coding of Moving Pictures and Associated Audio, Part 7: MPEG-2 AAC, 1997. 誉田雅彰、“位相等化音源による音声分析合成系”、電子情報通信学会音声研究会技術報告 SP89-124，1989.Masaaki Honda, “Speech analysis and synthesis system using phase equalized sound source”, IEICE Technical Report SP89-124, 1989.

しかしながら、上記従来技術には、以下に説明するような問題点があった。
まず、非特許文献１に関して、ＤＦＴの変化係数には、変換基底の振幅情報及び位相情報が含まれるのに対して、ＭＤＣＴの変化係数には、変換基底の振幅情報のみが含まれる。この点においては、ＤＣＴの変化係数に関してもＭＤＣＴと同様である。このため、入力信号に含まれている信号成分の位相が、この信号成分と周波数を同一とする変換基底の位相と一致している場合には、当該変換基底のみにより信号成分を表現できる。ところが、これらの位相がずれている場合には、当該変換基底のみならず他の周波数の変換基底を参照しない限り、信号成分を表現することはできない。その結果、変換係数は、上記信号成分の周波数に局所化することなく、他の周波数にまで拡散してしまい、圧縮効果が低減するという問題がある。 However, the above prior art has the following problems.
First, regarding Non-Patent Document 1, the DFT change coefficient includes amplitude information and phase information of the conversion base, while the MDCT change coefficient includes only the amplitude information of the conversion base. In this respect, the coefficient of change of DCT is the same as that of MDCT. For this reason, when the phase of the signal component included in the input signal matches the phase of the conversion base having the same frequency as that of the signal component, the signal component can be expressed only by the conversion base. However, when these phases are shifted, a signal component cannot be expressed unless the conversion bases of other frequencies are referred to as well. As a result, there is a problem that the transform coefficient is diffused to other frequencies without being localized to the frequency of the signal component, and the compression effect is reduced.

また、非特許文献２に記載の方法は、上述のように、ピッチパルス駆動音源とＬＰＣフィルタとに基づく分析・合成系の符号化の位相特性に鑑みて、原音声を位相等化する方法である。しかし、この方法は、入力信号の変換係数の局所化を行うものではないので、変換符号化における圧縮効果は上がらない。また、位相等化のパラメータを決定する際に、所与の評価基準を用いてパラメータ毎に評価値を算出し、その評価値を比較した結果に基づいて、パラメータを決定するという処理は行われない。 Further, as described above, the method described in Non-Patent Document 2 is a method for phase equalizing the original speech in view of the phase characteristics of the analysis / synthesis coding based on the pitch pulse driving sound source and the LPC filter. is there. However, since this method does not localize the transform coefficient of the input signal, the compression effect in transform coding does not increase. Also, when determining the phase equalization parameter, a process is performed in which an evaluation value is calculated for each parameter using a given evaluation criterion, and the parameter is determined based on a result of comparison of the evaluation value. Absent.

更に、特許文献１に記載の技術は、フレーム内の途中にアタックが存在するという特別な状況を想定したものであり、当該文献には、アタックが存在しない状況に関しては何ら言及されていない。 Furthermore, the technique described in Patent Document 1 assumes a special situation in which an attack exists in the middle of a frame, and the document does not mention anything about a situation in which no attack exists.

そこで、本発明の課題は、上述の問題点に鑑みて、ＤＣＴやＭＤＣＴを始めとする、位相情報を明示的にもたない変換規則に関しても、変換係数の局在化を可能とすることで、変換符号化における圧縮効率を向上することである。 Therefore, in view of the above-described problems, an object of the present invention is to enable localization of transform coefficients for transform rules that do not explicitly include phase information, such as DCT and MDCT. It is to improve the compression efficiency in transform coding.

本発明に係る信号符号化装置は、入力された信号をフレームに分割する分割手段と、前記フレームの時間的特性を複数の変更量により変更する変更手段と、所定の評価基準に基づいて、時間的特性が変更された前記フレームの評価値を算出する算出手段と、算出された前記評価値に基づいて、前記複数の変更量の中から最適な変更量を選択する選択手段と、選択された前記最適な変更量だけフレームの時間的特性が変更された信号を変換符号化する符号化手段と、変換符号化された前記信号を信号符号化系列として出力する出力手段とを備える。 A signal encoding apparatus according to the present invention includes a dividing unit that divides an input signal into frames, a changing unit that changes temporal characteristics of the frame by a plurality of changing amounts, and a time based on a predetermined evaluation criterion. A calculation unit that calculates an evaluation value of the frame in which a characteristic is changed, a selection unit that selects an optimal change amount from the plurality of change amounts based on the calculated evaluation value, and Coding means for transform-coding the signal whose temporal characteristics of the frame have been changed by the optimum change amount, and output means for outputting the transform-coded signal as a signal coding sequence.

本発明に係る信号符号化方法は、入力された信号をフレームに分割する分割ステップと、前記フレームの時間的特性を複数の変更量により変更する変更ステップと、所定の評価基準に基づいて、時間的特性が変更された前記フレームの評価値を算出する算出ステップと、算出された前記評価値に基づいて、前記複数の変更量の中から最適な変更量を選択する選択ステップと、選択された前記最適な変更量だけフレームの時間的特性が変更された信号を変換符号化する符号化ステップと、変換符号化された前記信号を信号符号化系列として出力する出力ステップとを含む。 A signal encoding method according to the present invention includes a dividing step of dividing an input signal into frames, a changing step of changing temporal characteristics of the frame by a plurality of changing amounts, and a time based on a predetermined evaluation criterion. A calculation step for calculating an evaluation value of the frame in which a characteristic is changed, a selection step for selecting an optimal change amount from the plurality of change amounts based on the calculated evaluation value, and An encoding step for transform-encoding the signal in which the temporal characteristic of the frame is changed by the optimum change amount; and an output step for outputting the transform-encoded signal as a signal encoding sequence.

本発明に係る信号符号化装置において好ましくは、前記フレームの時間的特性を変更するか否かを判定する判定手段を更に備え、前記変更手段は、前記判定手段により時間的特性を変更すると判定されたフレームの時間的特性を複数の変更量により変更し、前記符号化手段は、前記判定手段により時間的特性を変更すると判定されたフレームに関しては、選択された前記最適な変更量だけ当該フレームの時間的特性が変更された信号を変換符号化すると共に、前記判定手段により時間的特性を変更しないと判定されたフレームに関しては、当該フレームの信号をそのまま変換符号化する。 Preferably, the signal encoding device according to the present invention further includes a determination unit that determines whether or not to change the temporal characteristic of the frame, and the changing unit is determined to change the temporal characteristic by the determination unit. The temporal characteristics of the received frame are changed by a plurality of change amounts, and the encoding means, with respect to the frame determined to change the temporal characteristics by the determination means, only the selected optimal change amount of the frame. The signal with the temporal characteristic changed is transform-coded, and for the frame determined not to change the temporal characteristic by the determination means, the signal of the frame is directly transform-coded.

本発明に係る信号符号化装置において、前記符号化手段は、前記フレームの信号に併せて、前記最適な変更量を符号化し、前記出力手段は、前記信号符号化系列に併せて、前記最適な変更量を示す変更量符号化系列を出力するものとしてもよい。
本発明に係る信号符号化装置の符号化手段は、離散コサイン変換係数、修正離散コサイン変換係数の何れかを符号化することができる。 In the signal encoding device according to the present invention, the encoding means encodes the optimal change amount in conjunction with the signal of the frame, and the output means in combination with the signal encoded sequence A change amount coded sequence indicating the change amount may be output.
The encoding means of the signal encoding apparatus according to the present invention can encode either the discrete cosine transform coefficient or the modified discrete cosine transform coefficient.

本発明に係る信号符号化装置の算出手段は、所定の変換規則に基づく変換利得を算出することができる。
また、前記算出手段は、聴覚心理モデルに基づく聴覚エントロピを算出するものとしてもよい。
更に、前記算出手段は、符復号信号（符号化復号信号とも称する。）の歪み信号に基づく客観評価値を算出することもできる。 The calculation means of the signal encoding device according to the present invention can calculate a conversion gain based on a predetermined conversion rule.
The calculation means may calculate auditory entropy based on an auditory psychological model.
Furthermore, the calculation means can also calculate an objective evaluation value based on a distortion signal of a code decoded signal (also referred to as an encoded decoded signal).

本発明に係る信号符号化装置の算出手段は、符復号信号の歪み量を算出するものとしてもよい。当該符復号信号の歪み量は、例えば、聴覚重み付けされた歪み量である。 The calculation means of the signal encoding apparatus according to the present invention may calculate the distortion amount of the codec signal. The distortion amount of the codec signal is, for example, an auditory weighted distortion amount.

本発明に係る信号復号装置は、入力された信号符号化系列を復号信号に復号する復号手段と、入力された変更量符号化系列を復号することにより、位相変更量、サンプル削除量、サンプル挿入量、時間的伸縮変更量のうち、少なくとも１つを含む復号変更量を生成する生成手段と、前記復号変更量に基づいて、前記復号信号の時間的特性を変更する変更手段と、前記変更手段により時間的特性が変更された復号信号を出力する出力手段とを備える。 The signal decoding apparatus according to the present invention includes a decoding unit that decodes an input signal encoded sequence into a decoded signal, and a phase change amount, a sample deletion amount, and a sample insertion by decoding the input change amount encoded sequence. Generating means for generating a decoding change amount including at least one of the amount and the temporal expansion / contraction change amount, a changing means for changing a temporal characteristic of the decoded signal based on the decoding change amount, and the changing means And outputting means for outputting a decoded signal whose temporal characteristics are changed.

本発明に係る信号復号方法は、入力された信号符号化系列を復号信号に復号する復号ステップと、入力された変更量符号化系列を復号することにより、位相変更量、サンプル削除量、サンプル挿入量、時間的伸縮変更量のうち、少なくとも１つを含む復号変更量を生成する生成ステップと、前記復号変更量に基づいて、前記復号信号の時間的特性を変更する変更ステップと、前記変更ステップにて時間的特性が変更された復号信号を出力する出力ステップとを含む。 The signal decoding method according to the present invention includes a decoding step for decoding an input signal encoded sequence into a decoded signal, and a phase change amount, a sample deletion amount, and a sample insertion by decoding the input change amount encoded sequence. A generation step for generating a decoding change amount including at least one of the amount and a temporal expansion / contraction change amount, a change step for changing a temporal characteristic of the decoded signal based on the decoding change amount, and the change step And outputting a decoded signal whose temporal characteristics have been changed.

これらの発明によれば、入力信号が一旦フレームに分割された後に、これらのフレームに時間的特性（例えば位相）の変更処理が施されて符号化される。これにより、変換係数が局在化される。したがって、位相情報をもたない変換規則に関しても、変換符号化における圧縮効率が向上する。更に、時間的特性を変更するか否かの判定をフレーム単位で行うものとすれば、必要のないフレームに関してまで時間的特性の変更を行うことに伴う、処理量の増大を抑制することができる。 According to these inventions, after the input signal is once divided into frames, these frames are subjected to the temporal characteristic (for example, phase) changing process and encoded. Thereby, the conversion coefficient is localized. Therefore, the compression efficiency in transform coding is improved even for transform rules that do not have phase information. Furthermore, if the determination as to whether or not to change the temporal characteristics is performed in units of frames, it is possible to suppress an increase in the amount of processing associated with changing the temporal characteristics even with respect to unnecessary frames. .

本発明によれば、時間的特性を変更して変換係数を局在化させることにより、ＤＣＴやＭＤＣＴのような位相情報を明示的にもたない変換規則に関しても、変換符号化における圧縮効率を向上することができる。 According to the present invention, by changing the temporal characteristics to localize the transform coefficient, the compression efficiency in transform coding can be improved even for transform rules having no phase information such as DCT and MDCT. Can be improved.

［第１の実施形態］
以下、例示のみの為に添付された図面を参照しながら、本発明の第１の実施形態について説明する。まず、本実施の形態における信号符号化装置１０の構成を説明する。図１に示すように、信号符号化装置１０は、フレーム分割部１１（分割手段に対応）と時間的特性変更量指示部１２と時間的特性変更部１３（変更手段に対応）と時間的特性変更信号保持部１４とフレーム評価値算出部１５（算出手段に対応）と時間的特性変更量選択部１６（選択手段に対応）と信号符号化部１７（符号化手段に対応）と符号化系列出力部１８（出力手段に対応）とを備える。これら各部はバスを介して接続されている。 [First Embodiment]
Hereinafter, a first embodiment of the present invention will be described with reference to the accompanying drawings for illustration only. First, the configuration of the signal encoding device 10 in the present embodiment will be described. As shown in FIG. 1, the signal encoding apparatus 10 includes a frame dividing unit 11 (corresponding to the dividing unit), a temporal characteristic change amount indicating unit 12, a temporal characteristic changing unit 13 (corresponding to the changing unit), and a temporal characteristic. Change signal holding unit 14, frame evaluation value calculation unit 15 (corresponding to calculation means), temporal characteristic change amount selection unit 16 (corresponding to selection means), signal encoding unit 17 (corresponding to encoding means), and encoded sequence And an output unit 18 (corresponding to output means). These units are connected via a bus.

以下、信号符号化装置１０の各構成要素について具体的に説明する。
フレーム分割部１１は、入力信号を所定長Ｎのフレームに分割する。本実施の形態では、入力信号は、ＡＤ変換後のオーディオ信号つまりデジタル信号とする。
時間的特性変更量指示部１２には、Ｋ個（Ｋは自然数）の時間的特性変更量が予め保持されている。時間的特性変更量指示部１２は、これらの時間的特性変更量の中から１つの時間的特性変更量ｃ_ｋ（ｋ＝０，１，…，Ｋ−１）分の時間的特性の変更を時間的特性変更部１３に指示する。 Hereinafter, each component of the signal encoding device 10 will be specifically described.
The frame dividing unit 11 divides the input signal into frames having a predetermined length N. In this embodiment, the input signal is an audio signal after AD conversion, that is, a digital signal.
The temporal characteristic change amount instruction unit 12 holds K (K is a natural number) temporal characteristic change amounts in advance. The temporal characteristic change amount instruction unit 12 changes the temporal characteristic for one temporal characteristic change amount c _k (k = 0, 1,..., K−1) from among these temporal characteristic change amounts. The time characteristic changing unit 13 is instructed.

時間的特性変更部１３は、分割されたフレームのうちｍ番目の入力フレーム信号ｘ_ｍの時間的特性を、時間的特性変更量指示部１２から指示された時間的特性変更量だけ変更する。時間的特性は、例えば入力信号の位相であり、この場合、時間的特性変更量は、入力信号の位相変更量となる。時間的特性変更部１３は、時間的特性が変更されたフレームを時間的特性変更フレーム信号ｘ_ｍｋとして出力する。 Temporal characteristic change unit 13, the m-th temporal characteristics of the input frame signal x _m of the divided frames, changing only the temporal characteristic change amount instructed by the temporal characteristic change amount instruction section 12. The temporal characteristic is, for example, the phase of the input signal. In this case, the temporal characteristic change amount is the phase change amount of the input signal. The temporal characteristic changing unit 13 outputs a frame whose temporal characteristic has been changed as a temporal characteristic changing frame signal x _mk .

時間的特性変更信号保持部１４には、時間的特性変更部１３から入力された時間的特性変更フレーム信号ｘ_ｍｋが保持される。また、時間的特性変更信号保持部１４は、入力フレーム信号ｘ_ｍに関する全ての時間的特性変更量ｃ_ｋのうち、最適の時間的特性変更量ｃ_{ｍ，ｓｌｔ}を採る時の時間的特性変更フレーム信号ｘ_{ｍ，ｓｌｔ}を信号符号化部１７に出力する。 The temporal characteristic change signal holding unit 14 holds the temporal characteristic change frame signal x _mk input from the temporal characteristic changing unit 13. Further, the temporal characteristic change signal holding section 14, of all the temporal characteristic change amount c _k for the input frame signal x _m, temporal characteristic change amount of the optimum c _m, temporal characteristic change frame when taking _slt The signals x _{m and slt} are output to the signal encoding unit 17.

フレーム評価値算出部１５は、時間的特性変更部１３より入力された時間的特性変更フレーム信号ｘ_ｍｋから、所定の算定式を用いてフレーム評価値を算出する。フレーム評価値としては、例えば、変換係数の局所化の度合いを表す変換利得を使用することができる。変換規則としてＤＣＴ（Discrete Cosine Transform）を適用した場合、変換係数ｇ_ｍｋは、ＤＣＴ係数Ｘ_ｍｋｎ（ｎ＝０，１，…，Ｎ−１）を用いて、次式（１）により算定することができる。

The frame evaluation value calculation unit 15 calculates a frame evaluation value from the temporal characteristic change frame signal x _mk input from the temporal characteristic change unit 13 using a predetermined calculation formula. As the frame evaluation value, for example, a conversion gain representing the degree of localization of the conversion coefficient can be used. When DCT (Discrete Cosine Transform) is applied as a conversion rule, the conversion coefficient g _mk is calculated by the following equation (1) using the DCT coefficient X _mkn (n = 0, 1,..., N−1). Can do.

フレーム評価値としては、変換利得以外のパラメータを利用することもできる。その例が、聴覚エントロピである。聴覚エントロピは、聴覚的に雑音が知覚されないように符号化するために必要な情報量の最小値である。聴覚エントロピの詳細な算出手法に関しては、例えば、文献「Johnston, J. D., “Estimation of perceptual entropy using noise masking criteria”, Proc. of IEEE Int. Conf. Acoust., Speech and Signal Proc., pp. 2524-2527，1988.」に記載されている。
また、変換規則には、ＭＤＣＴ（Modified Discrete Cosine Transform）を用いてもよい。 Parameters other than the conversion gain can be used as the frame evaluation value. An example is auditory entropy. Auditory entropy is the minimum value of information necessary for encoding so that noise is not perceptually perceived. For a detailed method for calculating auditory entropy, see, for example, the document “Johnston, JD,“ Estimation of perceptual entropy using noise masking criteria ”, Proc. Of IEEE Int. Conf. Acoust., Speech and Signal Proc., Pp. 2524- 2527, 1988. ”.
Further, MDCT (Modified Discrete Cosine Transform) may be used as the conversion rule.

フレーム評価値は、時間的特性変更量指示部１２に保持されている全ての時間的特性変更量による時間的特性変更フレーム信号に関して算出される。すなわち、フレーム評価値算出部１５は、時間的特性変更量指示部１２に保持されているＫ個の時間的特性変更量ｃ_ｋのそれぞれに関して、フレーム評価値Ｅ_ｍｋの算出を行う。 The frame evaluation value is calculated for the temporal characteristic change frame signal based on all temporal characteristic change amounts held in the temporal characteristic change amount instruction unit 12. That is, the frame evaluation value calculation unit 15 calculates the frame evaluation value E _mk for each of the K temporal characteristic change amounts _ck held in the temporal characteristic change amount instruction unit 12.

時間的特性変更量選択部１６は、入力フレーム信号ｘ_ｍについて全ての時間的特性変更量ｃ_ｋに対応するフレーム評価値Ｅ_ｍｋが算出された後に、フレーム評価値が最も高い値を採る時間的特性変更量を最適変更量に選択する。フレーム評価値が変換利得の場合には、対応する変換利得が最も大きい時間的特性変更量が最適変更量として選択される。時間的特性変更量選択部１６は、選択した時間的特性変更量の最適変更量ｃ_{ｍ，ｓｌｔ}を時間的特性変更信号保持部１４に通知する。 Temporal characteristic change amount selector 16, after the frame evaluation value E _mk for all temporal characteristic change amount c _k for an input frame signal x _m is calculated, the time frame evaluation value takes the highest value Select the characteristic change amount as the optimal change amount. When the frame evaluation value is the conversion gain, the temporal characteristic change amount having the largest corresponding conversion gain is selected as the optimum change amount. The temporal characteristic change amount selection unit 16 notifies the temporal characteristic change signal holding unit 14 of the optimal change amount _{cm, slt} of the selected temporal characteristic change amount.

信号符号化部１７は、時間的特性変更フレーム信号ｘ_{ｍ，ｓｌｔ}を符号化して信号符号化系列を生成し、後段の符号化系列出力部１８に出力する。信号符号化部１７は、変換符号化に際して、ＭＰＥＧ-２のＡＡＣ（Advanced Audio Coding）を用いることができる。
符号化系列出力部１８は、信号符号化部１７から入力された信号符号化系列を外部装置に出力する。 The signal encoding unit 17 encodes the temporal characteristic change frame signal x _{m, slt} to generate a signal encoded sequence, and outputs the signal encoded sequence to the _subsequent encoded sequence output unit 18. The signal encoding unit 17 can use MPEG-2 Advanced Audio Coding (AAC) in MPEG-2 for transform encoding.
The encoded sequence output unit 18 outputs the signal encoded sequence input from the signal encoding unit 17 to an external device.

なお、時間的特性変更信号保持部１４に保持されている信号は、時間信号でなくてもよい。すなわち、フレーム評価値算出部１５にて時間的特性変更信号の変換係数が算出され、この変換係数が信号符号化部１７で利用される場合には、その変換係数を時間的特性変更信号保持部１４に保持させるものとしてもよい。この場合、信号符号化部１７において変換係数を算出する必要が無くなるという効果がある。 The signal held in the temporal characteristic change signal holding unit 14 may not be a time signal. That is, when the frame evaluation value calculation unit 15 calculates the transform coefficient of the temporal characteristic change signal and this transform coefficient is used in the signal encoding unit 17, the transform coefficient is used as the temporal characteristic change signal holding unit. 14 may be held. In this case, there is an effect that it is not necessary to calculate the transform coefficient in the signal encoding unit 17.

次に、図２を参照しながら、本実施の形態における信号符号化装置１０の動作、併せて、本発明に係る信号符号化方法を構成する各ステップについて説明する。動作説明の前提として、時間的特性変更量指示部１２には、Ｋ個の時間的特性変更量ｃ_ｋ（ｋ＝０，１，…，Ｋ−１）が保持されているものとする。 Next, with reference to FIG. 2, the operation of the signal encoding apparatus 10 according to the present embodiment and each step constituting the signal encoding method according to the present invention will be described. As a premise for explaining the operation, it is assumed that the temporal characteristic change amount instruction unit 12 holds K temporal characteristic change amounts c _k (k = 0, 1,..., K−1).

まず、入力信号が、所定のデータ長Ｎを有するフレームに分割されると（Ｓ１）、時間的特性変更量を特定するインデックスｋに初期値として“０”が設定される（Ｓ２）。続いて、時間的特性変更量の数Ｋとインデックスｋとの大小関係が比較される（Ｓ３）。ｋ＜Ｋが成立する場合には（Ｓ３；ＹＥＳ）、入力フレーム信号ｘ_ｍの時間的特性は時間的特性変更量ｃ_ｋにより変更され、その結果、時間的特性変更フレーム信号ｘ_ｍｋが生成及び保持される（Ｓ４）。 First, when an input signal is divided into frames having a predetermined data length N (S1), “0” is set as an initial value to an index k for specifying a temporal characteristic change amount (S2). Subsequently, the magnitude relationship between the number K of temporal characteristic change amounts and the index k is compared (S3). if k <K is satisfied (S3; YES), the temporal characteristics of the input frame signal x _m is changed by the temporal characteristic change amount c _k, as a result, generation and temporal characteristic change frame signal x _mk It is held (S4).

Ｓ５では、Ｓ４で生成された時間的特性変更フレーム信号ｘ_ｍｋのフレーム評価値Ｅ_ｍｋが算出される。その後、ｋに１が加算され（Ｓ６）、Ｓ３〜Ｓ６の一連の処理は、ｋ＝Ｋが成立するまで繰り返し実行される。 In S5, the frame evaluation value E _{mk of} the temporal characteristic change frame signal x _mk generated in S4 is calculated. Thereafter, 1 is added to k (S6), and a series of processes from S3 to S6 are repeatedly executed until k = K is established.

ｋ＜Ｋが成立しない場合、すなわちｋ≧Ｋが成立する場合には（Ｓ３；ＮＯ）、Ｓ４で使用された各時間的特性変更量ｃ_ｋに対応するフレーム評価値Ｅ_ｍｋの中から最も評価の高い値Ｅ_{ｍ，ｓｌｔ}が選択される。そして、これに対応する時間的特性変更量ｃ_{ｍ，ｓｌｔ}が最適変更量として選択される（Ｓ７）。Ｓ８では、時間的特性変更量ｃ_{ｍ，ｓｌｔ}に対応する時間的特性変更フレーム信号ｘ_{ｍ，ｓｌｔ}が符号化される。この時間的特性変更フレーム信号ｘ_{ｍ，ｓｌｔ}の符号化系列は、信号符号化系列として装置外部に出力される（Ｓ９）。 When k <K is not satisfied, that is, when k ≧ K is satisfied (S3; NO), the most evaluated frame evaluation value E _mk corresponding to each temporal characteristic change amount _ck used in S4. A high value _{Em, slt} is selected. Then, the temporal characteristic change amount _{cm, slt} corresponding to this is selected as the optimum change amount (S7). In S8, the temporal characteristic change frame signal _{xm, slt} corresponding to the temporal characteristic change amount _{cm, slt} is encoded. The encoded sequence of the temporal characteristic change frame signal _{xm, slt} is output to the outside of the apparatus as a signal encoded sequence (S9).

Ｓ１０では、Ｓ１で分割された入力信号を構成するフレームのうち、最終フレームまで一連の処理（Ｓ２〜Ｓ９の各処理）が終了したか否かが判定される。終了していない場合には（Ｓ１０；ＮＯ）、信号符号化系列が未だに出力されていないフレームが残存するものと判断できるので、Ｓ２に戻り、後続のフレームに関して上記一連の処理が実行される。入力信号を構成する全てのフレームに関して上記一連の処理が完了した時点で（Ｓ１０；ＹＥＳ）、本実施の形態の信号符号化処理は終了する。 In S10, it is determined whether a series of processes (each process of S2 to S9) has been completed up to the final frame among the frames constituting the input signal divided in S1. If it has not been completed (S10; NO), it can be determined that there remains a frame for which a signal encoded sequence has not yet been output. When the above-described series of processing is completed for all the frames constituting the input signal (S10; YES), the signal encoding processing of the present embodiment ends.

ここで、図３は、第１の実施形態の変形態様における信号符号化装置１０の動作を説明するためのフローチャートである。本態様における信号符号化装置１０の動作は、上記詳述した信号符号化装置１０の動作と類似するので、共通するステップには同一のステップ番号を付すと共に、その説明は省略する。かかる態様では、差異であるＳ１１〜Ｓ１３の処理をＳ５の後処理として追加することにより、Ｓ７の処理の実行を省略する。 Here, FIG. 3 is a flowchart for explaining the operation of the signal encoding device 10 according to a modification of the first embodiment. Since the operation of the signal encoding device 10 in this aspect is similar to the operation of the signal encoding device 10 described in detail above, common steps are denoted by the same step numbers and description thereof is omitted. In this aspect, the processing of S7 is omitted by adding the processing of S11 to S13, which is the difference, as the post-processing of S5.

すなわち、Ｓ１１ではｋ＝０であるか否かの判定が為され、ｋ＝０である場合には（Ｓ１１；ＹＥＳ）、Ｅ_{ｍ，ｓｌｔ}＝Ｅ_ｍｋ，ｃ_{ｍ，ｓｌｔ}＝ｃ_ｋ，ｘ_{ｍ，ｓｌｔ}＝ｘ_ｍｋがそれぞれ設定される（Ｓ１２）。一方、ｋ≠０である場合には（Ｓ１１；ＮＯ）、フレーム評価値Ｅ_ｍｋの評価がＥ_{ｍ，ｓｌｔ}の評価よりも高いか否かが判定される（Ｓ１３）。判定の結果、フレーム評価値Ｅ_ｍｋの評価がＥ_{ｍ，ｓｌｔ}の評価よりも高い場合には（Ｓ１３；ＹＥＳ）、Ｓ１２に移行し、フレーム評価値Ｅ_ｍｋの評価がＥ_{ｍ，ｓｌｔ}の評価以下である場合には（Ｓ１３；ＮＯ）、Ｓ１２の処理は省略され、Ｓ６以降の処理に移行する。 That is, in S11, it is determined whether or not k = 0. If k = 0 (S11; YES), E _{m, slt} = E _mk , _{cm, slt} = c _k , x _{m , Slt} = x _mk are set (S12). On the other hand, if k ≠ 0 (S11; NO), it is determined whether or not the evaluation of the frame evaluation value E _mk is higher than the evaluation of E _{m, slt} (S13). As a result of the determination, when the evaluation of the frame evaluation value E _mk is higher than the evaluation of E _{m, slt} (S13; YES), the process proceeds to S12, and the evaluation of the frame evaluation value E _mk is equal to or less than the evaluation of E _{m, slt} . (S13; NO), the process of S12 is omitted, and the process proceeds to S6 and subsequent processes.

このような変形態様を採ることにより、評価の最も高い時間的特性変更信号、時間的特性変更量、及び、フレーム評価値を信号符号化装置１０に保持しておけば、変換係数を局在化することができる。換言すれば、全ての時間的特性変更信号、時間的特性変更量、及びフレーム評価値を保持しておく必要が無いので、データ記憶領域の節約が実現される。 By adopting such a modification mode, if the temporal characteristic change signal, the temporal characteristic change amount, and the frame evaluation value with the highest evaluation are held in the signal encoding device 10, the transform coefficient is localized. can do. In other words, since it is not necessary to hold all temporal characteristic change signals, temporal characteristic change amounts, and frame evaluation values, data storage area can be saved.

［第２の実施形態］
次に、図４及び図５を参照して、本発明の第２の実施形態について説明する。
図４は、本実施の形態における信号符号化装置の機能的構成を示すブロック図である。信号符号化装置２０の構成は、第１の実施形態において詳述した信号符号化装置１０の構成と類似するので、共通する構成部分には同列（末尾が同一）の符号を付しその説明は省略すると共に、第１の実施形態との差異について詳述する。 [Second Embodiment]
Next, a second embodiment of the present invention will be described with reference to FIGS.
FIG. 4 is a block diagram showing a functional configuration of the signal encoding apparatus according to the present embodiment. Since the configuration of the signal encoding device 20 is similar to the configuration of the signal encoding device 10 described in detail in the first embodiment, common components are denoted by the same column (same end), and the description thereof will be given. While omitted, the differences from the first embodiment will be described in detail.

変更量符号化部２９は、信号符号化装置２０に特有の構成部分である。
変更量符号化部２９は、時間的特性変更量選択部２６から通知された時間的特性変更量ｃ_{ｍ，ｓｌｔ}を符号化して変更量符号化系列を生成する。この変更量符号化系列は、符号化系列出力部２８により、信号符号化系列とともに出力される。 The change amount encoding unit 29 is a component unique to the signal encoding device 20.
The change amount encoding unit 29 encodes the temporal characteristic change amounts _{cm and slt} notified from the temporal characteristic change amount selection unit 26 to generate a change amount encoded sequence. This change amount encoded sequence is output by the encoded sequence output unit 28 together with the signal encoded sequence.

次いで、図５を参照することで、第２の実施形態における信号符号化処理について説明する。本信号符号化処理は、第１の実施形態において詳述した信号符号化処理（図２参照）と共通するステップを複数含む。具体的には、図５のＳ２１〜Ｓ２７，Ｓ２１０は、図２に示したＳ１〜Ｓ７，Ｓ１０にそれぞれ相当する。 Next, the signal encoding process in the second embodiment will be described with reference to FIG. This signal encoding process includes a plurality of steps in common with the signal encoding process (see FIG. 2) described in detail in the first embodiment. Specifically, S21 to S27 and S210 in FIG. 5 correspond to S1 to S7 and S10 shown in FIG. 2, respectively.

以下、第１及び第２の実施形態における差異であるＳ２８，Ｓ２９について説明する。図５のＳ２８では、符号化系列出力部２８により、時間的特性変更フレーム信号ｘ_{ｍ，ｓｌｔ}と最適変更量ｃ_{ｍ，ｓｌｔ}とが符号化される。その結果、時間的特性変更フレーム信号ｘ_{ｍ，ｓｌｔ}からは信号符号化系列が生成され、最適変更量ｃ_{ｍ，ｓｌｔ}からは変更量符号化系列が生成される。これらの符号化系列は、符号化系列出力部２８により外部装置に出力される（Ｓ２９）。 Hereinafter, S28 and S29, which are the differences between the first and second embodiments, will be described. In S28 of FIG. 5, the encoded sequence output unit 28 encodes the temporal characteristic change frame signal _{xm, slt} and the optimal change amount _{cm, slt} . As a result, a signal encoded sequence is generated from the temporal characteristic change frame signals x _{m and slt} _, and a change amount encoded sequence is generated from the optimal change amounts _{cm and slt} . These encoded sequences are output to the external device by the encoded sequence output unit 28 (S29).

［第３の実施形態］
次に、図６及び図７を参照して、本発明の第３の実施形態について説明する。
図６は、本実施の形態における信号符号化装置の機能的構成を示すブロック図である。信号符号化装置３０の構成は、第１の実施形態において詳述した信号符号化装置１０の構成と類似するので、共通する構成部分には同列（末尾が同一）の符号を付しその説明は省略すると共に、第１の実施形態との差異について詳述する。 [Third Embodiment]
Next, a third embodiment of the present invention will be described with reference to FIGS.
FIG. 6 is a block diagram showing a functional configuration of the signal encoding apparatus according to the present embodiment. Since the configuration of the signal encoding device 30 is similar to the configuration of the signal encoding device 10 described in detail in the first embodiment, common components are denoted by the same reference numerals (same end), and the description thereof is as follows. While omitted, the differences from the first embodiment will be described in detail.

時間的特性変更判定部３９（判定手段に対応）は、信号符号化装置３０に特有の構成部分である。
時間的特性変更判定部３９は、分割されたｍ番目の入力フレーム信号ｘ_ｍに対して時間的特性の変更処理を施すか否かの判定を行う。判定の基準としては、例えば、入力フレーム信号ｘ_ｍの電力値を算出し、この電力値と閾値との大小関係を利用することができる。すなわち、時間的特性変更判定部３９は、フレームの電力値が閾値以上である場合にのみ時間的特性変更処理を実行する。 The temporal characteristic change determination unit 39 (corresponding to the determination unit) is a component unique to the signal encoding device 30.
The temporal characteristic change determination unit 39, and determines whether to perform the process of changing the temporal characteristics with respect to the divided m-th input frame signal x _m. As the basis for judgment, for example, calculates the power value of the input frame signal x _m, it may be utilized magnitude relation between the power value and the threshold value. That is, the temporal characteristic change determination unit 39 executes the temporal characteristic change process only when the power value of the frame is equal to or greater than the threshold value.

続いて、図７を参照することで、第３の実施形態における信号符号化処理について説明する。本信号符号化処理は、第１の実施形態において詳述した信号符号化処理（図２参照）と共通するステップを複数含む。具体的には、図７のＳ３１，Ｓ３３〜Ｓ３１１は、図２に示したＳ１，Ｓ２〜Ｓ１０にそれぞれ相当する。 Subsequently, a signal encoding process according to the third embodiment will be described with reference to FIG. This signal encoding process includes a plurality of steps in common with the signal encoding process (see FIG. 2) described in detail in the first embodiment. Specifically, S31 and S33 to S311 in FIG. 7 correspond to S1, S2 to S10 shown in FIG.

以下、第１及び第３の実施形態における差異であるＳ３２，Ｓ３１２について説明する。図７のＳ３２では、時間的特性変更判定部３９により、時間的特性の変更処理を実行するか否かの判定が為される。当該判定の結果、変更処理の実行が決定された場合には（Ｓ３２；ＹＥＳ）、Ｓ３３以降の処理に移行する。これに対して、変更処理の非実行が決定された場合には（Ｓ３２；ＮＯ）、信号符号化処理はＳ３１２に移行する。すなわち、入力フレーム信号は、フレーム分割部３１から信号符号化部３７に出力され、時間的特性変更処理が施されないまま符号化される（Ｓ３９）。符号化の結果生成された信号符号化系列は、符号化系列出力部３８から外部に出力される（Ｓ３１０）。 Hereinafter, S32 and S312 which are the differences between the first and third embodiments will be described. In S32 of FIG. 7, the temporal characteristic change determination unit 39 determines whether or not to execute the temporal characteristic change process. As a result of the determination, if execution of the change process is determined (S32; YES), the process proceeds to S33 and subsequent steps. On the other hand, when it is determined not to execute the change process (S32; NO), the signal encoding process proceeds to S312. That is, the input frame signal is output from the frame dividing unit 31 to the signal encoding unit 37, and is encoded without being subjected to the temporal characteristic changing process (S39). The signal encoded sequence generated as a result of encoding is output to the outside from the encoded sequence output unit 38 (S310).

［第４の実施形態］
次に、図８及び図９を参照して、本発明の第４の実施形態について説明する。
図８は、本実施の形態における信号符号化装置の機能的構成を示すブロック図である。信号符号化装置４０の構成は、第３の実施形態において詳述した信号符号化装置３０の構成と類似するので、共通する構成部分には同列（末尾が同一）の符号を付しその説明は省略すると共に、第３の実施形態との差異について詳述する。 [Fourth Embodiment]
Next, a fourth embodiment of the present invention will be described with reference to FIGS.
FIG. 8 is a block diagram showing a functional configuration of the signal encoding apparatus according to the present embodiment. Since the configuration of the signal encoding device 40 is similar to the configuration of the signal encoding device 30 described in detail in the third embodiment, common components are denoted by the same reference numerals (same end), and description thereof will be given. While omitted, the differences from the third embodiment will be described in detail.

時間的特性変更判定部４９は、分割されたｍ番目の入力フレーム信号ｘ_ｍに対して時間的特性の変更処理を施すか否かの判定を行う。時間的特性の変更処理を実行しないものと判定された場合には、時間的特性変更判定部４９は、入力フレーム信号を信号符号化部４７に直接に出力すると共に、時間的特性変更処理が施されていない旨を示す時間的特性変更量「ゼロ」を変更量符号化部４１０に通知する。一方、時間的特性の変更処理を実行するものと判定された場合には、時間的特性変更判定部４９は、変更量「ゼロ」を変更量符号化部４１０に通知することなく、入力フレーム信号を時間的特性変更部４３に出力する。 The temporal characteristic change determination unit 49, and determines whether to perform the process of changing the temporal characteristics with respect to the divided m-th input frame signal x _m. When it is determined that the temporal characteristic changing process is not to be executed, the temporal characteristic change determining unit 49 outputs the input frame signal directly to the signal encoding unit 47 and performs the temporal characteristic changing process. The change amount encoding unit 410 is notified of the time characteristic change amount “zero” indicating that the change is not made. On the other hand, when it is determined that the temporal characteristic changing process is to be executed, the temporal characteristic change determining unit 49 does not notify the change amount encoding unit 410 of the change amount “zero”, and the input frame signal Is output to the temporal characteristic changing unit 43.

変更量符号化部４１０は、時間的特性変更量選択部４６から通知された時間的特性変更量ｃ_{ｍ，ｓｌｔ}を符号化して変更量符号化系列を生成する。また、変更量符号化部４１０は、時間的特性変更処理が実行されない場合に入力される変更量「ゼロ」を符号化して変更量符号化系列を生成する。これらの変更量符号化系列は、符号化系列出力部４８により、信号符号化系列とともに出力される。 The change amount encoding unit 410 encodes the temporal characteristic change amounts _{cm and slt} notified from the temporal characteristic change amount selection unit 46 to generate a change amount encoded sequence. Also, the change amount encoding unit 410 encodes the change amount “zero” input when the temporal characteristic change process is not executed, and generates a change amount encoded sequence. These change amount encoded sequences are output together with the signal encoded sequence by the encoded sequence output unit 48.

次いで、図９を参照することで、第４の実施形態における信号符号化処理について説明する。本信号符号化処理は、第３の実施形態において詳述した信号符号化処理（図７参照）と基本的に同様である。具体的には、図９のＳ４１〜Ｓ４１１は、図７に示したＳ３１〜Ｓ３１１にそれぞれ相当する。特に、Ｓ４９，Ｓ４１０の各処理に関しては、第２の実施形態にて説明したＳ２８，Ｓ２９（図５参照）と同様である。 Next, a signal encoding process in the fourth embodiment will be described with reference to FIG. This signal encoding process is basically the same as the signal encoding process (see FIG. 7) described in detail in the third embodiment. Specifically, S41 to S411 in FIG. 9 correspond to S31 to S311 shown in FIG. 7, respectively. In particular, the processes of S49 and S410 are the same as S28 and S29 (see FIG. 5) described in the second embodiment.

以下、第３と第４の実施形態との差異であるＳ４１２について説明する。図９のＳ４１２は、時間的特性変更処理が実行されないと判定された場合（Ｓ４２；ＮＯ）に実行される。Ｓ４１２では、入力フレーム信号ｘ_ｍが、時間的特性変更フレーム信号ｘ_{ｍ，ｓｌｔ}として信号符号化部４７に入力される。同様に、最適変更量ｃ_{ｍ，ｓｌｔ}としては、上記変更処理を施していないことを示す「ゼロ」が変更量符号化部４１０に入力される。Ｓ４１２の処理が終了すると、Ｓ４９以降の処理に移行する。 Hereinafter, S412 which is the difference between the third and fourth embodiments will be described. S412 in FIG. 9 is executed when it is determined that the temporal characteristic changing process is not executed (S42; NO). In S412, the input frame signal _{x m} is the temporal characteristic change frame signal _{x m,} is input to the signal encoding unit 47 as a _slt. Similarly, “zero” indicating that the change processing is not performed is input to the change amount encoding unit 410 as the optimum change amount _{cm, slt} . When the process of S412 ends, the process proceeds to S49 and subsequent processes.

［第５の実施形態］
次に、図１０及び図１１を参照して、本発明の第５の実施形態について説明する。図１０は、生成された符号化系列を復号する信号復号装置の機能的構成を示すブロック図である。図１０に示すように、信号復号装置５０は、信号復号部５１（復号手段に対応）と、変更量復号部５２（生成手段に対応）と、時間的特性変更部５３（変更手段に対応）と、時間的特性変更復号フレーム信号出力部５４（出力手段に対応）とを備える。これら各部はバスを介して接続されている。 [Fifth Embodiment]
Next, a fifth embodiment of the present invention will be described with reference to FIGS. FIG. 10 is a block diagram illustrating a functional configuration of a signal decoding apparatus that decodes a generated encoded sequence. As shown in FIG. 10, the signal decoding apparatus 50 includes a signal decoding unit 51 (corresponding to the decoding unit), a change amount decoding unit 52 (corresponding to the generating unit), and a temporal characteristic changing unit 53 (corresponding to the changing unit). And a temporal characteristic change decoded frame signal output unit 54 (corresponding to output means). These units are connected via a bus.

信号復号部５１は、入力された信号符号化系列を復号して復号信号とする。
変更量復号部５２は、入力された変更量符号化系列を復号することにより、位相変更量、サンプル削除量、サンプル挿入量、時間的伸縮変更量のうち、少なくとも１つを含む復号変更量を生成する。 The signal decoding unit 51 decodes the input signal encoded sequence to obtain a decoded signal.
The change amount decoding unit 52 decodes the input change amount encoded sequence to obtain a decoding change amount including at least one of a phase change amount, a sample deletion amount, a sample insertion amount, and a temporal expansion / contraction change amount. Generate.

時間的特性変更部５３は、生成された復号変更量に基づいて、復号信号の時間的特性を変更する。時間的特性変更部５３は、例えば位相変更部として機能し、この場合、変更量符号化系列としては位相変更量が符号化されたものが使用される。例えば、第２の実施形態における信号符号化装置２０において、入力フレーム信号ｘ_ｍの時間的特性が復号変更量ｃ_ｓｌｔにより変更された場合には、本実施の形態における信号復号装置５０では、−ｃ_ｓｌｔにより、復号フレーム信号ｘ_ｍの時間的特性を変更する。
時間的特性変更復号フレーム信号出力部５４は、時間的特性が変更された復号信号を時間的特性変更復号フレーム信号（復号信号に対応）として外部に出力する。 The temporal characteristic changing unit 53 changes the temporal characteristic of the decoded signal based on the generated decoding change amount. The temporal characteristic change unit 53 functions as, for example, a phase change unit. In this case, a change amount encoded sequence in which a phase change amount is encoded is used. For example, in the signal encoding device 20 according to the second embodiment, when the temporal characteristic of the input frame signal x _m is changed by the decoding change amount c _slt, the signal decoding device 50 according to the present embodiment uses − the c _slt, changes the temporal characteristics of the decoded frame signal _{x m.}
The temporal characteristic changed decoded frame signal output unit 54 outputs the decoded signal whose temporal characteristic has been changed to the outside as a temporal characteristic changed decoded frame signal (corresponding to the decoded signal).

続いて、図１１を参照しながら、信号復号装置５０の動作を説明する。
まずＴ１では、信号復号部５１において、入力された信号符号化系列が復号され、その結果、復号フレーム信号ｘ’_ｍが生成される。この復号フレーム信号ｘ’_ｍは、時間的特性変更部５３に入力される。一方、変更量復号部５２では、入力された変更量符号化系列が復号された結果、復号変更量ｃ’_ｓｌｔが生成される（Ｔ２）。この復号変更量ｃ’_ｓｌｔは、時間的特性変更部５３に入力される。Ｔ１，Ｔ２の処理は、Ｔ２，Ｔ１の順で、つまり逆順で実行されてもよいし、同時に並行して実行されてもよい。 Next, the operation of the signal decoding device 50 will be described with reference to FIG.
First, at T1, the signal decoding unit 51 decodes the input signal encoded sequence, and as a result, a decoded frame signal x ′ _m is generated. The decoded frame signal x ′ _m is input to the temporal characteristic changing unit 53. On the other hand, the change amount decoding unit 52 generates a decoding change amount _c′slt as a result of decoding the input change amount coded sequence (T2). This decoding change amount c ′ _slt is input to the temporal characteristic changing unit 53. The processes of T1 and T2 may be executed in the order of T2 and T1, that is, in the reverse order, or may be executed in parallel at the same time.

時間的特性変更部５３は、上記復号フレーム信号ｘ’_ｍの時間的特性を、上記復号変更量ｃ’_ｓｌｔに基づいて変更し、時間的特性変更復号フレーム信号ｘ’_{ｍ，ｓｌｔ}を生成する（Ｔ３）。そして、生成された時間的特性変更復号フレーム信号ｘ’_{ｍ，ｓｌｔ}は、時間的特性変更復号フレーム信号出力部５４にて出力される（Ｔ４）。Ｔ５では、最終のフレーム信号までの復号処理が完了したか否かの判定が為され、完了していない場合には（Ｔ５；ＮＯ）Ｔ１以降の処理が再び実行される。全てのフレーム信号に関する信号復号処理が終了した時点で（Ｔ５；ＹＥＳ）、一連の信号復号処理は終了する。 The temporal characteristic change section 53, 'the temporal characteristics of _m, the decoding change amount c' the decoded frame signal x to change based on _slt, temporal characteristic change decoded frame signal x _'m, to produce a _slt ( T3). The generated temporal characteristic change decoded frame signal x ′ _{m, slt} is output from the temporal characteristic change decoded frame signal output unit 54 (T4). At T5, it is determined whether or not the decoding process up to the final frame signal has been completed. If it has not been completed (T5; NO), the processes after T1 are executed again. When the signal decoding process for all the frame signals is completed (T5; YES), a series of signal decoding processes are completed.

以上説明したように、第１〜第５の実施形態における信号符号化装置１０〜５０によれば、入力信号がフレームに分割された後に、これらのフレームに時間的特性（例えば位相）の変更処理を施して符号化（第５の実施形態では復号）する。したがって、位相情報を明示的にもたない変換規則に関しても変換係数が局在化されるので、変換符号化における圧縮効率（第５の実施形態では伸張効率）が向上する。更に、第１、第３、及び第４の実施形態における信号符号化装置１０，３０，４０によれば、時間的特性を変更するか否かの判定をフレーム単位で行うため、本来不必要な、フレームに関する時間的特性の変更処理を省略することができる。この結果、処理量が必要最小限に低減される。 As described above, according to the signal encoding devices 10 to 50 in the first to fifth embodiments, after the input signal is divided into frames, the temporal characteristic (for example, phase) change processing is performed on these frames. And encoding (decoding in the fifth embodiment). Therefore, since the transform coefficient is localized even for a transform rule that does not explicitly include phase information, the compression efficiency (transformation efficiency in the fifth embodiment) in transform coding is improved. Furthermore, according to the signal encoding devices 10, 30, and 40 in the first, third, and fourth embodiments, since it is determined in units of frames whether or not the temporal characteristics are to be changed, it is unnecessary originally. Thus, the process of changing the temporal characteristics related to the frame can be omitted. As a result, the processing amount is reduced to a necessary minimum.

なお、本発明は、上述した各実施の形態に限定されるものではなく、その趣旨を逸脱しない範囲において、適宜変形態様を採ることもできる。
例えば、第１〜第５の実施形態においては、時間的特性変更部は、時間的特性の一例である位相を変更するものとしたが、サンプル削除機能を有するものとしてもよい。すなわち、時間的特性変更部は、時間的特性変更量としてサンプル削除量が指示されたことに伴い、サンプル削除量だけ入力信号からサンプルを削除する。かかる態様では、フレーム分割部は、入力信号をフレームに分割する際に、あるフレーム信号から上記サンプル削除量だけサンプルを削除した後に、後続のフレーム信号に当該サンプル削除量だけサンプルを追加する。これにより、データ長Ｎのフレームを構成する。 In addition, this invention is not limited to each embodiment mentioned above, In the range which does not deviate from the meaning, a deformation | transformation aspect can also be taken suitably.
For example, in the first to fifth embodiments, the temporal characteristic changing unit changes the phase, which is an example of the temporal characteristic, but may have a sample deletion function. That is, the temporal characteristic changing unit deletes the sample from the input signal by the amount of sample deletion when the sample deletion amount is instructed as the temporal characteristic change amount. In such an aspect, when the input signal is divided into frames, the frame dividing unit deletes the sample from the certain frame signal by the sample deletion amount, and then adds the sample to the subsequent frame signal by the sample deletion amount. Thus, a frame having a data length N is configured.

反対に、時間的特性変更部は、サンプル挿入機能を有することもできる。すなわち、時間的特性変更部は、時間的特性変更量としてサンプル挿入量が指示されたことに伴い、サンプル挿入量だけ入力信号にサンプルを挿入する。本態様では、フレーム分割部は、入力信号をフレームに分割する際に、あるフレーム信号に対して上記サンプル挿入量だけサンプルを挿入した後、当該フレーム信号の末尾のサンプルより当該サンプル挿入量だけサンプルを消去する。これにより、データ長Ｎのフレームを構成する。消去されたサンプルは、後続するフレーム信号の先頭のサンプルに使用される。 On the contrary, the temporal characteristic changing unit may have a sample insertion function. That is, the temporal characteristic changing unit inserts a sample into the input signal by the amount of sample insertion when the sample insertion amount is instructed as the temporal characteristic change amount. In this aspect, when the frame dividing unit divides the input signal into frames, after inserting the sample by the sample insertion amount into a certain frame signal, the sample is inserted by the sample insertion amount from the sample at the end of the frame signal. Erase. Thus, a frame having a data length N is configured. The erased sample is used as the first sample of the subsequent frame signal.

更に、時間的特性変更部は、時間的伸縮機能を有するものとしてもよい。すなわち、時間的特性変更部は、入力フレーム信号のある部分の時間分解能を高くし、その分、別の信号部分の時間分解能を低くする。これにより、サンプル数を変えることなく、入力フレーム信号の時間的伸縮を実現する。 Furthermore, the temporal characteristic changing unit may have a temporal expansion / contraction function. That is, the temporal characteristic changing unit increases the time resolution of a certain part of the input frame signal and decreases the time resolution of another signal part accordingly. This realizes temporal expansion and contraction of the input frame signal without changing the number of samples.

図１２を参照して、より具体的に説明すると、時間的特性変更部（例えば、第１の実施形態における時間的特性変更部１３）は、複数の特性変更部に分割することができる。すなわち、第１時間的特性変更部１３１に位相変更機能をもたせ、第２、第３時間的特性変更部１３２，１３３にそれぞれサンプル削除機能、サンプル挿入機能をもたせることができる。この場合、フレーム評価値算出部１５は、第１〜第３の時間的特性変更信号の各々に関してフレーム評価値を算定する。 More specifically, with reference to FIG. 12, the temporal characteristic changing unit (for example, the temporal characteristic changing unit 13 in the first embodiment) can be divided into a plurality of characteristic changing units. That is, the first temporal characteristic changing unit 131 can have a phase changing function, and the second and third temporal characteristic changing units 132 and 133 can have a sample deletion function and a sample insertion function, respectively. In this case, the frame evaluation value calculation unit 15 calculates a frame evaluation value for each of the first to third temporal characteristic change signals.

例えば、時間的特性変更量指示部１２が、第１時間的特性変更量をＫ_１個、第２時間的特性変更量をＫ_２個、第３時間的特性変更量をＫ_３個、保持しているものとする。かかる例では、（Ｋ_１＋Ｋ_２＋Ｋ_３）個のフレーム評価値が算出されることになる。時間的特性変更量選択部１６は、これら全ての時間的特性変更量に対応するフレーム評価値の高低を比較して、評価が最も高い時間的特性変更量を最適変更量に選択する。選択された最適変更量に対応する時間的特性変更信号は、時間的特性変更信号保持部１４から信号符号化部１７に出力される。 For example, the temporal characteristic change amount instruction section 12, or K ₁ of the first temporal characteristic change amount, a second temporal characteristic change amount _two K, the third temporal characteristic change amount _three K, retained It shall be. In such an example, (K ₁ + K ₂ + K ₃ ) frame evaluation values are calculated. The temporal characteristic change amount selection unit 16 compares the frame evaluation values corresponding to all these temporal characteristic change amounts, and selects the temporal characteristic change amount having the highest evaluation as the optimal change amount. The temporal characteristic change signal corresponding to the selected optimal change amount is output from the temporal characteristic change signal holding unit 14 to the signal encoding unit 17.

なお、本変形態様において、時間的特性変更量指示部１２は、各フレーム信号に対して異なる種類の時間的特性変更量を指示するものとしてもよい。また、過去のフレーム信号において選択された時間的特性変更量に応じて、指示する時間的特性変更量を変化させてもよい。 In this modification, the temporal characteristic change amount instruction unit 12 may instruct different types of temporal characteristic change amounts for each frame signal. Further, the time characteristic change amount to be indicated may be changed according to the time characteristic change amount selected in the past frame signal.

更に別の変形態様における信号符号化装置は、符号化歪みを算出して、その歪み量や聴覚的な重み付けをした歪み量をフレーム評価値に使用する。あるいは、符復号信号を用いてＰＥＳＱ（ＩＴＵ−Ｔ勧告Ｐ．８６２）、ＰＥＡＱ（ＩＴＵ−Ｒ勧告ＢＳ．１３８７−１）により算出される客観評価値を用いることもできる。符復号信号の歪み量を評価値とした態様（前者の態様）における信号符号化装置の構成例を図１３に示す。 Further, the signal coding apparatus according to another modification mode calculates coding distortion, and uses the distortion amount or the distortion amount subjected to auditory weighting as a frame evaluation value. Alternatively, an objective evaluation value calculated by PESQ (ITU-T recommendation P.862) or PEAQ (ITU-R recommendation BS.1387-1) using a codec signal may be used. FIG. 13 shows a configuration example of the signal encoding apparatus in an aspect (the former aspect) in which the amount of distortion of the codec signal is an evaluation value.

図１３に示すように、時間的特性変更フレーム信号ｘ_ｍｋは、信号符号化部１７にて符号化され、その結果、信号符号化系列が生成される。この信号符号化系列は、符号化歪み量算出部１９と信号符号化系列保持部１１０とに出力される。符号化歪み量算出部１９は、入力された信号符号化系列を復号して符復号信号を得る。そして、この信号と入力フレーム信号との差分である歪みのエネルギ量（符号化歪み量ｄ_ｍｋ）を算出する。 As shown in FIG. 13, the temporal characteristic change frame signal x _mk is encoded by the signal encoding unit 17, and as a result, a signal encoded sequence is generated. This signal encoded sequence is output to the encoded distortion amount calculation unit 19 and the signal encoded sequence holding unit 110. The coding distortion amount calculation unit 19 decodes the input signal coded sequence to obtain a coded signal. Then, a distortion energy amount (encoding distortion amount d _mk ), which is a difference between this signal and the input frame signal, is calculated.

入力フレーム信号ｘ_ｍに関して、全ての時間的特性変更量ｃ_ｋに対応する符号化歪み量ｄ_ｍｋが算出されると、時間的特性変更量選択部１６において、最も小さい符号化歪み量に対応する時間的特性変更量が、最適変更量ｃ_{ｍ，ｓｌｔ}に選択される。信号符号化系列保持部１１０には、時間的特性変更量と信号符号化系列とが予め対応付けて保持されている。信号符号化系列保持部１１０は、時間的特性変更量選択部１６より通知される上記最適変更量ｃ_{ｍ，ｓｌｔ}に対応する信号符号化系列を符号化系列出力部１８に出力する。符号化系列出力部１８は、この信号符号化系列を出力する。 When the encoding distortion amount d _mk corresponding to all temporal characteristic change amounts _ck is calculated for the input frame signal x _m , the temporal characteristic change amount selection unit 16 corresponds to the smallest encoding distortion amount. The temporal characteristic change amount is selected as the optimal change amount _{cm, slt} . In the signal encoded sequence holding unit 110, the temporal characteristic change amount and the signal encoded sequence are stored in advance in association with each other. The signal encoded sequence holding unit 110 outputs the signal encoded sequence corresponding to the optimal change amount _{cm, slt} notified from the temporal characteristic change amount selecting unit 16 to the encoded sequence output unit 18. The encoded sequence output unit 18 outputs this signal encoded sequence.

第１の実施形態に関しては、図３を参照して説明した変形態様を採ることができることは上述した通りであるが、かかる態様は、他の変形態様と同様に、第２〜第４の実施形態に対しても適用可能である。これにより、信号符号化装置は、全ての時間的特性変更信号、時間的特性変更量、及びフレーム評価値を保持しておく必要が無い。このため、記憶データ容量を節減することができる。 Regarding the first embodiment, it is as described above that the modified embodiment described with reference to FIG. 3 can be adopted. However, this embodiment is similar to the other modified embodiments in the second to fourth embodiments. It is applicable to the form. This eliminates the need for the signal encoding device to hold all temporal characteristic change signals, temporal characteristic change amounts, and frame evaluation values. For this reason, the storage data capacity can be reduced.

本発明の第１の実施形態における信号符号化装置の構成を示すブロック図である。It is a block diagram which shows the structure of the signal encoding apparatus in the 1st Embodiment of this invention. 第１の実施形態における信号符号化装置の動作を示すフローチャートである。It is a flowchart which shows operation | movement of the signal encoding apparatus in 1st Embodiment. 本発明の第１〜第４の実施形態の変形態様における信号符号化装置の動作を示すフローチャートである。It is a flowchart which shows operation | movement of the signal coding apparatus in the deformation | transformation aspect of the 1st-4th embodiment of this invention. 第２の実施形態における信号符号化装置の構成を示すブロック図である。It is a block diagram which shows the structure of the signal encoding apparatus in 2nd Embodiment. 第２の実施形態における信号符号化装置の動作を示すフローチャートである。It is a flowchart which shows operation | movement of the signal encoding apparatus in 2nd Embodiment. 第３の実施形態における信号符号化装置の構成を示すブロック図である。It is a block diagram which shows the structure of the signal encoding apparatus in 3rd Embodiment. 第３の実施形態における信号符号化装置の動作を示すフローチャートである。It is a flowchart which shows operation | movement of the signal coding apparatus in 3rd Embodiment. 第４の実施形態における信号符号化装置の構成を示すブロック図である。It is a block diagram which shows the structure of the signal encoding apparatus in 4th Embodiment. 第４の実施形態における信号符号化装置の動作を示すフローチャートである。It is a flowchart which shows operation | movement of the signal encoding apparatus in 4th Embodiment. 本発明の第５の実施形態における信号復号装置の構成を示すブロック図である。It is a block diagram which shows the structure of the signal decoding apparatus in the 5th Embodiment of this invention. 本発明の第５の実施形態における信号復号装置の動作を示すフローチャートである。It is a flowchart which shows operation | movement of the signal decoding apparatus in the 5th Embodiment of this invention. 第１〜第５の実施形態の変形態様（時間的特性変更部を機能分割した態様）における装置構成を示すブロック図である。It is a block diagram which shows the apparatus structure in the deformation | transformation aspect (The aspect which divided the time characteristic change part into the function) of 1st-5th embodiment. 第１〜第４の実施形態の変形態様（歪み量を評価値に使用する態様）における信号符号化装置の構成を示すブロック図である。It is a block diagram which shows the structure of the signal coding apparatus in the deformation | transformation aspect (mode which uses distortion amount for an evaluation value) of 1st-4th embodiment.

Explanation of symbols

１０，２０，３０，４０…信号符号化装置、１１，２１，３１，４１…フレーム分割部、１２，２２，３２，４２…時間的特性変更量指示部、１３，２３，３３，４３…時間的特性変更部、１３１…第１時間的特性変更部、１３２…第２時間的特性変更部、１３３…第３時間的特性変更部、１４，２４，３４，４４…時間的特性変更信号保持部、１５，２５，３５，４５…フレーム評価値算出部、１６，２６，３６，４６…時間的特性変更量選択部、１７，２７，３７，４７…信号符号化部、１８，２８，３８，４８…符号化系列出力部、１９…符号化歪み量算出部、１１０…信号符号化系列保持部、２９，４１０…変更量符号化部、３９，４９…時間的特性変更判定部、５０…信号復号装置、５１…信号復号部、５２…変更量復号部、５３…時間的特性変更部、５４…時間的特性変更復号フレーム信号出力部 10, 20, 30, 40 ... signal encoding device, 11, 21, 31, 41 ... frame division unit, 12, 22, 32, 42 ... temporal characteristic change amount instruction unit, 13, 23, 33, 43 ... time Characteristic change unit, 131... First time characteristic change unit, 132... Second time characteristic change unit, 133... Third time characteristic change unit, 14, 24, 34, 44. , 15, 25, 35, 45... Frame evaluation value calculation unit, 16, 26, 36, 46... Temporal characteristic change amount selection unit, 17, 27, 37, 47... Signal encoding unit, 18, 28, 38,. 48 ... Coding sequence output unit, 19 ... Coding distortion amount calculation unit, 110 ... Signal coding sequence holding unit, 29, 410 ... Change amount coding unit, 39, 49 ... Temporal characteristic change determination unit, 50 ... Signal Decoding device 51... Signal decoding unit 52. Change amount decoding unit 53 Temporal characteristic change portion, 54 ... temporal characteristics change decoded frame signal output unit

Claims

A dividing means for dividing the input signal into frames;
Change means for changing the temporal characteristics of the frame by a plurality of change amounts;
A calculation means for calculating an evaluation value of the frame whose temporal characteristics have been changed based on a predetermined evaluation criterion;
Selection means for selecting an optimum change amount from the plurality of change amounts based on the calculated evaluation value;
Coding means for transform-coding a signal in which the temporal characteristics of the frame are changed by the selected optimum change amount;
And a signal encoding device comprising: an output unit configured to output the transform-coded signal as a signal encoded sequence.

A determination means for determining whether to change the temporal characteristic of the frame;
The changing means changes the temporal characteristics of the frame determined to change the temporal characteristics by the determining means by a plurality of change amounts,
The encoding unit transform-encodes a signal in which the temporal characteristic of the frame is changed by the selected optimal change amount for the frame determined to change the temporal characteristic by the determining unit; 2. The signal encoding apparatus according to claim 1, wherein for the frame determined not to change the temporal characteristic by the determination unit, the signal of the frame is directly converted and encoded.

The encoding means encodes the optimum amount of change together with the signal of the frame,
The signal encoding apparatus according to claim 1, wherein the output unit outputs a change amount coding sequence indicating the optimum change amount together with the signal coding sequence.

The signal encoding apparatus according to claim 1, wherein the encoding unit encodes either a discrete cosine transform coefficient or a modified discrete cosine transform coefficient.

The signal encoding apparatus according to claim 1, wherein the calculation unit calculates a conversion gain based on a predetermined conversion rule.

The signal encoding apparatus according to claim 1, wherein the calculating unit calculates an auditory entropy based on an auditory psychological model.

5. The signal encoding apparatus according to claim 1, wherein the calculation unit calculates an objective evaluation value based on a distortion signal of a code decoding signal. 6.

The signal encoding apparatus according to claim 1, wherein the calculation unit calculates a distortion amount of the codec signal.

The signal encoding apparatus according to claim 8, wherein the distortion amount of the codec signal is a perceptually weighted distortion amount.

Decoding means for decoding an input signal encoded sequence into a decoded signal;
Generating means for generating a decoding change amount including at least one of a phase change amount, a sample deletion amount, a sample insertion amount, and a temporal expansion / contraction change amount by decoding the input change amount encoded sequence;
Changing means for changing a temporal characteristic of the decoded signal based on the decoding change amount;
An output means for outputting a decoded signal whose temporal characteristics have been changed by the changing means.

A dividing step of dividing the input signal into frames;
A change step of changing the temporal characteristics of the frame by a plurality of change amounts;
A calculation step for calculating an evaluation value of the frame whose temporal characteristics are changed based on a predetermined evaluation criterion;
A selection step of selecting an optimal change amount from the plurality of change amounts based on the calculated evaluation value;
An encoding step of transform-encoding a signal in which the temporal characteristics of the frame are changed by the selected optimal change amount;
An output step of outputting the transform-encoded signal as a signal encoding sequence.

A decoding step of decoding the input signal encoded sequence into a decoded signal;
Generating a decoding change amount including at least one of a phase change amount, a sample deletion amount, a sample insertion amount, and a temporal expansion / contraction change amount by decoding the input change amount encoded sequence;
A changing step of changing a temporal characteristic of the decoded signal based on the decoding change amount;
An output step of outputting the decoded signal whose temporal characteristics have been changed in the changing step.