JPH09232964A

JPH09232964A - Variable block length converting and encoding device and transient state detecting device

Info

Publication number: JPH09232964A
Application number: JP8056898A
Authority: JP
Inventors: Toru Chinen; 徹知念
Original assignee: Nippon Steel Corp
Current assignee: Nippon Steel Corp
Priority date: 1996-02-20
Filing date: 1996-02-20
Publication date: 1997-09-05

Abstract

PROBLEM TO BE SOLVED: To highly precisely detect the transient state of an input signal through a simple configuration. SOLUTION: A transient state detection circuit 2 is constituted so that the input signal is fetched for every prescribed sections, and a sum of squares is obtained respectively, and the transient state of the above-mentioned input signal is detected on the basis of the degree of the change of the signal whose sum of squares is calculated for every section extending over, at least, two sections. Then, the transient state is made capable of being detected by only executing the calculation of the sum of squares of the input signal on a time base without executing orthogonal transformation processing or filter processing.

Description

Detailed Description of the Invention

【０００１】[0001]

【発明の属する技術分野】本発明は、ブロック長可変型
変換符号化装置および過渡状態検出装置に関するもので
ある。BACKGROUND OF THE INVENTION 1. Field of the Invention The present invention relates to a variable block length transform coding device and a transient state detecting device.

【０００２】[0002]

【従来の技術】オーディオデータの高能率符号化には、
オーディオデータを時間軸上で複数の帯域に分割して符
号化する帯域分割符号化や、オーディオデータを周波数
軸上に直交変換し、複数の帯域に分割して符号化する変
換符号化などがある。また、これらの符号化方式を組み
合わせて、オーディオデータを時間軸上で複数の帯域に
分割し、各帯域信号を更に周波数軸上に直交変換して符
号化する高能率符号化もある。2. Description of the Related Art For high efficiency encoding of audio data,
There are band division encoding that divides audio data into multiple bands on the time axis and encodes it, and transform encoding that orthogonally transforms audio data on the frequency axis and divides it into multiple bands and encodes it. . There is also high-efficiency coding in which these coding methods are combined to divide audio data into a plurality of bands on the time axis, and each band signal is further orthogonally converted and coded on the frequency axis.

【０００３】従来技術の一例として、図３に示すＭＤＣ
Ｔ（モディファイド離散余弦変換）を用いたブロック長
可変型変換符号化について説明する。図３において、オ
ーディオデータ入力端子３１より入力されたオーディオ
データは、Ｎ／２サンプル（Ｎは例えば５１２）を１ブ
ロックとして、前ブロックのＮ／２サンプルと現ブロッ
クのＮ／２サンプルとを合わせたＮサンプルが、過渡状
態検出回路３２とＭＤＣＴ回路３３とに入力される。As an example of the prior art, the MDC shown in FIG.
Block length variable transform coding using T (Modified Discrete Cosine Transform) will be described. In FIG. 3, the audio data input from the audio data input terminal 31 has N / 2 samples (N is, for example, 512) as one block and combines the N / 2 samples of the previous block and the N / 2 samples of the current block. The N samples are input to the transient state detection circuit 32 and the MDCT circuit 33.

【０００４】過渡状態検出回路３２は、入力されるＮサ
ンプルの信号の性質に応じて、ＭＤＣＴ回路３３で行う
ＭＤＣＴ処理の直交変換サイズを決定し、それをＭＤＣ
Ｔ回路３３に供給する。ＭＤＣＴ回路３３は、過渡状態
検出回路３２から供給される直交変換のサンプルサイズ
に従って、入力されるＮサンプルのオーディオデータを
ＭＤＣＴにより直交変換し、それにより得られるＮ個の
ＭＤＣＴ係数データをブロック浮動小数点変換回路３４
に出力する。The transient state detection circuit 32 determines the orthogonal transform size of the MDCT processing performed by the MDCT circuit 33 according to the property of the input N-sample signal, and determines it by MDC.
It is supplied to the T circuit 33. The MDCT circuit 33 orthogonally transforms input N-sample audio data by MDCT according to the orthogonal transformation sample size supplied from the transient state detection circuit 32, and obtains N MDCT coefficient data obtained by the block floating point. Conversion circuit 34
Output to

【０００５】ブロック浮動小数点変換回路３４は、ＭＤ
ＣＴ回路３３より入力されるＮ個のＭＤＣＴ係数データ
を仮数部と指数部とで表現される浮動小数点データに変
換し、指数部データをビット割り当て回路３５に出力す
るとともに、仮数部データを量子化符号化回路３６に出
力する。The block floating point conversion circuit 34 is an MD
The N MDCT coefficient data input from the CT circuit 33 is converted into floating point data represented by a mantissa part and an exponent part, the exponent part data is output to the bit allocation circuit 35, and the mantissa part data is quantized. Output to the encoding circuit 36.

【０００６】ビット割り当て回路３５は、ブロック浮動
小数点変換回路３４より入力される指数部データをもと
に、人の聴覚マスキング特性等を利用して仮数部データ
の量子化ビット長を決定し、それを量子化符号化回路３
６に出力する。量子化符号化回路３６は、ビット割り当
て回路３５より入力される量子化ビット長の分だけ、ブ
ロック浮動小数点変換回路３４より入力される仮数部デ
ータのＭＳＢ（最上位ビット）からデータを取り出して
量子化符号化の処理を行う。The bit allocation circuit 35 determines the quantization bit length of the mantissa data based on the exponent data input from the block floating point conversion circuit 34 by utilizing human auditory masking characteristics and the like. Quantization coding circuit 3
6 is output. The quantization encoding circuit 36 extracts data from the MSB (most significant bit) of the mantissa data input from the block floating point conversion circuit 34 by the amount of the quantization bit length input from the bit allocation circuit 35, and quantizes it. Performs encoding processing.

【０００７】以上の処理の結果、第１のデータ出力端子
３７から指数部データが出力され、第２のデータ出力端
子３８から符号化された仮数部データ（以下、仮数部符
号化データという）が出力される。これらの出力された
データは、図示しない記録媒体に蓄積されたり、図示し
ない伝送路を介して送信されたりする。As a result of the above processing, the exponent part data is output from the first data output terminal 37, and the encoded mantissa part data (hereinafter referred to as mantissa part encoded data) is output from the second data output terminal 38. Is output. These output data are accumulated in a recording medium (not shown) or transmitted via a transmission path (not shown).

【０００８】[0008]

【発明が解決しようとする課題】ところで、直交変換を
利用した符号化においては、高圧縮率で高音質を実現す
るという符号化効率の観点から、一般には、直交変換サ
イズであるＮの値を大きくした方が良い。By the way, in the coding using the orthogonal transform, the value of N, which is the orthogonal transform size, is generally set from the viewpoint of the coding efficiency of realizing high sound quality with a high compression rate. It is better to make it larger.

【０００９】しかし、図４に示すように、Ｎサンプル内
（この図４においては５１２サンプルの期間内）におい
て入力信号の性質が急激に変化するような場合、例えば
信号の急峻な立ち上がり等が発生する場合には、そのよ
うな信号を符号化して復号化すると、図５に示すよう
に、信号レベルの低い部分で符号化による雑音の割合が
多くなってしまい、プリエコーとして検知されてしまう
場合があった。However, as shown in FIG. 4, when the characteristics of the input signal change abruptly within N samples (in the period of 512 samples in FIG. 4), for example, a sharp rise of the signal occurs. In such a case, if such a signal is encoded and then decoded, as shown in FIG. 5, the ratio of noise due to encoding increases in a portion where the signal level is low, and it may be detected as a pre-echo. there were.

【００１０】そこで、過渡状態検出回路３２において、
入力信号の性質を分析して、直交変換サイズを適応的に
切り替える方法が従来より用いられている。この方法で
は、入力信号が定常状態にある場合には、Ｎサンプルサ
イズの直交変換を１回行い、Ｎ個の周波数データに変換
する。また、入力信号が過渡状態にある場合には、Ｎサ
ンプルを複数の区間に分けてそれぞれの区間に対して直
交変換を行い、Ｎ個の周波数データに変換する。Therefore, in the transient state detection circuit 32,
The method of analyzing the property of the input signal and adaptively switching the orthogonal transform size has been conventionally used. In this method, when the input signal is in a steady state, orthogonal transformation of N sample size is performed once to transform into N frequency data. When the input signal is in a transient state, N samples are divided into a plurality of sections, orthogonal conversion is performed on each section, and N pieces of frequency data are converted.

【００１１】例えば、Ｎサンプルを前半のＮ／２サンプ
ルと後半のＮ／２サンプルとに分けて、Ｎ／２サンプル
サイズの直交変換を前半と後半とで合計２回行い、Ｎ個
の周波数データに変換する。また、他の例では、Ｎサン
プルをＮ／４サンプル毎の等間隔に分けて合計４回の直
交変換を行い、Ｎ個の周波数データに変換する。これに
より、図６に示すように、過渡状態である区間を短くし
て（この図６では、２５６サンプルずつ）符号化による
雑音が及ぶ範囲を狭くすることで、プリエコーとして検
知されないようにしている。For example, N samples are divided into N / 2 samples in the first half and N / 2 samples in the second half, and orthogonal transform of N / 2 sample size is performed twice in total in the first half and the second half to obtain N frequency data. Convert to. In another example, N samples are divided at equal intervals of N / 4 samples, and a total of four orthogonal transforms are performed to transform into N frequency data. As a result, as shown in FIG. 6, the section in the transient state is shortened (in this FIG. 6, by 256 samples), the range covered by the coding noise is narrowed so that it is not detected as a pre-echo. .

【００１２】過渡状態の検出を行う方法としては、図７
に示すフィードバック型検出法（信学技報：EA90-65,P
P.23-30,1990.「適応ブロック長適応変換符号化(ATC-AB
S) による２５６Ｋｂｐｓステレオハイファイオーディ
オ符号化／復号装置」）が、最も検出精度の良いもので
あると考えられている。As a method for detecting a transient state, FIG.
Feedback type detection method shown in (Technical report: EA90-65, P
P.23-30, 1990. "Adaptive block length adaptive transform coding (ATC-AB
The 256 Kbps stereo high-fidelity audio encoder / decoder by S) ") is considered to have the best detection accuracy.

【００１３】この図７に示すフィードバック型検出法
は、Ｎサンプルサイズの直交変換と、それよりも短いサ
ンプルサイズ、例えばＮ／２，Ｎ／４の短いサンプルサ
イズの直交変換とをいくつか同時に行い、符号化復号化
処理を行う。そして、それらのサンプルサイズの中から
最終的に雑音レベルの低いサンプルサイズを選択する方
法である。このように、フィードバック型検出法では、
雑音レベルが最も低くなる直交変換サイズを選択してい
るので、聴感上ほぼ最高の音質を達成することができる
と考えられる。The feedback type detection method shown in FIG. 7 performs several orthogonal transforms of N sample size and a shorter sample size, for example, N / 2, N / 4 shorter sample size at the same time. , Encoding / decoding processing is performed. Then, a method of finally selecting a sample size having a low noise level from those sample sizes. Thus, in the feedback type detection method,
Since the orthogonal transform size with the lowest noise level is selected, it is considered that almost the best sound quality can be achieved in terms of hearing.

【００１４】この他にも、Ｎサンプルを複数の区間に分
けてそれぞれの区間に対して直交変換を行い（例えば、
Ｎサンプルを前半のＮ／２サンプルと後半のＮ／２サン
プルとに分けて、Ｎ／２サンプルサイズの直交変換を前
半と後半とで２回行い）、前半のＮ／２サンプルに対す
る周波数データと後半のＮ／２サンプルに対する周波数
データとの変化度によって過渡状態を検出する方法もあ
る。In addition to this, N samples are divided into a plurality of sections and orthogonal transformation is performed for each section (for example,
The N samples are divided into the first half N / 2 samples and the second half N / 2 samples, and the orthogonal transformation of the N / 2 sample size is performed twice in the first half and the second half), and the frequency data for the first half N / 2 samples is obtained. There is also a method of detecting the transient state based on the degree of change with the frequency data for the N / 2 sample in the latter half.

【００１５】また、直交変換は行わずに、時間軸上の入
力信号の低周波成分を高域通過フィルタにより遮断し、
その結果得られる入力信号の高周波成分の変化度によっ
て過渡状態を検出する方法もある。Further, without performing the orthogonal transformation, the low frequency component of the input signal on the time axis is cut off by the high pass filter,
There is also a method of detecting a transient state based on the degree of change of the high frequency component of the input signal obtained as a result.

【００１６】しかしながら、以上の過渡状態を検出する
方法では、検出精度が向上すれば処理も複雑になってし
まうという欠点があった。また、以上の検出方法では直
交変換処理やフィルタ処理を行っているため、計算負荷
が大きく、符号化の実時間処理などには適さないという
欠点もあった。However, the above method of detecting a transient state has a drawback that the processing becomes complicated if the detection accuracy is improved. Further, since the above-mentioned detection method performs orthogonal transform processing and filter processing, it has a drawback that the calculation load is large and it is not suitable for real-time processing of encoding.

【００１７】本発明は、このような欠点を解消するため
に成されたものであり、簡単な構成で、入力信号の過渡
状態を精度良く検出することができるようにすることを
目的とする。The present invention has been made in order to eliminate such drawbacks, and an object of the present invention is to make it possible to detect a transient state of an input signal with high accuracy with a simple structure.

【００１８】[0018]

【課題を解決するための手段】本発明は、入力信号を所
定区間毎に取り込んで２乗和を夫々求め、各区間毎に２
乗和された信号の少なくとも２以上の区間に渡る変化度
によって上記入力信号の過渡状態を検出する過渡状態検
出回路を有している。この過渡状態検出回路は、直交変
換処理やフィルタ処理を行わずに、時間軸上の入力信号
のある区間（例えば、直交変換回路の最大直交変換サイ
ズがＮである場合に、Ｎ／２またはＮ／４サイズの区
間）毎に２乗和を求め、その２乗和の値の変化度により
過渡状態を検出する。このように構成した本発明によれ
ば、時間軸上の入力信号の２乗和計算を行うだけで過渡
状態を検出することが可能となる。According to the present invention, an input signal is taken in for each predetermined section to obtain a sum of squares, and 2 is calculated for each section.
It has a transient state detection circuit that detects the transient state of the input signal by the degree of change of the summed signals over at least two sections. This transient state detection circuit does not perform orthogonal transform processing or filter processing, but has a certain section of the input signal on the time axis (for example, N / 2 or N when the maximum orthogonal transform size of the orthogonal transform circuit is N). The sum of squares is obtained for each (/ 4 size section), and the transient state is detected by the degree of change in the value of the sum of squares. According to the present invention having such a configuration, it is possible to detect the transient state only by performing the square sum calculation of the input signal on the time axis.

【００１９】[0019]

【発明の実施の形態】以下、本発明の一実施形態を図面
に基づいて説明する。図１は、本実施形態によるブロッ
ク長可変型変換符号化装置の構成例を示す図である。図
１に示す本実施形態では、ＭＤＣＴ回路３は、最大でＮ
サンプルサイズの直交変換が可能であり、過渡状態検出
回路２からの制御信号に応じて、Ｎサンプルサイズの直
交変換またはＮ／４サンプルサイズの直交変換を行う。DESCRIPTION OF THE PREFERRED EMBODIMENTS One embodiment of the present invention will be described below with reference to the drawings. FIG. 1 is a diagram showing a configuration example of a variable block length transform coding apparatus according to the present embodiment. In the present embodiment shown in FIG. 1, the MDCT circuit 3 has a maximum N
The orthogonal transform of the sample size is possible, and the orthogonal transform of the N sample size or the N / 4 sample size is performed according to the control signal from the transient state detection circuit 2.

【００２０】図１において、オーディオデータ入力端子
１より入力されたオーディオデータは、Ｎサンプル（Ｎ
は例えば５１２）を１ブロックとして、バッファ回路Ｂ
ＵＦに一時的に蓄えられる。このバッファ回路ＢＵＦに
蓄えられたオーディオデータは、その後過渡状態検出回
路２とＭＤＣＴ回路３とに入力される。以下では、簡単
のためにＮ＝５１２として説明する。In FIG. 1, the audio data input from the audio data input terminal 1 has N samples (N
Is, for example, 512) as one block, and the buffer circuit B
It is temporarily stored in the UF. The audio data stored in the buffer circuit BUF is then input to the transient state detection circuit 2 and the MDCT circuit 3. In the following, for simplicity, N = 512 will be described.

【００２１】過渡状態検出回路２は、入力される５１２
サンプルの信号をＮ／４サンプル（＝１２８サンプル）
毎の４つの小ブロックに分割し、各小ブロック毎にサン
プルの２乗和を求め、互いに隣接する小ブロック間の２
乗和の比を求める。そして、４つの小ブロックの２乗和
の比が何れも所定値よりも小さいときは、５１２サンプ
ルサイズの直交変換を行うようにＭＤＣＴ回路３に指示
する。また、４つの小ブロックの２乗和の比が少なくと
も１つの隣接小ブロック間で所定値よりも大きくなった
ときは、１２８サンプルサイズの直交変換を行うように
指示する。The transient state detection circuit 2 receives 512 as an input.
Sample signal is N / 4 samples (= 128 samples)
Each of the small blocks is divided into four small blocks, the sum of squares of samples is calculated for each small block, and 2 between adjacent small blocks is calculated.
Calculate the ratio of sums of products. When the ratios of the sums of squares of the four small blocks are all smaller than the predetermined value, the MDCT circuit 3 is instructed to perform the orthogonal transform of 512 sample size. Further, when the ratio of the sum of squares of the four small blocks becomes larger than a predetermined value between at least one adjacent small blocks, it is instructed to perform the orthogonal transform of 128 sample size.

【００２２】ＭＤＣＴ回路３は、過渡状態検出回路２か
ら５１２サンプルサイズの直交変換を行うように指示さ
れた場合には、バッファ回路ＢＵＦから取り込まれてい
た５１２サンプルのデータをもとに５１２サンプルサイ
ズの直交変換を行う。このとき、本実施形態では、窓掛
け処理によってデータの連続性を担保するために、以下
のような処理を行う。When the MDCT circuit 3 is instructed by the transient state detection circuit 2 to perform a 512-sample size orthogonal transform, the 512-sample size is obtained based on the 512-sample data fetched from the buffer circuit BUF. Orthogonal transform of. At this time, in the present embodiment, the following processing is performed in order to ensure the continuity of the data by the windowing processing.

【００２３】すなわち、まず、バッファ回路ＢＵＦに保
持されている５１２サンプルサイズのデータに対して５
１２サンプルサイズのＭＤＣＴ処理を１回行い、５１２
個の周波数データを得る。その後、バッファ回路ＢＵＦ
に保持されている５１２サンプルサイズのデータを２５
６サンプル分シフトして、現ブロックの後半の２５６サ
ンプルと新しく入力される次ブロックの前半の２５６サ
ンプルとで新たな５１２サンプルサイズのデータとし、
次のＭＤＣＴ処理に備える。That is, first, the data of 512 sample size held in the buffer circuit BUF is set to 5
512 sample size MDCT processing is performed once and 512
Obtain frequency data. After that, the buffer circuit BUF
The 512 sample size data stored in
By shifting by 6 samples, 256 samples in the latter half of the current block and 256 samples in the first half of the next block to be newly input become data of a new 512 sample size,
Prepare for the next MDCT process.

【００２４】このように、本実施形態では、Ｎ／２サン
プル（＝２５６サンプル）ずつデータを重複させてＭＤ
ＣＴ処理を行っている。この様子を図２（ａ）に示す。As described above, in the present embodiment, the data is overlapped by N / 2 samples (= 256 samples) and MD is added.
CT processing is performed. This situation is shown in FIG.

【００２５】上述のようにして５１２サンプルサイズの
ＭＤＣＴ処理を行った場合、図示しないデコーダ側にお
いて行われる逆ＭＤＣＴ処理は以下のようになる。な
お、ここでも簡単のためにＮ＝５１２として説明する。When the 512-sample size MDCT process is performed as described above, the inverse MDCT process performed on the decoder side (not shown) is as follows. It should be noted that here also, for simplification, description will be made with N = 512.

【００２６】１）符号化データのビットストリームより
復元された５１２個の周波数データは、５１２サンプル
サイズの逆ＭＤＣＴ処理が１回行われることによって、
５１２個の周波数データが５１２個の時間データに逆変
換される。２）バッファには前ブロックの後半２５６個の時間デー
タが残っており、この前ブロックの後半２５６個の時間
データと、現ブロックの５１２個の時間データのうち前
半２５６個の時間データとが加算され、デコーダから出
力される。1) The 512 frequency data restored from the bit stream of the encoded data are subjected to the inverse MDCT processing of 512 sample size once,
The 512 frequency data are inversely transformed into 512 time data. 2) The second half 256 time data of the previous block remains in the buffer, and the second half 256 time data of the previous block and the first half 256 time data of the 512 time data of the current block are added. And output from the decoder.

【００２７】３）上記バッファに保持されている前ブロ
ックの後半２５６個の時間データが、現ブロックの５１
２個の時間データのうち後半２５６個の時間データに入
れ換えられる。上記の１）２）３）の処理を繰り返し行う。3) The second half 256 time data of the previous block held in the above buffer are 51 times of the current block.
Of the two pieces of time data, the latter half 256 pieces of time data are replaced. The above processes 1) 2) 3) are repeated.

【００２８】また、ＭＤＣＴ回路３は、過渡状態検出回
路２からＮ／４サンプルサイズ（＝１２８サンプルサイ
ズ）の直交変換を行うように指示された場合には、バッ
ファ回路ＢＵＦから取り込まれていた５１２サンプルの
データに対して、１２８サンプルサイズ毎に４回に分け
て直交変換を行う。このときも、窓掛け処理のために、
図２（ｂ）に示すように、以下のような処理を行う。Further, when the MDCT circuit 3 is instructed by the transient state detection circuit 2 to perform the orthogonal transform of N / 4 sample size (= 128 sample size), 512 fetched from the buffer circuit BUF. Orthogonal transformation is performed on the sample data every four 128 sample sizes. Also at this time, because of the windowing process,
As shown in FIG. 2B, the following processing is performed.

【００２９】１）バッファ回路ＢＵＦに保持されている
５１２個のサンプルデータを４等分し、それぞれの１２
８個のサンプルデータに対して１２８サンプルサイズの
ＭＤＣＴ処理を行う。合計４回の１２８サンプルサイズ
のＭＤＣＴ処理によって、５１２個のサンプルデータが
直交変換され、５１２個の周波数データとなる。1) The 512 pieces of sample data held in the buffer circuit BUF are divided into four equal parts, and each of them is divided into 12 parts.
MDCT processing of 128 sample size is performed on 8 sample data. By a total of four MDCT processes of 128 sample size, 512 sample data are orthogonally transformed into 512 frequency data.

【００３０】２）バッファ回路ＢＵＦに保持されている
５１２個のサンプルデータが直交変換された後は、５１
２個のサンプルを２５６サンプル分シフトして、現ブロ
ックの後半２５６サンプルと新しく入力される次ブロッ
クの前半２５６サンプルとで新たな５１２サンプルのデ
ータとし、次のＭＤＣＴ処理に備える。2) After the 512 pieces of sample data held in the buffer circuit BUF have been orthogonally transformed, 51
The two samples are shifted by 256 samples to form new 512 samples of data in the latter half 256 samples of the current block and the first half 256 samples of the newly input next block, and prepare for the next MDCT processing.

【００３１】この図２（ｂ）のように、１２８サンプル
サイズのＭＤＣＴ処理を行う場合にも２５６サンプルず
つデータを重複させてＭＤＣＴ処理を行うのは、前後の
ブロックで重複させるサンプル長を一定（この例では２
５６サンプル）にしておくことによって、直交変換サイ
ズにかかわらずデコーダ側でのデータ取り込み処理を一
定に行えるようにするためである。As shown in FIG. 2B, even when MDCT processing of 128 sample size is performed, the MDCT processing is performed by duplicating data every 256 samples. 2 in this example
This is because the data acquisition process on the decoder side can be performed irrespective of the orthogonal transformation size by setting 56 samples).

【００３２】また、このような１２８サンプルサイズの
ＭＤＣＴ処理を行った場合におけるデコーダ側の逆ＭＤ
ＣＴ処理を以下に示す。Further, the inverse MD on the decoder side when such MDCT processing of 128 sample size is performed
The CT process is shown below.

【００３３】１）符号化データのビットストリームより
復元された５１２個の周波数データは、１２８サンプル
サイズの逆ＭＤＣＴ処理が４回行われることによって、
５１２個の周波数データが５１２個の時間データに逆変
換される。２）バッファには前ブロックの後半２５６個の時間デー
タが残っており、この前ブロックの後半２５６個の時間
データと、現ブロックの５１２個の時間データのうち前
半２５６個の時間データとが加算され、デコーダから出
力される。1) The 512 frequency data restored from the bit stream of the encoded data are subjected to inverse MDCT processing of 128 sample size four times,
The 512 frequency data are inversely transformed into 512 time data. 2) The second half 256 time data of the previous block remains in the buffer, and the second half 256 time data of the previous block and the first half 256 time data of the 512 time data of the current block are added. And output from the decoder.

【００３４】３）上記バッファに保持されている前ブロ
ックの後半２５６個の時間データが、現ブロックの５１
２個の時間データのうち後半２５６個の時間データに入
れ換えられる。上記の１）２）３）の処理を繰り返し行う。3) The latter half 256 time data of the previous block held in the above buffer are 51 times of the current block.
Of the two pieces of time data, the latter half 256 pieces of time data are replaced. The above processes 1) 2) 3) are repeated.

【００３５】このように、符号化側で５１２サンプルサ
イズのＭＤＣＴ処理を行った場合と１２８サンプルサイ
ズのＭＤＣＴ処理を行った場合とにおけるデコーダ側の
処理の違いは、１）の処理だけとなる。As described above, the difference in processing on the decoder side between the case where the 512-sample size MDCT processing is performed on the encoding side and the case where the 128-sample size MDCT processing is performed on the encoding side is only the processing of 1).

【００３６】以上のようにしてＭＤＣＴ回路３により直
交変換された周波数データ（５１２個のＭＤＣＴ係数デ
ータ）は、図３の場合と同様にブロック浮動小数点変換
回路４に供給される。ブロック浮動小数点変換回路４
は、ＭＤＣＴ回路３より入力される５１２個のＭＤＣＴ
係数データを仮数部と指数部とで表現される浮動小数点
データに変換し、指数部データをビット割り当て回路５
に出力するとともに、仮数部データを量子化符号化回路
６に出力する。The frequency data (512 MDCT coefficient data) orthogonally transformed by the MDCT circuit 3 as described above is supplied to the block floating point conversion circuit 4 as in the case of FIG. Block floating point conversion circuit 4
Is the 512 MDCTs input from the MDCT circuit 3.
The coefficient data is converted into floating point data represented by the mantissa part and the exponent part, and the exponent part data is converted into the bit allocation circuit 5.
And the mantissa part data to the quantization coding circuit 6.

【００３７】ビット割り当て回路５は、ブロック浮動小
数点変換回路４より入力される指数部データをもとに、
人の聴覚マスキング特性等を利用して仮数部データの量
子化ビット長を決定し、それを量子化符号化回路６に出
力する。量子化符号化回路６は、ビット割り当て回路５
より入力される量子化ビット長の分だけ、ブロック浮動
小数点変換回路４より入力される仮数部データのＭＳＢ
からデータを取り出して量子化符号化の処理を行う。The bit allocation circuit 5 is based on the exponent data input from the block floating point conversion circuit 4.
The quantization bit length of the mantissa data is determined by utilizing the human auditory masking characteristic and the like, and the determined quantization bit length is output to the quantization encoding circuit 6. The quantization coding circuit 6 includes a bit allocation circuit 5
MSB of the mantissa data input from the block floating point conversion circuit 4 by the quantization bit length input by
Then, the data is extracted from and the process of the quantization coding is performed.

【００３８】以上の処理の結果、第１のデータ出力端子
７から指数部データが出力され、第２のデータ出力端子
８から仮数部符号化データが出力される。これらの出力
されたデータは、図示しない記録媒体に蓄積されたり、
図示しない伝送路を介して送信されたりする。As a result of the above processing, the first data output terminal 7 outputs the exponent part data, and the second data output terminal 8 outputs the mantissa coded data. These output data are accumulated in a recording medium (not shown),
It may be transmitted via a transmission path (not shown).

【００３９】次に、上記した過渡状態検出回路２の動作
を更に詳しく説明する。図１のバッファ回路ＢＵＦより
過渡状態検出回路２に入力されるＮサンプル（＝５１２
サンプル）のデータは、まず２乗和計算部２１で、Ｎ／
４サンプル（＝１２８サンプル）の小ブロック毎に２乗
和が求められる。Next, the operation of the transient state detection circuit 2 described above will be described in more detail. N samples (= 512) input to the transient state detection circuit 2 from the buffer circuit BUF of FIG.
The data of (sample) is first calculated by the sum of squares calculation unit 21 as N /
The sum of squares is obtained for each small block of 4 samples (= 128 samples).

【００４０】Ｎサンプル（＝５１２サンプル）のデータ
をx(n),n=0,1,2,...,N-1とすると、各小ブロック毎のサ
ンプルデータの２乗和は次の式（１）で求められる。な
お、この式（１）において、ｍはＮ／４サンプルで成る
小ブロックのインデックスである。Assuming that the data of N samples (= 512 samples) is x (n), n = 0,1,2, ..., N-1, the sum of squares of the sample data of each small block is as follows. It is calculated by the equation (1). In this equation (1), m is an index of a small block composed of N / 4 samples.

【００４１】[0041]

【数１】 [Equation 1]

【００４２】次に、２乗和比較部２２で、上記式（１）
で計算されたＮ／４サンプル（＝１２８サンプル）で成
る小ブロック毎の２乗和をそれぞれ比較し、過渡状態を
検出する。この過渡状態の検出は、以下の手順により行
う。すなわち、互いに隣接する小ブロック間の２乗和
を、以下の式（２）〜（７）が成り立つかどうかを見る
ことによって比較する。ただし、これらの式中に示すＴ
ｈは、検出感度を調整するためのしきい値である。Next, in the sum of squares comparison unit 22, the above equation (1)
The sum of squares for each small block made up of N / 4 samples (= 128 samples) calculated in step 1 is compared with each other to detect a transient state. The detection of this transient state is performed by the following procedure. That is, the square sums between adjacent small blocks are compared by seeing whether the following expressions (2) to (7) hold. However, T shown in these equations
h is a threshold value for adjusting the detection sensitivity.

【００４３】s(m)＞Ｔｈ×s(m+1) （２） s(m+1)＞Ｔｈ×s(m) （３） s(m+1)＞Ｔｈ×s(m+2) （４） s(m+2)＞Ｔｈ×s(m+1) （５） s(m+2)＞Ｔｈ×s(m+3) （６） s(m+3)＞Ｔｈ×s(m+2) （７）S (m)> Th × s (m + 1) (2) s (m + 1)> Th × s (m) (3) s (m + 1)> Th × s (m + 2) (4) s (m + 2)> Th × s (m + 1) (5) s (m + 2)> Th × s (m + 3) (6) s (m + 3)> Th × s ( m + 2) (7)

【００４４】次に、ブロックサイズ決定部２３は、上記
２乗和比較部２２における比較の結果に基づいて、直交
変換サイズを決定する。すなわち、上記式（２）〜
（７）のうち少なくとも１つの条件を満足すれば、直交
変換サイズをＮ／４サンプルサイズ（＝１２８サンプル
サイズ）に決定する。また、上記式（２）〜（７）の何
れの条件も満足しなければ、直交変換サイズをＮサンプ
ルサイズ（＝５１２サンプルサイズ）に決定する。Next, the block size determining unit 23 determines the orthogonal transform size based on the result of the comparison in the square sum comparing unit 22. That is, the above equations (2) to
If at least one condition of (7) is satisfied, the orthogonal transform size is determined to be N / 4 sample size (= 128 sample size). If none of the conditions of the above equations (2) to (7) is satisfied, the orthogonal transform size is determined to be N sample size (= 512 sample size).

【００４５】上記の式（２）〜（７）では、互いに隣接
する小ブロック間の２乗和をそれぞれ比較することによ
って、入力信号の性質の急激な変化、例えば信号の急峻
な立ち上がりなどを検出している。その検出の感度を調
整するパラメータがしきい値Ｔｈである。しきい値Ｔｈ
は、その値を色々に変えながら実際の符号化復号化処理
を行い、聴感上の音質や雑音レベルを測定して、適切な
値を求めれば良い。In the above equations (2) to (7), a rapid change in the property of the input signal, such as a sharp rise of the signal, is detected by comparing the square sums of adjacent small blocks. doing. The parameter that adjusts the detection sensitivity is the threshold Th. Threshold Th
The actual encoding / decoding process may be performed while varying the value, and the audible sound quality and noise level may be measured to obtain an appropriate value.

【００４６】本実施形態では、Ｔｈ＝２０により良好な
結果が得られている。２０という数値は、信号振幅に換
算すると√２０≒４．４７である。すなわち、本実施形
態では、隣接する小ブロック間で平均振幅が４．４７倍
よりも大きくなる場合には、入力信号が過渡状態である
ということにしている。In this embodiment, good results are obtained when Th = 20. The numerical value of 20 is √20≈4.47 when converted into signal amplitude. That is, in the present embodiment, the input signal is in a transient state when the average amplitude between adjacent small blocks is larger than 4.47 times.

【００４７】なお、上記の実施形態においては、Ｎサン
プルサイズ（＝５１２サンプルサイズ）、Ｎ／４サンプ
ルサイズ（＝１２８サンプルサイズ）の直交変換が可能
なブロック長可変型変換符号化における使用を考慮して
過渡状態検出方法を実現した。これ以外にも、例えばＮ
サンプルサイズ、Ｎ／２サンプルサイズ（＝２５６サン
プルサイズ）の直交変換が可能なブロック長可変型変換
符号化にも適用することが可能である。In the above embodiment, consideration is given to use in variable block length transform coding capable of orthogonal transform of N sample size (= 512 sample size) and N / 4 sample size (= 128 sample size). And realized the transient state detection method. Besides this, for example, N
It is also applicable to variable block length transform coding capable of orthogonal transform of sample size and N / 2 sample size (= 256 sample size).

【００４８】この場合には、上記式（１）で示される４
つの小ブロック毎の２乗和計算を、Ｎサンプルの前半Ｎ
／２サンプルと後半Ｎ／２サンプルとの２つの２乗和計
算に置き換えるとともに、上記式（２）〜（７）で示さ
れる２乗和計算結果の比較を、前半Ｎ／２サンプルと後
半Ｎ／２サンプルとの２乗和の比較に置き換えること
で、過渡状態検出方法を実現することができる。このと
きの窓掛け処理のためのデータの重複のし方は、図２
（ｃ）のようにすれば良い。In this case, 4 shown in the above equation (1)
Calculate the sum of squares for each two small blocks in the first half N of N samples
/ 2 samples and latter half N / 2 samples are replaced with two square sum calculations, and the comparison of the square sum calculation results shown in the above equations (2) to (7) is performed by comparing the first half N / 2 samples and the latter half N The transient state detection method can be realized by replacing the sum of squares with the / 2 sample. How to duplicate the data for the windowing process at this time is shown in FIG.
It may be done as in (c).

【００４９】[0049]

【発明の効果】本発明は上述したように、入力信号を所
定区間毎に取り込んで２乗和を夫々求め、各区間毎に２
乗和された信号の少なくとも２以上の区間に渡る変化度
によって上記入力信号の過渡状態を検出するようにした
ので、直交変換処理やフィルタ処理を行わずに、時間軸
上の入力信号の２乗和計算を行うだけで過渡状態を検出
することができるようになり、計算負荷が小さく、符号
化の実時間処理に適している。本発明においては、直交
変換処理やフィルタ処理を用いた従来の過渡状態検出方
法とほぼ同程度の検出精度を実現できるので、実時間処
理という観点からすれば、計算負荷と検出精度との両性
能において優れている。As described above, according to the present invention, the input signal is taken in every predetermined section to obtain the sum of squares, and the sum of squares is set to 2 for each section.
Since the transient state of the input signal is detected by the degree of change of the summed signals over at least two sections, the square of the input signal on the time axis can be obtained without performing orthogonal transformation processing or filter processing. The transient state can be detected only by performing the sum calculation, the calculation load is small, and it is suitable for the real-time processing of encoding. In the present invention, it is possible to realize detection accuracy that is substantially the same as that of the conventional transient state detection method using orthogonal transformation processing and filter processing. Therefore, from the viewpoint of real-time processing, both the calculation load and the detection accuracy are improved. Is excellent in.

[Brief description of drawings]

【図１】本発明の一実施形態であるブロック長可変型変
換符号化装置の構成例を示すブロック図である。FIG. 1 is a block diagram showing a configuration example of a variable block length transform coding apparatus according to an embodiment of the present invention.

【図２】図１に示したＭＤＣＴ回路へのデータ取り込み
の様子を説明するための図である。FIG. 2 is a diagram for explaining how data is taken into the MDCT circuit shown in FIG.

【図３】従来のブロック長可変型変換符号化装置の構成
例を示すブロック図である。FIG. 3 is a block diagram showing a configuration example of a conventional block length variable transform coding apparatus.

【図４】急峻な立ち上がり部を含む入力信号を示す波形
図である。FIG. 4 is a waveform diagram showing an input signal including a steep rising portion.

【図５】図４の波形に対して５１２サンプルサイズのＭ
ＤＣＴによる符号化復号化処理を行った後に得られる、
プリエコー部を含む信号を示す波形図である。5 is an M of 512 sample size for the waveform of FIG.
Obtained after performing the encoding / decoding processing by DCT,
It is a wave form diagram which shows the signal containing a pre-echo part.

【図６】図４の波形に対して２５６サンプルサイズのＭ
ＤＣＴによる符号化復号化処理を行った後に得られる、
プリエコー部が軽減された信号を示す波形図である。6 is an M of 256 sample size for the waveform of FIG.
Obtained after performing the encoding / decoding processing by DCT,
It is a wave form diagram which shows the signal which the pre-echo part reduced.

【図７】従来のフィードバック型過渡状態検出方法を説
明するための図である。FIG. 7 is a diagram for explaining a conventional feedback type transient state detection method.

[Explanation of symbols]

１オーディオデータ入力端子２過渡状態検出回路３ＭＤＣＴ回路４ブロック浮動小数点変換回路５ビット割り当て回路６量子化符号化回路７第１のデータ出力端子８第２のデータ出力端子２１２乗和計算部２２２乗和比較部２３ブロックサイズ決定部ＢＵＦバッファ回路 1 Audio Data Input Terminal 2 Transient State Detection Circuit 3 MDCT Circuit 4 Block Floating Point Conversion Circuit 5 Bit Allocation Circuit 6 Quantization Coding Circuit 7 First Data Output Terminal 8 Second Data Output Terminal 21 Square Sum Calculator 22 Square sum comparison unit 23 Block size determination unit BUF buffer circuit

Claims

[Claims]

1. A variable block length transform coding apparatus having a transient state detection circuit for detecting a transient state of an input signal and determining an orthogonal transform size, wherein the transient state detection circuit has an input signal for each predetermined section. A block length variable conversion code, characterized in that the transient sum of the input signals is detected based on the degree of change in the signals summed up and squared in each section over at least two sections. Device.

2. A transient state of the input signal is detected by taking in an input signal for each predetermined section and obtaining a sum of squares for each section, and detecting a degree of change of the signal summed for the square for each section over at least two sections. A transient state detection circuit that determines the orthogonal transformation size by performing the orthogonal transformation by the unit of the orthogonal transformation size determined by the transient state detection circuit, and an orthogonal transformation performed by the orthogonal transformation circuit. A block floating point conversion circuit for converting the frequency data into a block floating point, a bit allocation circuit for allocating bits by the exponent part data supplied from the block floating point conversion circuit, and a mantissa part data obtained by the bit allocation circuit. According to the quantization bit length, the mantissa data supplied from the block floating point conversion circuit is quantized and encoded. Block length variable transform coding apparatus comprising: a quantization encoding circuit.

3. The transient state detection circuit takes in an input signal for each size of an integer fraction of the maximum orthogonal transform size of the orthogonal transform circuit, obtains the sum of squares of each, and at least the sum of squared signals. The variable block length transform coding apparatus according to claim 2, wherein the transient state of the input signal is detected by a degree of change over two or more sections.

4. A sum-of-squares calculating means for fetching an input signal for each predetermined section to obtain a sum of squares, and at least two or more sections of the signals summed squares for each predetermined section by the sum-of-squares calculating means. And a transient state detecting means for detecting the transient state of the input signal according to the degree of change over the transient state detecting device.