JP2003330497A

JP2003330497A - Method and device for encoding audio signal, encoding and decoding system, program for executing encoding, and recording medium with the program recorded thereon

Info

Publication number: JP2003330497A
Application number: JP2002140435A
Authority: JP
Inventors: Sen Chon Kok; セン・チョンコク; Hon Neo Sua; ホン・ネオスア
Original assignee: Matsushita Electric Industrial Co Ltd
Current assignee: Panasonic Holdings Corp
Priority date: 2002-05-15
Filing date: 2002-05-15
Publication date: 2003-11-19

Abstract

<P>PROBLEM TO BE SOLVED: To minimize artifacts caused at the time of switching of a state mode of MS stereo encoding. <P>SOLUTION: A joint stereo encoder 14-1 encodes a digitized audio signal which includes a pair of stereo channels of a left channel and a right channel and has each frame divided into consecutive frames consisting of a plurality of time samples, to one of a plurality of state modes of MS stereo encoding. In this encoding, the joint stereo encoder 14-1 calculates respective energies of the left channel and the right channel of a current frame of the audio signal and determines a state mode of MS stereo encoding of the current frame on the basis of the value of a ratio of the energy of the right channel to that of the left channel and the state mode of MS stereo encoding of at least one just preceding frame. <P>COPYRIGHT: (C)2004,JPO

Description

Detailed Description of the Invention

【０００１】[0001]

【発明の属する技術分野】本発明は、ミッドサイド（Ｍ
Ｓ）ステレオ符号化において、アーティファクト（疑似
信号）を導入することなく、ＭＳステレオ符号化に係る
さまざまな状態モード間での遷移が実行可能になるよう
な、オーディオ信号の符号化方法及び装置、符号化及び
復号化システム、並びに符号化を実行するプログラム及
び当該プログラムを記録した記録媒体に関する。TECHNICAL FIELD The present invention relates to a midside (M
S) In stereo coding, an audio signal coding method, apparatus, and code that enable transitions between various state modes related to MS stereo coding to be performed without introducing artifacts (pseudo signals) The present invention relates to an encoding / decoding system, a program for executing encoding, and a recording medium recording the program.

【０００２】[0002]

【従来の技術】オーディオ信号の符号化の目的は、符号
化されたオーディオストリームが復号化器において圧縮
解凍された後でもオリジナルソースに対する可能な限り
高い忠実度が保持されるように、ディジタル化されたオ
ーディオストリームを符号化器において可能な限り少量
のビットに圧縮することにある。左（Ｌ）チャンネルと
右（Ｒ）チャンネルを備えたステレオ信号に対して一般
に用いられている圧縮方法は、ジョイントステレオ符号
化として知られている。ジョイントステレオ符号化は、
ミッドサイドステレオ（一般に、ＭＳステレオと略され
る。）及びインテンシティステレオ（一般に、ＩＳステ
レオと略される。）として知られる２つの異なる符号化
方法の集合的な用語である。図４は、ステレオ信号３１
にＭＳステレオ符号化とＩＳステレオ符号化とを適用し
て、ジョイントステレオ信号３２を生成する典型的な例
を提示している。変換に先行して各チャンネルのフレー
ムは少なくとも２つのサブバンドフレームにフィルタリ
ングされ、一方のサブバンドフレームは０乃至５．５ｋ
Ｈｚの周波数帯、他方は５．５ｋＨｚより高い周波数帯
を表す。The purpose of encoding an audio signal is to digitize it so that the encoded audio stream retains the highest possible fidelity to the original source even after being decompressed in a decoder. It is to compress the audio stream into as few bits as possible in the encoder. A commonly used compression method for stereo signals with left (L) and right (R) channels is known as joint stereo coding. Joint stereo coding is
It is a collective term for two different encoding methods known as midside stereo (generally abbreviated as MS stereo) and intensity stereo (generally abbreviated as IS stereo). FIG. 4 shows a stereo signal 31
A typical example of applying the MS stereo coding and the IS stereo coding to generate the joint stereo signal 32 is presented. Prior to the conversion, each channel frame is filtered into at least two subband frames, one subband frame being 0 to 5.5k.
Hz frequency band, the other represents a frequency band higher than 5.5 kHz.

【０００３】ＩＳステレオ符号化は、高い周波数（＞５
ｋＨｚ）では耳が正確に空間性を識別できないことを利
用して、左チャンネル及び右チャンネルの和信号のみを
符号化する。このことは、実質的に、オリジナルのステ
レオの各チャンネルから送られる情報を半分に削減す
る。復号化器では、結合された信号を、所定の増幅係数
及び減衰係数に基づいて異なるさまざまな量で増幅し又
は減衰し、新しい信号を左右のチャンネルに割り当てる
ことにより、空間性の感覚が部分的に作り直される。増
幅係数と減衰係数は符号化器で決定される。IS stereo coding is based on high frequency (> 5).
Only the sum signal of the left channel and the right channel is coded by taking advantage of the fact that the ear cannot accurately discriminate the spatiality at (kHz). This effectively cuts in half the information sent from each channel of the original stereo. In the decoder, the combined signal is amplified or attenuated by different amounts depending on the predetermined amplification factor and attenuation factor, and the new signal is assigned to the left and right channels, so that the spatial sense is partially Remade. The amplification coefficient and the attenuation coefficient are determined by the encoder.

【０００４】残念ながら、オーディオ信号のスペクトル
のうちの５ｋＨｚを下回る周波数帯に対しては、空間性
を識別するために耳が十分に敏感であるので、ＩＳステ
レオ符号化を適用することができない。この周波数帯で
は通常、本発明の要点であるＭＳステレオ符号化が用い
られる。Unfortunately, for frequencies below 5 kHz of the audio signal's spectrum, the IS stereo coding cannot be applied because the ears are sensitive enough to identify spatiality. In this frequency band, MS stereo coding, which is the main point of the present invention, is usually used.

【０００５】ＭＳステレオ符号化は、オリジナルの２チ
ャンネル信号を変換して、一方のチャンネルが他方より
重要な（すなわち、より多くの情報を含む）新しい信号
の対を発生する。例えば、左チャンネル（Ｌ）と右チャ
ンネル（Ｒ）とが互いに極めてよく類似している多くの
一般的な信号の場合、オリジナルのステレオチャンネル
を互いに加算し、かつ互いに減算することで、新しいチ
ャンネルを形成することができる。「差」信号である
Ｒ’ チャンネルは「和」信号であるＬ’チャンネルよ
り弱い信号を伝送するので、符号化に必要なビット数は
より少ない。ＭＳステレオ符号化には、さまざまなオー
ディオ符号化器に対して異なる変形例が存在する。ＡＴ
ＲＡＣ３の場合、Ｌ’チャンネルはつねに元のチャンネ
ルの平均の信号（Ｌ＋Ｒ）／２であるのに対し、Ｒ’チ
ャンネルは、差信号の半分の信号（Ｌ−Ｒ）／２、左チ
ャンネルの半分の信号Ｌ／２、及び右チャンネルの半分
の符号を反転した信号−Ｒ／２のうちのいずれかである
ことが可能である。以後、ＭＳステレオ符号化のこうし
た３つの可能な「状態モード」をそれぞれＳ／Ｄモー
ド、Ｓ／Ｌモード、及びＳ／Ｒモードと呼ぶ。MS stereo coding transforms the original two-channel signal to produce a new signal pair in which one channel is more important (ie, contains more information) than the other. For example, for many common signals where the left channel (L) and the right channel (R) are very similar to each other, the original stereo channels are added to and subtracted from each other to create a new channel. Can be formed. The R'channel, which is the "difference" signal, carries a weaker signal than the L'channel, which is the "sum" signal, and thus requires fewer bits for encoding. There are different variants of MS stereo coding for various audio encoders. AT
In the case of RAC3, the L'channel is always the average signal (L + R) / 2 of the original channel, while the R'channel is half the difference signal (LR) / 2, half of the left channel. Signal L / 2 and the signal -R / 2 with the sign of the right channel half inverted. Hereinafter, these three possible "state modes" of MS stereo coding are referred to as S / D mode, S / L mode, and S / R mode, respectively.

【０００６】ＭＳステレオ符号化が適用される必要のあ
るサブバンドフレーム（すなわち、０乃至５．５ｋＨｚ
の周波数のサブバンドフレーム）が決まると、ＡＴＲＡ
Ｃ３に準拠したビットストリームの形成前に決定される
べき残りのパラメータは、ＭＳステレオ符号化に係る現
在の状態モードがＳ／Ｄ，Ｓ／Ｌ又はＳ／Ｒのいずれで
あるかを表すチャンネル切り換え情報ＳＷＩのみとな
る。Sub-band frames (ie 0 to 5.5 kHz) to which MS stereo coding has to be applied.
When the subband frame of the frequency of
The remaining parameters to be determined before the formation of the C3 compliant bitstream are the channel switching, which indicates whether the current state mode for MS stereo coding is S / D, S / L or S / R. Information SWI only.

【０００７】[0007]

【表１】 ―――――――――――――――――――――――――――――――――――― ＳＷＩ状態モードＬ’ Ｒ’ 従来技術の場合の条件 ―――――――――――――――――――――――――――――――――――― ３Ｓ／Ｄ (L+R)/2 (L-R)/2 ＬとＲが類似する場合０Ｓ／Ｌ (L+R)/2 L/2 Ｒの方がＬより重要な場合１Ｓ／Ｒ (L+R)/2 -R/2 Ｌの方がＲより重要な場合 ――――――――――――――――――――――――――――――――――――[Table 1] ―――――――――――――――――――――――――――――――――――― SWI state mode L'R 'Conditions for prior art ―――――――――――――――――――――――――――――――――――― 3 S / D (L + R) / 2 (L-R) / 2 When L and R are similar 0 S / L (L + R) / 2 L / 2 R is more important than L 1 S / R (L + R) / 2 -R / 2 When L is more important than R ――――――――――――――――――――――――――――――――――――

【０００８】従来技術の方法では、Ｌチャンネル及びＲ
チャンネルの「重要さ」の基準として、一般にそれらＬ
チャンネル及びＲチャンネルのエネルギーが用いられ
る。サブバンドフレームのエネルギーＥ_Ｌ及びＥ_Ｒは通
常、各チャンネルのサブバンドフレーム内のすべての時
間サンプルの平方の総和で近似される。この従来技術の
条件に基づけば、現在のサブバンドフレームのＭＳステ
レオ符号化の状態モードを決定するための単純な方法
は、ｆを１より大きいしきい値係数として、次のように
なる。In the prior art method, L channel and R
As a measure of the "importance" of a channel, those L
The channel and R channel energies are used. The subband frame energies E _L and E _R are usually approximated by the sum of the squares of all time samples in the subband frame of each channel. Based on this prior art condition, a simple method for determining the state mode of MS stereo coding of the current subband frame is as follows, where f is a threshold factor greater than one.

【０００９】[0009]

【数１】Ｅ_Ｒ＞ｆ×Ｅ_Ｌであるとき、ＳＷＩ＝０そうではなく、Ｅ_Ｌ＞ｆ×Ｅ_Ｒであるとき、ＳＷＩ＝１上のいずれでもないとき、ＳＷＩ＝３## EQU1 ## When E _R > f × E _L , SWI = 0. Otherwise, when E _L > f × E _R , SWI = 1. When none of the above, SWI = 3.

【００１０】[0010]

【発明が解決しようとする課題】上記のような単純かつ
直接的な方法は、そのエネルギーＥ_Ｌ及びＥ_Ｒ値がThe simple and direct method as described above is such that the energy E _L and E _R values are

【数２】Ｅ_Ｒ≒ｆ×Ｅ_Ｌ又は## EQU2 ## E _R ≈f × E _L or

【数３】Ｅ_Ｌ≒ｆ×Ｅ_Ｒの範囲の辺りの値で滑らかに変化して漂っている信号に
対して、ＭＳステレオ符号化に係る２つの状態モード間
（Ｓ／ＤとＳ／Ｌモードの間、又はＳ／ＤとＳ／Ｒモー
ドの間）での高速のトグル、すなわち、ＭＳステレオ符
号化の状態モード間を急速に往復する切り換えを発生さ
せる可能性がある。ＭＳステレオ符号化に係る２つの状
態モード間のこうした高速のトグルは、現実の実装では
可聴なアーティファクトを発生させることが発見されて
いる。図９はこれの一例を示し、Ｌチャンネル及びＲチ
ャンネルにてなる信号に対して、従来技術に係るＭＳス
テレオ符号化の状態モード切り換え処理を適用した例を
示すグラフである。図９では、ｔを離散時間のパラメー
タとして、サブバンドフレームｔからサブバンドフレー
ムｔ＋３までの間において、エネルギーＥ_Ｌのわずかな
変化に起因し、Ｓ／ＤモードとＳ／Ｌモードとの間を往
復する高速のトグルが発生している。この高速のトグル
のために、信号強度が滑らかに変化している信号にアー
ティファクトを生じさせることがある。Respect Equation 3] E _L ≒ f × E _R range around the value smoothly changes to drifting signal of, between the two-state mode of the MS stereo coding (S / D and S / L Fast toggle between modes or between S / D and S / R modes, i.e. it can cause a rapid round-trip switch between state modes of MS stereo coding. It has been discovered that such a fast toggle between the two state modes associated with MS stereo coding causes audible artifacts in real world implementations. FIG. 9 shows an example of this, and is a graph showing an example in which the state mode switching process of the MS stereo coding according to the related art is applied to the signals of the L channel and the R channel. In FIG. 9, using t as a discrete-time parameter, between the subband frame t and the subband frame t + 3, due to a slight change in the energy E _L , the S / D mode and the S / L mode are changed. A high-speed toggle that reciprocates is occurring. This high-speed toggle can cause artifacts in the signal where the signal strength is changing smoothly.

【００１１】変換後は重要さのより少ない方のチャンネ
ルにより少ないビット数を配分するようにＭＳステレオ
符号化を利用して重要さの不均衡な信号（Ｌ’及びＲ’
チャンネル信号）を生成する一方で、ＭＳステレオ符号
化に係る２つの状態モード間での高速のトグルからアー
ティファクトが発生しないことを保証するような予防策
が講じられる必要がある。例えば、図９が示すような、
信号強度がサブバンドフレーム間で滑らかに変化する信
号の場合、Ｓ／Ｌ又はＳ／Ｒモードに入った時点で当該
モードがそのすぐ次のサブバンドフレームにおいてＳ／
Ｄモードに戻る（リープバックする）ことを防止する機
構が実装されていなければならない。After conversion, MS stereo coding is used to allocate less bits to the less important channels, and the imbalanced signals (L 'and R').
Channel signal), while precautions must be taken to ensure that no artifacts result from the fast toggle between the two state modes associated with MS stereo coding. For example, as shown in FIG.
In the case of a signal whose signal strength changes smoothly between subband frames, when the S / L or S / R mode is entered, the mode is changed to S / L in the next subband frame.
A mechanism must be implemented to prevent returning (leaping back) to D mode.

【００１２】それにも関わらず、こうした予防策は、Ｍ
Ｓステレオ符号化において先行フレームとは異なる状態
モードに遷移することを合理的に正当化できるような信
号を損なうほど、過度に適用されてはならない。これ
は、必要なときのＭＳステレオ符号化に係る状態モード
の切り換えに失敗したときにもまたアーティファクトが
導入されるからである。従って、上述された、状態モー
ドの過度の切り換えを防止するための機構は、控えめに
適用されなければならない。Nevertheless, such preventive measures are
It should not be applied excessively enough to undermine a signal that can reasonably be justified in S stereo coding to transition to a different state mode than the previous frame. This is because the artifacts are also introduced when the state mode switching for MS stereo coding fails when needed. Therefore, the mechanism described above for preventing excessive switching of state modes must be applied sparingly.

【００１３】本発明の目的は、以上の問題点を解決し、
ＭＳステレオ符号化の状態モードの切り換えにおいて望
ましくないアーティファクトの発生を防止することがで
きる、オーディオ信号の符号化方法及び装置、符号化及
び復号化システム、並びに符号化を実行するプログラム
及び当該プログラムを記録した記録媒体を提供すること
にある。The object of the present invention is to solve the above problems,
Audio signal encoding method and apparatus, encoding and decoding system capable of preventing occurrence of undesirable artifacts in MS stereo encoding state mode switching, program for executing encoding, and recording of the program To provide the recording medium.

【００１４】[0014]

【課題を解決するための手段】本発明の態様に係るオー
ディオ信号の符号化方法は、左チャンネルと右チャンネ
ルとからなる１対のステレオチャンネルを含みかつ各フ
レームが複数の時間サンプルにてなる連続的なフレーム
に分割されているディジタル化されたオーディオ信号
を、ミッドサイドステレオ符号化の複数の状態モードの
うちのいずれか１つの状態モードに符号化し、上記オー
ディオ信号の符号化方法は、上記オーディオ信号の現在
のフレームの左チャンネル及び右チャンネルの各エネル
ギーを計算するステップと、上記左チャンネルのエネル
ギーに対する上記右チャンネルのエネルギーの比の値
と、少なくとも１つの先行する直前のフレームのミッド
サイドステレオ符号化の状態モードとに基づいて、現在
のフレームのミッドサイドステレオ符号化の状態モード
を決定するステップとを含むことを特徴とする。SUMMARY OF THE INVENTION An audio signal encoding method according to an aspect of the present invention includes a pair of stereo channels consisting of a left channel and a right channel, each frame being a sequence of a plurality of time samples. A digitized audio signal divided into typical frames is encoded into any one of a plurality of state modes of midside stereo encoding, and the encoding method of the audio signal comprises: Calculating each of the left and right channel energies of the current frame of the signal, a value of the ratio of the right channel energy to the left channel energy, and at least one preceding side mid-side stereo code The current frame's midsize based on the Characterized in that it comprises the step of determining the status mode of de stereo coding.

【００１５】上記オーディオ信号の符号化方法におい
て、上記ミッドサイドステレオ符号化は、左右のチャン
ネルの信号の和信号と差信号とを含む第１の状態モード
と、左右のチャンネルの信号の和信号と左チャンネルの
信号とを含む第２の状態モードと、左右のチャンネルの
信号の和信号と右チャンネルの信号とを含む第３の状態
モードとを含むことを特徴とする。In the audio signal encoding method, the mid-side stereo encoding includes a first state mode including a sum signal and a difference signal of left and right channel signals and a sum signal of left and right channel signals. It is characterized by including a second state mode including a left channel signal and a third state mode including a sum signal of left and right channel signals and a right channel signal.

【００１６】また、上記オーディオ信号の符号化方法に
おいて、上記現在のフレームのミッドサイドステレオ符
号化の状態モードを決定するステップは、上記比の値が
所定の第１のしきい値よりも大きいとき、現在のフレー
ムのミッドサイドステレオ符号化の状態モードを上記第
２の状態モードに遷移させるステップと、上記比の値が
上記第１のしきい値の逆数よりも小さいとき、現在のフ
レームのミッドサイドステレオ符号化の状態モードを上
記第３の状態モードに遷移させるステップと、上記先行
する直前のフレームのミッドサイドステレオ符号化の状
態モードが上記第３の状態モードにあり、上記比の値
が、上記第１のしきい値以下であり、かつ上記第１のし
きい値よりも小さい所定の第２のしきい値より大きいと
き、現在のフレームのミッドサイドステレオ符号化の状
態モードを上記第１の状態モードに遷移させるステップ
と、上記先行する直前のフレームのミッドサイドステレ
オ符号化の状態モードが上記第２の状態モードにあり、
上記比の値が、上記第１のしきい値の逆数以上であり、
かつ上記第２のしきい値の逆数より小さいとき、現在の
フレームのミッドサイドステレオ符号化の状態モードを
上記第１の状態モードに遷移させるステップとを含むこ
とを特徴とする。In the audio signal encoding method, the step of determining the state mode of the mid-side stereo encoding of the current frame is performed when the value of the ratio is larger than a predetermined first threshold value. , Transitioning the mid-side stereo coding state mode of the current frame to the second state mode; and, when the value of the ratio is less than the reciprocal of the first threshold, mid of the current frame. Transitioning the side stereo coding state mode to the third state mode, and the mid-side stereo coding state mode of the immediately preceding preceding frame is in the third state mode, and the value of the ratio is , Less than or equal to the first threshold and greater than a predetermined second threshold less than the first threshold, the current frame A step of shifting the state mode of the mid-side stereo coding in the first state mode, the state mode of the mid-side stereo coding of the previous frame to the preceding is in the second state mode,
The value of the ratio is greater than or equal to the reciprocal of the first threshold,
And transitioning the mid-side stereo coding state mode of the current frame to the first state mode when it is smaller than the reciprocal of the second threshold value.

【００１７】さらに、上記オーディオ信号の符号化方法
において、上記現在のフレームのミッドサイドステレオ
符号化の状態モードを決定するステップは、上記先行す
る直前のフレームのミッドサイドステレオ符号化の状態
モードが上記第２の状態モードにあり、上記比の値が上
記第２のしきい値の逆数以上であるとき、現在のフレー
ムのミッドサイドステレオ符号化の状態モードを上記第
２の状態モードに保持するステップと、上記先行する直
前のフレームのミッドサイドステレオ符号化の状態モー
ドが上記第３の状態モードにあり、上記比の値が上記第
２のしきい値以下であるとき、現在のフレームのミッド
サイドステレオ符号化の状態モードを上記第３の状態モ
ードに保持するステップとをさらに含むことを特徴とす
る。Further, in the audio signal encoding method, the step of determining the state mode of the midside stereo encoding of the current frame is performed by determining the state mode of the midside stereo encoding of the immediately preceding frame. Holding the mid-side stereo coding state mode of the current frame in the second state mode when in the second state mode and the value of the ratio is greater than or equal to the reciprocal of the second threshold. And the mid-side stereo coding state mode of the preceding immediately preceding frame is in the third state mode and the ratio value is less than or equal to the second threshold value, the mid-side of the current frame is Holding the stereo encoding state mode in the third state mode.

【００１８】本発明のもう１つの態様に係るオーディオ
信号の符号化装置は、左チャンネルと右チャンネルとか
らなる１対のステレオチャンネルを含みかつ各フレーム
が複数の時間サンプルにてなる連続的なフレームに分割
されているディジタル化されたオーディオ信号を、ミッ
ドサイドステレオ符号化の複数の状態モードのうちのい
ずれか１つの状態モードに符号化し、上記オーディオ信
号の符号化装置は、上記オーディオ信号の現在のフレー
ムの左チャンネル及び右チャンネルの各エネルギーを計
算する計算手段と、上記左チャンネルのエネルギーに対
する上記右チャンネルのエネルギーの比の値と、少なく
とも１つの先行する直前のフレームのミッドサイドステ
レオ符号化の状態モードとに基づいて、現在のフレーム
のミッドサイドステレオ符号化の状態モードを決定する
決定手段とを備えたことを特徴とする。An audio signal encoding apparatus according to another aspect of the present invention includes a continuous frame including a pair of stereo channels consisting of a left channel and a right channel, each frame including a plurality of time samples. A digitized audio signal divided into a plurality of state modes of midside stereo encoding is encoded into one of the state modes, Means for calculating the respective energies of the left channel and the right channel of the frame, a value of the ratio of the energy of the right channel to the energy of the left channel, and at least one preceding mid-side stereo coding of the preceding frame. Based on the state mode and the midsides of the current frame Characterized by comprising a determining means for determining a state mode of Leo coding.

【００１９】上記オーディオ信号の符号化装置におい
て、上記ミッドサイドステレオ符号化は、左右のチャン
ネルの信号の和信号と差信号とを含む第１の状態モード
と、左右のチャンネルの信号の和信号と左チャンネルの
信号とを含む第２の状態モードと、左右のチャンネルの
信号の和信号と右チャンネルの信号とを含む第３の状態
モードとを含むことを特徴とする。In the audio signal encoding device, the mid-side stereo encoding is performed in a first state mode including a sum signal and a difference signal of left and right channel signals, and a sum signal of left and right channel signals. It is characterized by including a second state mode including a left channel signal and a third state mode including a sum signal of left and right channel signals and a right channel signal.

【００２０】また、上記オーディオ信号の符号化装置に
おいて、上記決定手段は、上記比の値が所定の第１のし
きい値よりも大きいとき、現在のフレームのミッドサイ
ドステレオ符号化の状態モードを上記第２の状態モード
に遷移させ、上記比の値が上記第１のしきい値の逆数よ
りも小さいとき、現在のフレームのミッドサイドステレ
オ符号化の状態モードを上記第３の状態モードに遷移さ
せ、上記先行する直前のフレームのミッドサイドステレ
オ符号化の状態モードが上記第３の状態モードにあり、
上記比の値が、上記第１のしきい値以下であり、かつ上
記第１のしきい値よりも小さい所定の第２のしきい値よ
り大きいとき、現在のフレームのミッドサイドステレオ
符号化の状態モードを上記第１の状態モードに遷移さ
せ、上記先行する直前のフレームのミッドサイドステレ
オ符号化の状態モードが上記第２の状態モードにあり、
上記比の値が、上記第１のしきい値の逆数以上であり、
かつ上記第２のしきい値の逆数より小さいとき、現在の
フレームのミッドサイドステレオ符号化の状態モードを
上記第１の状態モードに遷移させることを特徴とする。Further, in the audio signal encoding apparatus, the determining means determines the mid-side stereo encoding state mode of the current frame when the value of the ratio is larger than a predetermined first threshold value. Transition to the second state mode, and when the value of the ratio is less than the reciprocal of the first threshold, transition the mid-side stereo coding state mode of the current frame to the third state mode. Then, the state mode of the mid-side stereo coding of the preceding immediately preceding frame is in the third state mode,
When the ratio value is less than or equal to the first threshold value and greater than a predetermined second threshold value that is less than the first threshold value, the mid-side stereo coding of the current frame is performed. The state mode is transited to the first state mode, and the state mode of mid-side stereo coding of the preceding immediately preceding frame is in the second state mode,
The value of the ratio is greater than or equal to the reciprocal of the first threshold,
And, when it is smaller than the reciprocal of the second threshold value, the mid-side stereo coding state mode of the current frame is transited to the first state mode.

【００２１】さらに、上記オーディオ信号の符号化装置
において、上記決定手段は、上記先行する直前のフレー
ムのミッドサイドステレオ符号化の状態モードが上記第
２の状態モードにあり、上記比の値が上記第２のしきい
値の逆数以上であるとき、現在のフレームのミッドサイ
ドステレオ符号化の状態モードを上記第２の状態モード
に保持し、上記先行する直前のフレームのミッドサイド
ステレオ符号化の状態モードが上記第３の状態モードに
あり、上記比の値が上記第２のしきい値以下であると
き、現在のフレームのミッドサイドステレオ符号化の状
態モードを上記第３の状態モードに保持することを特徴
とする。Further, in the audio signal encoding apparatus, the determining means has the state mode of the midside stereo encoding of the immediately preceding preceding frame in the second state mode, and the value of the ratio is the above. When the reciprocal of the second threshold value or more, the state mode of the midside stereo encoding of the current frame is held in the second state mode, and the state of the midside stereo encoding of the immediately preceding frame is held. Keep the mid-side stereo coding state mode of the current frame in the third state mode when the mode is in the third state mode and the ratio value is less than or equal to the second threshold value. It is characterized by

【００２２】本発明の別の態様に係るオーディオ信号の
符号化及び復号化システムは、上記オーディオ信号の符
号化装置と、オーディオ信号の復号化装置とを備え、上
記オーディオ信号の復号化装置は、上記決定されたミッ
ドサイドステレオ符号化の状態モードに基づいて、上記
符号化されたオーディオ信号を復号化し、復号化された
１対のステレオチャンネルを含むオーディオ信号を得て
出力する手段を備えたことを特徴とする。An audio signal encoding and decoding system according to another aspect of the present invention includes the audio signal encoding device and the audio signal decoding device, and the audio signal decoding device includes: Means for decoding the encoded audio signal based on the determined state mode of the mid-side stereo encoding, and obtaining and outputting the decoded audio signal including a pair of stereo channels Is characterized by.

【００２３】本発明のさらに別の態様に係るプログラム
は、上記オーディオ信号の符号化方法に係るステップを
含むことを特徴とする。A program according to still another aspect of the present invention is characterized by including steps related to the audio signal encoding method.

【００２４】本発明のもう１つの態様によれば、上記プ
ログラムを記録し、コンピュータが読み取り可能な記録
媒体が提供される。According to another aspect of the present invention, there is provided a computer-readable recording medium in which the above program is recorded.

【００２５】前述の目的を達成するために、さまざまな
状態モード間の切り換えのための規準は、元のチャンネ
ルのエネルギーだけでなく先行するサブバンドフレーム
のＭＳステレオ符号化の状態モードをも考慮しなければ
ならない。To achieve the above-mentioned object, the criterion for switching between the various state modes considers not only the energy of the original channel but also the state mode of the MS stereo coding of the preceding subband frame. There must be.

【００２６】本発明の別の態様に係るオーディオ信号の
符号化方法は、１）現在のサブバンドフレームの両方のチャンネルにつ
いて、エネルギーＥ_Ｌ及びＥ_Ｒを計算するステップと、２）先行するサブバンドフレームに適用されたＭＳステ
レオ符号化の状態モードを記録するステップと、３）所定の範囲のエネルギーの比の値Ｅ_Ｒ／Ｅ_Ｌ（又は
Ｅ_Ｌ／Ｅ_Ｒ）に関して、当該状態モードが先行する状態
モードを受け継ぐことを保証するステップとを含み、こ
のことは、所定の状態モードＡから所定の状態モードＢ
に切り換えるための条件が、状態モードＢから状態モー
ドＡに切り換えるための条件の逆ではない場合に達成さ
れることが可能であり、４）一方のチャンネルで他方のチャンネルに比較して急
激に上昇するエネルギーの徴候が現れた場合に、ＭＳス
テレオ符号化に係る高速の状態モード切り換えを許容す
るステップを含むことを特徴とする。An audio signal encoding method according to another aspect of the present invention comprises: 1) calculating energies E _L and E _R for both channels of the current subband frame; and 2) preceding subband. and recording the state mode of the MS stereo coding applied to the frame, 3) for a given range of energy values of the ratio E _{R /} E L _(or E _{L /} E _R), the status mode is preceded Ensuring that the state mode is inherited, which means that the predetermined state mode A to the predetermined state mode B
Can be achieved if the condition for switching to state mode B is not the reverse of the condition for switching from state mode B, 4) one channel rises sharply compared to the other. The step of allowing fast state mode switching for MS stereo coding in the event of an indication of energy to occur.

【００２７】本発明のもう１つの態様に係るオーディ信
号の符号化方法は、ディジタル化されたオーディオ信号
のＭＳステレオ符号化に係る状態モードを決定する一方
で、アーティファクトの発生を最小化するか又はアーテ
ィファクトをまったく発生させないオーディオ信号の符
号化方法であって、上記オーディオ信号は複数の時間サ
ンプルにてなる連続的なフレームに分割された１対のス
テレオチャンネルを備え、（ａ）上記信号の現在のフレ
ームの左チャンネル及び右チャンネルのエネルギーＥ_Ｌ
及びＥ_Rを計算するステップと、（ｂ）上記エネルギー
Ｅ_Ｌ及びＥ_Ｒ間の比の値と、少なくとも１つの先行する
直前のフレームのＭＳステレオ符号化の状態モードとに
基づいて、現在のフレームのＭＳステレオ符号化の状態
モードを決定するステップとを含むことを特徴とする。A method for encoding an audio signal according to another aspect of the present invention determines the state mode for MS stereo encoding of a digitized audio signal while minimizing the occurrence of artifacts or A method of encoding an audio signal that does not generate any artifacts, said audio signal comprising a pair of stereo channels divided into consecutive frames of a plurality of time samples, wherein: Energy E _L of the left and right channels of the frame
And E _R , and (b) the current frame based on the value of the ratio between the energies E _L and E _R and the state mode of MS stereo coding of at least one preceding immediately preceding frame. Determining the state mode of the MS stereo encoding of.

【００２８】上記オーディオ信号の符号化方法におい
て、所定の範囲の上記比の値Ｅ_Ｒ／Ｅ _Ｌ又はＥ_Ｌ／Ｅ_Ｒ
に対して、現在のフレームのＭＳステレオ符号化の状態
モードは直前のフレームのＭＳステレオ符号化の状態モ
ードを受け継ぐことを特徴とする。In the above audio signal encoding method
And the value E of the above ratio within a predetermined range_R/ E _LOr E_L/ E_R
For the current frame MS stereo coding state
The mode is the status mode of MS stereo coding of the previous frame.
It is characterized by inheriting the code.

【００２９】また、上記オーディオ信号の符号化方法に
おいて、状態モードＡから状態モードＢに切り換える条
件は、状態モードＢから状態モードＡに切り換える条件
の逆ではないことを特徴とする。Further, in the above audio signal encoding method, the condition for switching from the state mode A to the state mode B is not the reverse of the condition for switching from the state mode B to the state mode A.

【００３０】さらに、上記オーディオ信号の符号化方法
において、一方のチャンネルのエネルギーが他方のチャ
ンネルのエネルギーに対して急激に上昇する場合は、Ｍ
Ｓステレオ符号化の別の状態モードへの高速の切り換え
が許容されることを特徴とする。Further, in the above audio signal encoding method, when the energy of one channel rises sharply with respect to the energy of the other channel, M
It is characterized in that fast switching to another state mode of S stereo coding is allowed.

【００３１】またさらに、上記オーディオ信号の符号化
方法において、上記条件は、上記比の値が所定の予め決
められた値を下回る、もしくは上回るようなものである
ことを特徴とする。Furthermore, in the above-mentioned audio signal encoding method, the condition is such that the value of the ratio is below or above a predetermined value.

【００３２】また、上記オーディオ信号の符号化方法に
おいて、他方のチャンネルのエネルギーに対する一方の
チャンネルのエネルギー上昇の上記急激さは、上記比の
値によっても評価されることを特徴とする。Further, in the above-mentioned audio signal encoding method, the abruptness of the energy increase of one channel with respect to the energy of the other channel is also evaluated by the value of the ratio.

【００３３】[0033]

【発明の実施の形態】以下、図面を参照して本発明に係
る実施形態について説明する。本願明細書では、ＡＴＲ
ＡＣ３のためのミニディスク記録再生システムを一実施
形態として参照して説明するが、本発明の符号化方法
は、ジョイントステレオ符号化を用いる他の符号化器に
も適用することができる。DETAILED DESCRIPTION OF THE INVENTION Embodiments of the present invention will be described below with reference to the drawings. In the present specification, the ATR
Although a mini disk recording / reproducing system for AC3 will be described with reference to an embodiment, the encoding method of the present invention can be applied to other encoders using joint stereo encoding.

【００３４】図１は、本発明の一実施形態に係るミニデ
ィスク記録再生システムの構成を示すブロック図であ
る。この実施形態は、特に、オーディオエンコーダ２内
の詳細構成に特徴を有している。FIG. 1 is a block diagram showing the configuration of a mini disk recording / reproducing system according to an embodiment of the present invention. This embodiment is particularly characterized by the detailed configuration inside the audio encoder 2.

【００３５】図１において、Ａ／Ｄコンバータ１はオー
ディオ入力信号をＡ／Ｄ変換してディジタル化されたオ
ーディオサンプル信号に変換し、次いで、オーディオエ
ンコーダ２は、詳細後述するようなジョイントステレオ
符号化処理を用いて上記変換されたオーディオサンプル
信号を圧縮して符号化し、ＡＴＲＡＣ３のビットストリ
ーム信号を発生する。次いで、ミニディスク記録装置３
は、ＡＴＲＡＣ３のビットストリーム信号を所定の変調
記録信号に変調した後、上記変調記録信号をミニディス
ク４に記録する。一方、ミニディスク再生装置５は、ミ
ニディスク４から記録信号を再生して復調することによ
り、ＡＴＲＡＣ３のビットストリーム信号を再生する。
さらに、オーディオデコーダ６は圧縮されているＡＴＲ
ＡＣ３のビットストリーム信号に対して、上記ジョイン
トステレオ符号化処理とは逆のジョイントステレオ復号
化処理を用いてディジタルオーディオ信号に復号化した
後、最後にＤ／Ａコンバータ７はディジタルオーディオ
信号をＤ／Ａ変換してオーディオ出力信号を出力する。In FIG. 1, an A / D converter 1 A / D-converts an audio input signal to convert it into a digitized audio sample signal, and then an audio encoder 2 uses a joint stereo coding as will be described later in detail. The converted audio sample signal is compressed and encoded using processing to generate an ATRAC3 bitstream signal. Then, the mini disk recording device 3
Modulates the bit stream signal of the ATRAC3 into a predetermined modulation recording signal, and then records the modulation recording signal on the mini disk 4. On the other hand, the mini disc reproducing apparatus 5 reproduces the recording signal from the mini disc 4 and demodulates it to reproduce the bit stream signal of the ATRAC 3.
In addition, the audio decoder 6 has a compressed ATR
The bit stream signal of AC3 is decoded into a digital audio signal by using a joint stereo decoding process which is the reverse of the above-mentioned joint stereo coding process, and finally the D / A converter 7 converts the digital audio signal into a D / A signal. A-converts and outputs an audio output signal.

【００３６】上述のジョイントステレオ符号化処理にお
いて適切なＭＳステレオ符号化の状態モードを選択する
ことによって、ディジタルオーディオ信号の符号化及び
復号化の際に発生するアーティファクトの発生を防止す
ることができる。この実施形態では、記録再生システム
は、ＡＴＲＡＣ３のビットストリーム信号をミニディス
ク４に記録するように構成されているが、本発明はこれ
に限らず、その他の記録媒体又は伝送媒体を用いるよう
に記録再生システムや伝送システムを構成してもよい。By selecting an appropriate MS stereo coding state mode in the above-mentioned joint stereo coding processing, it is possible to prevent the occurrence of artifacts that occur during coding and decoding of digital audio signals. In this embodiment, the recording / reproducing system is configured to record the bit stream signal of the ATRAC3 on the mini disc 4, but the present invention is not limited to this, and recording is performed using another recording medium or transmission medium. A reproduction system and a transmission system may be configured.

【００３７】図２は、図１のオーディオエンコーダ２の
詳細構成を示すブロック図である。図２において、オー
ディオエンコーダ２は、ＡＴＲＡＣ３の音声信号圧縮技
術を用いた典型的な変換符号化器であるが、オーディオ
エンコーダ２の詳細構成のうち、特に、ジョイントステ
レオ符号化器１４−１の詳細構成において特徴を有して
いる。ＡＴＲＡＣ３は、基本的に、４４．１ｋＨｚでサ
ンプリングされたオーディオサンプル信号のストリーム
に対して設計された、サブバンド符号化と変換符号化の
技術のハイブリッド符号化技術である。FIG. 2 is a block diagram showing the detailed arrangement of the audio encoder 2 shown in FIG. In FIG. 2, the audio encoder 2 is a typical conversion encoder using the audio signal compression technology of ATRAC3. Among the detailed configurations of the audio encoder 2, in particular, the details of the joint stereo encoder 14-1. It has a characteristic in the configuration. ATRAC3 is basically a hybrid coding technique of subband coding and transform coding designed for a stream of audio sample signals sampled at 44.1 kHz.

【００３８】図２に示すように、オーディオエンコーダ
２は、２ステージのＱＭＦフィルタ１０を使用して、そ
れぞれ時間領域で１０２４個のサンプルを備える連続的
なオーディオサンプル信号のフレームを４つの部分帯域
（サブバンド）信号に周波数帯域分割し、次に、各部分
帯域毎に、時間方向にそれぞれ２５６個のサンプル信号
を有するフレームに分割した信号（以下、サブバンドフ
レームという。）を出力する。ここで、ＱＭＦ分割フィ
ルタ１０は、詳しくは、オーディオサンプル信号を低域
信号と高域信号に等分するローパスフィルタ１１及びハ
イパスフィルタ１２と、上記分割された低域及び高域信
号を周波数についてさらに等分するフィルタバンク１３
−１乃至１３−４とを備えて構成される。従って、ＱＭ
Ｆフィルタの出力は、ダウンサンプリングされ、それぞ
れが２５６個のサンプルを備えている４つのサブバンド
フレームである。これらはそれぞれ、時間軸上では、Ｍ
ＤＣＴ変換のための変換ブロックの半分の長さであり、
０乃至５．５ｋＨｚ（サブバンド０）、５．５ｋＨｚ乃
至１１ｋＨｚ（サブバンド１）、１１ｋＨｚ乃至１６．
５ｋＨｚ（サブバンド２）、及び１６．５ｋＨｚ乃至２
２ｋＨｚ（サブバンド３）の各周波数帯から得られるオ
ーディオ信号である。As shown in FIG. 2, the audio encoder 2 uses a two-stage QMF filter 10 to convert a frame of a continuous audio sample signal with 1024 samples in the time domain into four sub-bands ( A subband) signal is frequency-band-divided, and then a signal (hereinafter referred to as a subband frame) divided into frames each having 256 sample signals in the time direction is output for each partial band. Here, in detail, the QMF division filter 10 further includes a low-pass filter 11 and a high-pass filter 12 that equally divide the audio sample signal into a low-pass signal and a high-pass signal, and further divides the divided low-pass and high-pass signals with respect to frequency. Filter bank 13 for equal division
-1 to 13-4. Therefore, QM
The output of the F filter is four subband frames that are downsampled, each with 256 samples. Each of these is M on the time axis.
Half the length of the transform block for the DCT transform,
0 to 5.5 kHz (subband 0), 5.5 kHz to 11 kHz (subband 1), 11 kHz to 16.
5 kHz (subband 2), and 16.5 kHz to 2
It is an audio signal obtained from each frequency band of 2 kHz (subband 3).

【００３９】次いで、ジョイントステレオ符号化器１４
−１乃至１４−４により、ジョイントステレオ符号化処
理は、ＱＭＦフィルタの出力におけるこれらの４つの時
間領域の信号に適用される。先述の空間性に対する感度
を理由として、ＭＳステレオ符号化はサブバンド０のサ
ブバンドフレームに対してのみ必要であり、より高い周
波数帯のサブバンドフレームに対しては、ＩＳステレオ
符号化の方がより有用かつ有効であると思われるので、
ジョイントステレオ符号化器１４−１はＭＳステレオ符
号化のみを実行し、ジョイントステレオ符号化器１４−
２乃至１４−４は、ＩＳステレオ符号化のみを実行す
る。Next, the joint stereo encoder 14
-1 to 14-4, the joint stereo coding process is applied to these four time domain signals at the output of the QMF filter. Due to the sensitivity to spatiality mentioned above, MS stereo coding is only necessary for subband frames of subband 0, and IS stereo coding is better for subband frames of higher frequency bands. As it seems more useful and effective,
The joint stereo encoder 14-1 performs only MS stereo encoding, and the joint stereo encoder 14-
2 to 14-4 perform IS stereo coding only.

【００４０】ジョイントステレオ符号化器１４−１は、
サブバンド０の時間的に連続した各サブバンドフレーム
に対して適切なＭＳステレオ符号化の状態モードを選択
する。前述のように、ＭＳステレオ符号化の状態モード
は、サブバンドフレーム毎に、左右のチャンネルの信号
の和信号と差信号とを含むＳ／Ｄモード（すなわち、
（Ｌ＋Ｒ）／２及び（Ｌ−Ｒ）／２の信号にてなる第１
の状態モード）と、左右のチャンネルの信号の和信号と
左チャンネルの信号とを含むＳ／Ｌモード（すなわち、
（Ｌ＋Ｒ）／２及びＬ／２の信号にてなる第２の状態モ
ード）と、左右のチャンネルの信号の和信号と右チャン
ネルの信号とを含むＳ／Ｒモード（すなわち、（Ｌ＋
Ｒ）／２及び−Ｒ／２の信号にてなる第３の状態モー
ド）とのいずれかに決定される。ジョイントステレオ符
号化器１４−１は、オーディオ信号の現在のフレームの
左チャンネル及び右チャンネルの各エネルギーを計算す
る計算手段（図示せず。）と、左チャンネルのエネルギ
ーに対する右チャンネルのエネルギーの比の値と、少な
くとも１つの先行する直前のフレームのＭＳステレオ符
号化の状態モードとに基づいて、現在のフレームのＭＳ
ステレオ符号化の状態モードを決定する決定手段（図示
せず。）とを備えている。ＭＳステレオ符号化された
Ｌ’チャンネル信号（すなわち、（Ｌ＋Ｒ）／２信号）
とＲ’チャンネル信号（すなわち、（Ｌ−Ｒ）／２信
号、Ｌ／２信号、又は−Ｒ／２信号）のためのビット数
の配分は、サブバンドフレーム毎に可変にすることがで
きる。各サブバンドフレームに対して決定されたＭＳス
テレオ符号化の状態モードのチャンネル切り換え情報Ｓ
ＷＩとＬ’及びＲ’チャンネル信号のためのビット数の
配分の情報とは、サイド情報としてオーディオデコーダ
６に伝送される。ジョイントステレオ符号化器１４−１
によって実行されるＭＳステレオ符号化の状態モード切
り換え処理の詳細は後述される。The joint stereo encoder 14-1 is
Select the appropriate MS stereo coding state mode for each temporally consecutive subband frame of subband 0. As described above, the MS stereo coding state mode is the S / D mode (that is, the sum signal and the difference signal of the left and right channel signals for each subband frame).
First with (L + R) / 2 and (LR) / 2 signals
State mode), and an S / L mode including a sum signal of left and right channel signals and a left channel signal (that is,
An S / R mode including a second state mode composed of (L + R) / 2 and L / 2 signals and a sum signal of left and right channel signals and a right channel signal (that is, (L + R)
R) / 2 and a third state mode composed of signals of −R / 2). The joint stereo encoder 14-1 calculates the energy of the left channel and the energy of the right channel of the current frame of the audio signal, and calculates the ratio of the energy of the right channel to the energy of the left channel. The MS of the current frame based on the value and the state mode of MS stereo coding of at least one preceding immediately preceding frame.
And a determination means (not shown) for determining a stereo encoding state mode. MS stereo coded L'channel signal (ie (L + R) / 2 signal)
The distribution of the number of bits for the R'channel signal and the R'channel signal (that is, the (LR) / 2 signal, the L / 2 signal, or the -R / 2 signal) can be variable for each subband frame. Channel switching information S of the MS stereo coding state mode determined for each subband frame
The WI and the information on the allocation of the number of bits for the L ′ and R ′ channel signals are transmitted to the audio decoder 6 as side information. Joint stereo encoder 14-1
The details of the state mode switching process of the MS stereo encoding executed by will be described later.

【００４１】ジョイントステレオ符号化処理に続いて、
利得制御部１５−１乃至１５−４において、各サブバン
ドフレームに対する利得係数（すなわち増幅率又は減衰
率）を計算して利得制御が実行される。次いで、ＭＤＣ
Ｔ処理部１６−１は、１つの周波数帯域で利得制御さ
れ、かつＭＤＣＴ処理するための変換ブロックに縦続接
続された少なくとも２つのサブバンドフレーム毎の信号
に対してそれぞれ変形離散コサイン変換（ＭＤＣＴ）の
処理を実行し、その結果として生じるスペクトル情報信
号をトーン成分符号化器１７−１に出力する。トーン成
分符号化器１７−１は、トーン成分検出部、スケールフ
ァクタ処理部及びビット割り当て部等を備えて構成さ
れ、ＭＤＣＴ処理された情報信号からトーン成分信号と
非トーン性のスペクトル信号とを分離して、それぞれを
トーン成分量子化器１８−１とスペクトル量子化器１９
−１に出力する。ここで、トーン成分量子化器１８−１
とスペクトル量子化器１９−１はトーン成分信号と非ト
ーン性のスペクトル信号とに個別に符号化（量子化）の
処理をする。次いで、ビットストリームマルチプレクサ
２０は、ハフマン符号化器とマルチプレクサとを備えて
構成され、トーン成分量子化器１８−１とスペクトル量
子化器１９−１において符号化された信号を、複数個
（例えば１４個）のハフマンテーブルを用いて圧縮し、
次いで、トーン成分符号化器１７−１乃至１７−４にお
けるトーン成分及びスペクトルの情報と、符号化された
各トーン成分と、符号化された各非トーン成分と、ジョ
イントステレオ符号化器１４−１乃至１４−４から出力
されたジョイントステレオ符号化処理のデータ（例え
ば、ＭＳステレオ符号化のチャンネル切り換え情報ＳＷ
ＩとＬ’及びＲ’チャンネル信号のためのビット数の配
分の情報、並びにＩＳステレオ符号化の増幅係数及び／
又は減衰係数）と、利得制御部１５−１乃至１５−４か
ら出力された利得係数のデータ（例えば、サブバンドフ
レーム内の利得制御する位置のデータと、利得制御する
レベルに関するデータと、利得制御する点の個数のデー
タとを含む。）を含み、ＡＴＲＡＣ３標準に従うサイド
情報とを多重化することにより、ＡＴＲＡＣ３のビット
ストリーム信号を得て出力する。Following the joint stereo coding process,
In gain control sections 15-1 to 15-4, gain control is executed by calculating a gain coefficient (that is, an amplification factor or an attenuation factor) for each subband frame. Then MDC
The T processing unit 16-1 has a modified discrete cosine transform (MDCT) for each signal of at least two subband frames that are gain-controlled in one frequency band and are cascade-connected to a transform block for MDCT processing. Is executed, and the resulting spectrum information signal is output to the tone component encoder 17-1. The tone component encoder 17-1 includes a tone component detection unit, a scale factor processing unit, a bit allocation unit, and the like, and separates the tone component signal and the non-tone spectrum signal from the MDCT-processed information signal. And a tone component quantizer 18-1 and a spectrum quantizer 19 respectively.
Output to -1. Here, the tone component quantizer 18-1
The spectrum quantizer 19-1 separately processes the tone component signal and the non-tone spectrum signal for encoding (quantization). Next, the bit stream multiplexer 20 is configured to include a Huffman encoder and a multiplexer, and outputs a plurality of signals (for example, 14 bits) encoded by the tone component quantizer 18-1 and the spectrum quantizer 19-1. Huffman table)
Next, tone component and spectrum information in the tone component encoders 17-1 to 17-4, each encoded tone component, each encoded non-tone component, and the joint stereo encoder 14-1. To 14-4 output the joint stereo coding processing data (for example, MS stereo coding channel switching information SW).
Information on the allocation of the number of bits for the I, L'and R'channel signals, and the amplification factor and / or of IS stereo coding
Or attenuation coefficient), gain coefficient data output from the gain control units 15-1 to 15-4 (for example, data of a position for gain control in a subband frame, data on a level for gain control, and gain control). Data of the number of points to be included.) And the side information in accordance with the ATRAC3 standard are multiplexed to obtain and output the bitstream signal of ATRAC3.

【００４２】図３は、図１のオーディオデコーダ６の詳
細構成を示すブロック図である。最初に、ビットストリ
ームデマルチプレクサ２１は、ミニディスク４等の記録
媒体、又は伝送媒体から読み出された後復調されたＡＴ
ＲＡＣ３のビットストリーム信号を、各サブバンド毎の
トーン成分の符号列及びスペクトルの符号列（非トーン
成分の符号列）と、ジョイントステレオ符号化処理と利
得係数のデータを含むサイド情報とに分離する。次い
で、トーン成分逆量子化器２２−１とスペクトル逆量子
化器２３−１はそれぞれ、各符号列をスペクトル係数に
復号化（逆量子化）し、トーン成分復号化部２４−１は
分離され復号化されたトーン成分信号と非トーン性のス
ペクトル信号とを合成する。また、他のサブバンドにお
けるトーン成分の符号列及びスペクトルの符号列につい
ても同様である。次いで、逆ＭＤＣＴ処理部２５−１乃
至２５−４は、各周波数帯域毎に逆ＭＤＣＴによって時
間領域のサブバンドフレームを生成し、次いで、逆利得
制御部２６−１乃至２６−４は、ビットストリームデマ
ルチプレクサ２１から入力されたサイド情報内の利得制
御のデータに基づいて、各サブバンドフレームに対して
利得制御したときとは逆の利得係数で逆利得制御し、す
なわち、利得制御によって増幅された部分を抑圧し、抑
圧された部分を増幅し、逆利得制御された各バンドのサ
ブバンドフレームをジョイントステレオ復号化器２７−
１乃至２７−４に出力する。FIG. 3 is a block diagram showing the detailed structure of the audio decoder 6 shown in FIG. First, the bit stream demultiplexer 21 reads out from a recording medium such as a mini disk 4 or a transmission medium and then demodulates the AT.
The bitstream signal of the RAC3 is separated into a tone component code string and a spectrum code string (non-tone component code string) for each subband, and side information including joint stereo coding processing and gain coefficient data. . Next, the tone component dequantizer 22-1 and the spectrum dequantizer 23-1 respectively decode (dequantize) each code string into spectrum coefficients, and the tone component decoding unit 24-1 is separated. The decoded tone component signal and the non-tone spectrum signal are combined. The same applies to the tone component code strings and spectrum code strings in other subbands. Next, the inverse MDCT processing units 25-1 to 25-4 generate time-domain subband frames by inverse MDCT for each frequency band, and then the inverse gain control units 26-1 to 26-4 perform bitstream processing. Based on the gain control data in the side information input from the demultiplexer 21, inverse gain control is performed with a gain coefficient opposite to that when gain control is performed for each subband frame, that is, amplified by gain control. The part is suppressed, the suppressed part is amplified, and the inverse-gain-controlled sub-band frame of each band is joint stereo decoder 27-
1 to 27-4.

【００４３】次に、ジョイントステレオ復号化器２７−
１乃至２７−４は、オーディオエンコーダ２から伝送さ
れたサイド情報に基づいて、上記逆利得制御されたサブ
バンドフレームに対して、ＭＳステレオ復号化とＩＳス
テレオ復号化とを実行して、オーディオエンコーダ２に
入力された元のステレオ信号を復元し、ＱＭＦ合成フィ
ルタ２８に出力する。ここで、ジョイントステレオ復号
化器２７−１は、入力されたサブバンドフレームに対し
てＭＳステレオ復号化を実行して、ＭＳステレオ符号化
される前のステレオチャンネルを含むオーディオ信号を
復元して出力し、ジョイントステレオ復号化器２７−２
乃至２７−４は、入力されたサブバンドフレームに対し
てそれぞれＩＳステレオ復号化を実行して、Ｌチャンネ
ルとＲチャンネルがそれぞれ所定の量だけ増幅され、及
び／又は減衰されたオーディオ信号を出力する。ＭＳス
テレオ復号化とＩＳステレオ復号化とを、集合的に、ジ
ョイントステレオ復号化と呼ぶ。最後に、ＱＭＦ合成フ
ィルタ２８は各バンド（周波数帯域）毎のサブバンドフ
レームを合成して、合成されたディジタルオーディオ信
号を出力する。Next, the joint stereo decoder 27-
1 to 27-4 perform MS stereo decoding and IS stereo decoding on the inverse gain-controlled subband frame based on the side information transmitted from the audio encoder 2 to output the audio encoder. The original stereo signal input to 2 is restored and output to the QMF synthesis filter 28. Here, the joint stereo decoder 27-1 performs MS stereo decoding on the input subband frame to restore and output an audio signal including a stereo channel before being MS stereo encoded. And joint stereo decoder 27-2
27 to 27-4 each perform IS stereo decoding on the input subband frame, and output an audio signal in which the L channel and the R channel are amplified and / or attenuated by a predetermined amount. . MS stereo decoding and IS stereo decoding are collectively referred to as joint stereo decoding. Finally, the QMF synthesis filter 28 synthesizes the sub-band frames for each band (frequency band) and outputs the synthesized digital audio signal.

【００４４】以上説明したように、図１のミニディスク
記録再生システムは、オーディオ信号の符号化装置であ
るオーディオエンコーダ２とオーディオ信号の復号化装
置であるオーディオデコーダ６とを備えたオーディオ信
号の符号化及び復号化システムである。上記オーディオ
エンコーダ２は、入力されたオーディオ信号のサブバン
ドフレームを、ミッドサイドステレオ符号化の複数の状
態モードのうちのいずれか１つの状態モードに符号化す
るジョイントステレオ符号化器１４−１を備え、また、
上記オーディオデコーダ６は、決定されたミッドサイド
ステレオ符号化の状態モードに基づいて、上記符号化さ
れたオーディオ信号を復号化し、復号化された１対のス
テレオチャンネルを含むオーディオ信号を得て出力する
ジョイントステレオ復号化器２７−１を備えている。As described above, the mini-disc recording / reproducing system of FIG. 1 has an audio signal code including the audio encoder 2 which is an audio signal encoding device and the audio decoder 6 which is an audio signal decoding device. It is an encryption and decryption system. The audio encoder 2 includes a joint stereo encoder 14-1 that encodes a subband frame of an input audio signal into any one of a plurality of state modes of midside stereo encoding. ,Also,
The audio decoder 6 decodes the encoded audio signal based on the determined state mode of the midside stereo encoding, obtains and outputs the decoded audio signal including a pair of stereo channels. The joint stereo decoder 27-1 is provided.

【００４５】図５に、本実施形態のＭＳステレオ符号化
の状態モード切り換え処理が、切り換え特性の状態モー
ドに依存する性質を強調するように、状態遷移図の形式
で提示されている。ノード１０３，１００，１０１は、
ＡＴＲＡＣ３のＭＳステレオ符号化に係る３つの可能な
状態モード（Ｓ／Ｄ，Ｓ／Ｌ，Ｓ／Ｒ）を示している。
分岐は、状態モード間の遷移が発生し得る条件を示して
いる。遷移が発生するとすれば、時間的に連続したサブ
バンドフレーム毎に１回発生する。In FIG. 5, the state mode switching process of MS stereo coding according to the present embodiment is presented in the form of a state transition diagram so as to emphasize the property of the switching characteristic depending on the state mode. The nodes 103, 100, 101 are
3 illustrates three possible state modes (S / D, S / L, S / R) for MS stereo encoding of ATRAC3.
Branches indicate conditions under which transitions between state modes may occur. If a transition occurs, it occurs once for each temporally consecutive subband frame.

【００４６】最初に、チャンネルのエネルギーＥ_Ｌ及び
Ｅ_Ｒが計算される。左チャンネルのエネルギーＥ_Ｌは、
左チャンネルのサブバンド０におけるサブバンドフレー
ムの２５６個の時間領域サンプルすべての平方の和とし
て定義される。右チャンネルのエネルギーＥ_Ｒも同様
に、右チャンネルのサブバンド０におけるサブバンドフ
レームの２５６個の時間領域サンプルすべての平方の和
である。First, the energies E _L and E _R of the channel are calculated. The energy E _{L of the} left channel is
It is defined as the sum of the squares of all 256 time domain samples of the subband frame in subband 0 of the left channel. The right channel energy E _R is likewise the sum of the squares of all 256 time domain samples of the subband frame in subband 0 of the right channel.

【００４７】直前のサブバンドフレームのＭＳステレオ
符号化の状態モードがＳ／Ｄモード（ノード１０３の場
合のようにＳＷＩ＝３）であるときは、しきい値係数ｆ
₁≫１としてWhen the state mode of MS stereo coding of the immediately preceding subband frame is the S / D mode (SWI = 3 as in the case of node 103), the threshold coefficient f
₁ >> as ₁

【数４】Ｅ_Ｒ＞ｆ₁×Ｅ_Ｌが成り立ち、エネルギーＥ_ＲがＥ_Ｌより大きい不均衡な
状態を示すならば、現在のサブバンドフレームのＭＳス
テレオ符号化の状態モードはＳ／Ｌモード（ノード１０
０の場合のようにＳＷＩ＝０）に切り換えられる。しか
しながら、Ｓ／Ｌモードが再びＳ／Ｄモードに戻るよう
に切り換えられるためには、課される条件は、上の条件
の逆、すなわち## EQU00004 ## If E _R > f ₁ × E _L holds and the energy E _R indicates an imbalanced state larger than E _L , the state mode of MS stereo coding of the current subband frame is the S / L mode. (Node 10
SWI = 0) as in the case of 0. However, in order for the S / L mode to be switched back to the S / D mode, the condition imposed is the reverse of the above condition, ie

【数５】Ｅ_Ｌ≧Ｅ_Ｒ／ｆ₁ ではなく、[Equation 5] Not E _L ≧ E _R / f ₁ ,

【数６】ｆ₁×Ｅ_Ｒ≧Ｅ_Ｌ＞ｆ₂×Ｅ_Ｒである。ここで、しきい値係数ｆ₂は## EQU6 ## where f ₁ × E _R ≧ E _L > f ₂ × E _R. Here, the threshold coefficient f ₂ is

【数７】ｆ₁≫ｆ₂≫（１／ｆ₁）を満たし、上式は、エネルギーＥ_Ｌが実質的にＥ_Ｒ／ｆ
₁よりも大きいが（ｆ₂≫１／ｆ₁であるので）、ｆ₁×Ｅ
_Ｒ以下で大き過ぎるまでには至っていない（ｆ₁≫ｆ₂で
あるので）状態であることを示す。こうすることによ
り、本実施形態のＭＳステレオ符号化の状態モード切り
換え処理は、その左チャンネルのエネルギーＥ_ＬがＥ_Ｒ
／ｆ₁に対してゆるやかに上昇して滑らかに変化する信
号を、Ｓ／Ｌモード（ＳＷＩ＝０）及びＳ／Ｄモード
（ＳＷＩ＝３）間の高速のトグルから保護する。すなわ
ち、次式の条件Meets Equation 7] _{_{f 1 »f 2 »(1 /}} f 1), the above equation, the energy _{E L} is substantially _E R / f
Greater than ₁ (since f ₂ >> ₁ / f ₁ ), but f ₁ × E
Below _R , it is in a state not too large (because f ₁ >> f ₂ ). By doing so, the state mode switching process of the MS stereo coding of the present embodiment, the energy E _L of the left channel E _R
A signal that rises slowly and changes smoothly with respect to / f _{1 is} protected from a high-speed toggle between the S / L mode (SWI = 0) and the S / D mode (SWI = 3). That is, the condition of the following equation

【数８】Ｅ_Ｌ≦ｆ₂×Ｅ_Ｒが満たされる限り、現在のサブバンドフレームのＭＳス
テレオ符号化の状態モードは、Ｓ／Ｌモードに保持され
る。図８は、図９と同一のオーディオ信号に対して本実
施形態に係るＭＳステレオ符号化の状態モード切り換え
処理を適用するときの実施例を示し、図９の従来例で発
生した高速のトグルが、図８の実施形態では緩和されて
いることがわかる。Ｓ／ＬモードからＳ／Ｄモードに戻
るように切り換える条件（数６）は、Ｓ／Ｄモードから
Ｓ／Ｌモードに切り換える条件の逆（数５）ではない。
これにより、Ｓ／Ｄモード及びＳ／Ｌモード間の高速の
トグルが防止される。As long as E _L ≦ f ₂ × E _R is satisfied, the state mode of MS stereo coding of the current subband frame is kept in S / L mode. FIG. 8 shows an example in which the state mode switching process of the MS stereo coding according to the present embodiment is applied to the same audio signal as in FIG. 9, and the high-speed toggle generated in the conventional example of FIG. It can be seen that the embodiment shown in FIG. The condition (Equation 6) for switching the S / L mode back to the S / D mode is not the reverse (Equation 5) of the condition for switching the S / D mode to the S / L mode.
This prevents high speed toggle between the S / D and S / L modes.

【００４８】一方で、時間的に近接したサブバンドフレ
ーム間で、左チャンネルのエネルギーＥ_ＬがＥ_Ｒ／ｆ₁
に対して急激に上昇すると、次のサブバンドフレームに
おいてＳ／ＬモードからＳ／Ｄモードへ高速に切り換え
ることが許容される。このことは、次のサブバンドフレ
ームにおいてOn the other hand, the energy E _L of the left channel is E _R / f ₁ between subband frames that are temporally close to each other.
A sharp rise with respect to, it is allowed to switch from the S / L mode to the S / D mode at a high speed in the next subband frame. This means that in the next subband frame

【数９】ｆ₁×Ｅ_Ｒ≧Ｅ_Ｌ＞ｆ₂×Ｅ_Ｒであれば発生する。信号強度が高速に変化する信号の場
合、ＭＳステレオ符号化の状態モードの高速の切り換え
は、高速の状態モード切り換えによってもたらされる可
能性のあるいかなるアーティファクトも通常は信号強度
の急激なサージによって音響心理学的にマスキングされ
るので正当と認められるものである。さらに、この方法
の拡張により、次のサブバンドフレームにおいて次式で
表される状況## EQU9 ## If f ₁ × E _R ≧ E _L > f ₂ × E _R occurs. In the case of signals with rapidly changing signal strength, the fast switching of the state modes of MS stereo coding means that any artifacts that may be introduced by the fast state mode switching are usually psychoacoustic due to a sudden surge of signal strength. It is legally accepted because it is masked. Furthermore, due to the extension of this method, the situation expressed by the following equation in the next subband frame

【数１０】Ｅ_Ｌ＞ｆ₁×Ｅ_Ｒが発生すれば、Ｓ／ＬモードからＳ／Ｒモード（ノード
１０１の場合のようにＳＷＩ＝１）への高速の切り換え
もまた許容される。If E _L > f ₁ × E _R occurs, fast switching from S / L mode to S / R mode (SWI = 1 as in node 101) is also allowed.

【００４９】同様に、直前のサブバンドフレームのＭＳ
ステレオ符号化の状態モードがＳ／Ｄモード（ノード１
０３の場合のようにＳＷＩ＝３）であるとき、エネルギ
ーＥ _ＬがエネルギーＥ_Ｒより大きい不均衡な状態を示すSimilarly, the MS of the immediately preceding subband frame
The stereo encoding state mode is the S / D mode (node 1
When SWI = 3) as in the case of 03, the energy
-E _LIs energy E_RIndicates a greater imbalance

【数１１】Ｅ_Ｌ＞ｆ₁×Ｅ_Ｒが成り立てば、現在のサブバンドフレームのＭＳステレ
オ符号化の状態モードはＳ／Ｒモード（ノード１０１の
場合のようにＳＷＩ＝１）に切り換えられる。しかしな
がら、Ｓ／Ｒモードが再びＳ／Ｄモードに戻るように切
り換えられるためには、課される条件は、上の条件の
逆、すなわちIf E _L > f ₁ × E _R holds, the state mode of MS stereo coding of the current subband frame is switched to S / R mode (SWI = 1 as in the case of node 101). However, in order for the S / R mode to be switched back to the S / D mode again, the condition imposed is the opposite of the above condition, ie

【数１２】Ｅ_Ｒ≧Ｅ_Ｌ／ｆ₁ ではなく、[Equation 12] Not E _R ≧ E _L / f ₁ ,

【数１３】ｆ₁×Ｅ_Ｌ≧Ｅ_Ｒ＞ｆ₂×Ｅ_Ｌであり、この条件は、右チャンネルのエネルギーＥ_Ｒが
実質的にＥ_Ｌ／ｆ₁よりも大きいが（ｆ₂≫１／ｆ₁であ
るので）、ｆ₁×Ｅ_Ｌ以下で大き過ぎるまでには至って
いない（ｆ₁≫ｆ₂であるので）状態であることを示す。
こうすることにより、本実施形態に係るＭＳステレオ符
号化の状態モード切り換え処理は、その右チャンネルの
エネルギーＥ_ＲがＥ_Ｌ／ｆ₁に対してゆるやかに上昇し
て滑らかに変化する信号を、Ｓ／Ｒモード（ＳＷＩ＝
１）及びＳ／Ｄモード（ＳＷＩ＝３）間の高速のトグル
から保護する。すなわち、次式の条件Equation 13 is _{_{_{f 1 × E L ≧ E R}}} > f 2 × E L, this condition, the energy _{E R} of the right channel is substantially larger than _{_{_{E L / f 1 (f 2}}} »1 / Since f ₁ ), the state is not too large below f ₁ × E _L (because f ₁ >> f ₂ ).
By doing so, the state mode switching process of the MS stereo coding according to the present embodiment, the signal energy E _R of the right channel changes smoothly and gradually increases with respect to E _{L /} f _1, S / R mode (SWI =
Protect from fast toggle between 1) and S / D mode (SWI = 3). That is, the condition of the following equation

【数１４】Ｅ_Ｒ≦ｆ₂×Ｅ_Ｌが満たされる限り、現在のサブバンドフレームのＭＳス
テレオ符号化の状態モードは、Ｓ／Ｒモードに保持され
る。また、直前のサブバンドフレームのＭＳステレオ符
号化の状態モードがＳ／Ｄモード（ノード１０３の場合
のようにＳＷＩ＝３）であるとき、数４及び数１１の条
件が満たされないならば、現在のサブバンドフレームの
状態モードはそのままＳ／Ｄモードに保持される。As long as E _R ≦ f ₂ × E _L is satisfied, the state mode of MS stereo coding of the current subband frame is kept in S / R mode. If the condition mode of the MS stereo coding of the immediately preceding subband frame is the S / D mode (SWI = 3 as in the case of the node 103), if the conditions of Eqs. The sub-band frame state mode is maintained as it is in the S / D mode.

【００５０】同様の方法で、時間的に近接したサブバン
ドフレーム間で、右チャンネルのエネルギーＥ_ＲがＥ_Ｌ
／ｆ₁に対して急激に上昇すると、次のサブバンドフレ
ームにおいてＳ／ＲモードからＳ／Ｄモードへ高速に切
り換えることが許容される。このことは、次のサブバン
ドフレームにおいてIn a similar manner, the energy E _R of the right channel is E _L between subband frames that are close in time.
When it rapidly rises with respect to / f ₁ , it is allowed to switch from the S / R mode to the S / D mode at a high speed in the next subband frame. This means that in the next subband frame

【数１５】ｆ₁×Ｅ_Ｌ≧Ｅ_Ｒ＞ｆ₂×Ｅ_Ｌであれば発生する。さらに、次のサブバンドフレームに
おいて次式で表される状況## EQU15 ## If f ₁ × E _L ≧ E _R > f ₂ × E _L occurs. Furthermore, in the next subband frame, the situation expressed by

【数１６】Ｅ_Ｒ＞ｆ₁×Ｅ_Ｌが発生すれば、Ｓ／ＲモードからＳ／Ｌモード（ノード
１００の場合のようにＳＷＩ＝０）への高速の切り換え
もまた許容される。If E _R > f ₁ × E _L occurs, fast switching from S / R mode to S / L mode (SWI = 0 as in node 100) is also allowed.

【００５１】図６は、図５を参照して説明されたＭＳス
テレオ符号化の状態モード切り換え処理の原理を説明す
るための別の状態遷移図である。図６（ａ）で、直前の
サブバンドフレームのＭＳステレオ符号化の状態モード
がＳ／Ｄモードであるとき、Ｓ／Ｌモードに遷移するた
めには、エネルギーの比の値Ｅ_Ｒ／Ｅ_Ｌがしきい値係数
ｆ_１を超過しなければならない。また、Ｓ／Ｄモードか
らＳ／Ｒモードに遷移するためには、比の値Ｅ_Ｒ／Ｅ_Ｌ
がしきい値係数１／ｆ_１より小さくなる（すなわち、比
の値Ｅ_Ｌ／Ｅ_Ｒがしきい値係数ｆ_１より大きくなる）必
要がある。図６（ｂ）で、直前のサブバンドフレームの
ＭＳステレオ符号化の状態モードがＳ／Ｌであるとき、
Ｓ／Ｄモードに遷移するためには、比の値Ｅ_Ｒ／Ｅ_Ｌが
しきい値係数１／ｆ_２より小さくかつしきい値係数１／
ｆ_１以上でなければならない。また、Ｓ／Ｌモードから
Ｓ／Ｒモードに遷移するためには、比の値Ｅ_Ｒ／Ｅ_Ｌが
しきい値係数１／ｆ_１より小さくなる必要がある。一
方、比の値Ｅ_Ｒ／Ｅ_Ｌがしきい値係数１／ｆ_２以上であ
るときは、現在のサブバンドフレームのＭＳステレオ符
号化の状態モードを、Ｓ／Ｌモードに保持する。同様
に、図６（ｃ）で、直前のサブバンドフレームのＭＳス
テレオ符号化の状態モードがＳ／Ｒであるとき、Ｓ／Ｄ
モードに遷移するためには、比の値Ｅ_Ｒ／Ｅ_Ｌがしきい
値係数ｆ_２より大きくかつしきい値係数ｆ_１以下でなけ
ればならない。また、Ｓ／ＲモードからＳ／Ｌモードに
遷移するためには、比の値Ｅ_Ｒ／Ｅ_Ｌがしきい値係数ｆ
_１より大きくなる必要がある。一方、比の値Ｅ_Ｒ／Ｅ_Ｌ
がしきい値係数ｆ_２以下であるときは、現在のサブバン
ドフレームのＭＳステレオ符号化の状態モードを、Ｓ／
Ｒモードに保持する。FIG. 6 is another state transition diagram for explaining the principle of the state mode switching process of the MS stereo coding described with reference to FIG. In FIG. 6A, when the state mode of the MS stereo coding of the immediately preceding subband frame is the S / D mode, in order to transit to the S / L mode, the energy ratio value E _R / E _L Must exceed the threshold factor f ₁ . Further, in order to transit from the S / D mode to the S / R mode, the ratio value E _R / E _L
Must be smaller than the threshold coefficient 1 / f ₁ (that is, the ratio value E _L / E _R must be larger than the threshold coefficient f ₁ ). In FIG. 6B, when the state mode of MS stereo coding of the immediately preceding subband frame is S / L,
In order to transit to the S / D mode, the ratio value E _R / E _L is smaller than the threshold coefficient 1 / f ₂ and the threshold value 1 / f _2.
Must be greater than or equal to f ₁ . Further, in order to transit from the S / L mode to the S / R mode, the ratio value E _R / E _L needs to be smaller than the threshold coefficient 1 / f ₁ . On the other hand, when the ratio value E _R / E _L is equal to or greater than the threshold coefficient 1 / f ₂ , the current MS stereo coding state mode of the subband frame is held in the S / L mode. Similarly, in FIG. 6C, when the MS stereo coding state mode of the immediately preceding subband frame is S / R, S / D
In order to enter the mode, the ratio value E _R / E _L must be greater than the threshold coefficient f ₂ and less than or equal to the threshold coefficient f ₁ . Further, in order to transit from the S / R mode to the S / L mode, the ratio value E _R / E _L is equal to the threshold coefficient f.
Must be greater than ₁ . On the other hand, the ratio value E _R / E _L
Is a threshold coefficient f ₂ or less, the state mode of MS stereo coding of the current subband frame is S / S.
Hold in R mode.

【００５２】調整の結果、ＡＴＲＡＣ３については、し
きい値係数ｆ₁＝１００及びｆ₂＝１が妥当な値であるこ
とが発見されているが、他の符号化器では他の値の方が
より適切である場合もある。一般に、２つのしきい値係
数の間の差が、数十ｄＢであることが望ましい。As a result of the adjustment, for ATRAC3, it has been found that the threshold coefficients f ₁ = 100 and f ₂ = 1 are reasonable values, but other encoders have other values. It may be more appropriate. In general, it is desirable that the difference between the two threshold factors is several tens of dB.

【００５３】以上の原理に従って、ジョイントステレオ
符号化器１４−１は、図７に示されたようなＭＳステレ
オ符号化の状態モード切り換え処理を実行する。図７の
ステップＳ１で、初期化処理として、直前のサブバンド
フレームのＭＳステレオ符号化の状態モードをＳ／Ｄモ
ードであると設定し、ステップＳ２において、現在のサ
ブバンドフレームのＬチャンネル及びＲチャンネルのエ
ネルギーＥ_Ｌ及びＥ_Ｒを計算する。次いで、ステップＳ
３において、直前のサブバンドフレームのＭＳステレオ
符号化の状態モードが何であるかを決定する。直前のサ
ブバンドフレームがＳ／Ｌモードであったときは、ステ
ップＳ４で、Ｌチャンネル及びＲチャンネルのエネルギ
ーの比の値Ｅ_Ｒ／Ｅ_Ｌがしきい値係数の逆数１／ｆ_２よ
りも小さいか否か（すなわち、Ｅ_Ｌ＞ｆ_２×Ｅ_Ｒである
か否か）が決定される。ステップＳ４がＮＯのとき、ス
テップＳ６で、現在のサブバンドフレームのＭＳステレ
オ符号化の状態モードをＳ／Ｌモードに設定して、ステ
ップＳ１３に進む。ステップＳ４がＹＥＳであるとき
は、ステップＳ５で、エネルギーの比の値Ｅ_Ｒ／Ｅ_Ｌが
しきい値係数の逆数１／ｆ_１よりも小さいか否か（すな
わち、Ｅ_Ｌ＞ｆ_１×Ｅ _Ｒであるか否か）が決定される。
ステップＳ５がＹＥＳであるときは、ステップＳ１２
で、現在のサブバンドフレームのＭＳステレオ符号化の
状態モードをＳ／Ｒモードに設定して、ステップＳ１３
に進む。ステップＳ５がＮＯであるとき、ステップＳ９
で、現在のサブバンドフレームのＭＳステレオ符号化の
状態モードをＳ／Ｄモードに設定して、ステップＳ１３
に進む。In accordance with the above principle, joint stereo
The encoder 14-1 uses the MS stereo as shown in FIG.
The encoding state mode switching process is executed. Of FIG.
In step S1, the immediately preceding subband is used as initialization processing.
The state mode of MS stereo coding of the frame is set to S / D mode.
The current status is set in step S2.
Band channel L channel and R channel
Energy E_LAnd E_RTo calculate. Then, step S
In 3, the MS stereo of the immediately preceding subband frame
Determine what the encoding state mode is. The last service
If the band frame is in S / L mode,
At S4, the energy of the L and R channels
The ratio value E_R/ E_LIs the reciprocal of the threshold coefficient 1 / f_TwoYo
Is less than (ie E_L> F_Two× E_RIs
Or not) is determined. When step S4 is NO,
At step S6, the MS stereo of the current subband frame
Set the encoding status mode to S / L mode and
Go to step S13. When step S4 is YES
Is the energy ratio value E in step S5._R/ E_LBut
Reciprocal of threshold coefficient 1 / f₁Is less than (
Wow, E_L> F₁× E _ROr not) is determined.
When step S5 is YES, step S12
In the current MS stereo coding of the subband frame
Set the state mode to S / R mode, and step S13
Proceed to. When step S5 is NO, step S9
In the current MS stereo coding of the subband frame
Set the status mode to S / D mode, and step S13
Proceed to.

【００５４】ステップＳ３で、直前のサブバンドフレー
ムがＳ／Ｄモードであったときは、ステップＳ７で、Ｌ
チャンネル及びＲチャンネルのエネルギーの比の値Ｅ_Ｒ
／Ｅ _Ｌがしきい値係数ｆ_１よりも大きいか否か（すなわ
ち、Ｅ_Ｒ＞ｆ_１×Ｅ_Ｌであるか否か）が決定される。ス
テップＳ７がＹＥＳであるときはステップＳ６に進む。
ステップＳ７がＮＯであるときは、ステップＳ８で、エ
ネルギーの比の値Ｅ_Ｒ／Ｅ_Ｌがしきい値係数の逆数１／
ｆ_１よりも小さいか否か（すなわち、Ｅ_Ｌ＞ｆ _１×Ｅ_Ｒ
であるか否か）が決定される。ステップＳ８がＹＥＳで
あるときはステップＳ１２に進み、ステップＳ８がＮＯ
であるときはステップＳ９に進む。また、ステップＳ３
で、直前のサブバンドフレームのＭＳステレオ符号化の
状態モードがＳ／Ｒであったときは、ステップＳ１０
で、エネルギーの比の値Ｅ_Ｒ／Ｅ_Ｌがしきい値係数ｆ_２
よりも大きいか否か（すなわち、Ｅ_Ｒ＞ｆ_２×Ｅ_Ｌであ
るか否か）が決定される。ステップＳ１０がＮＯである
ときはステップＳ１２に進む。ステップＳ１０がＹＥＳ
であるときは、ステップＳ１１で、エネルギーの比の値
Ｅ_Ｒ／Ｅ_Ｌがしきい値係数ｆ_１よりも大きいか否か（す
なわち、Ｅ_Ｒ＞ｆ_１×Ｅ_Ｌであるか否か）が決定され
る。ステップＳ１１がＹＥＳであるときはステップＳ６
に進み、ステップＳ１１がＮＯであるときはステップＳ
９に進む。In step S3, the immediately preceding subband frame
If the system is in S / D mode, then in step S7, L
Value E of the energy ratio of the channel and the R channel_R
/ E _LIs the threshold coefficient f₁Whether it is greater than
Chi, E_R> F₁× E_LOr not) is determined. Su
If step S7 is YES, the process proceeds to step S6.
If step S7 is NO, step S8
Energy ratio value E_R/ E_LIs the reciprocal of the threshold coefficient 1 /
f₁Less than (ie E_L> F ₁× E_R
Or not) is determined. If step S8 is YES
If there is, go to step S12, step S8 NO
When it is, it progresses to step S9. Also, step S3
Then, of the MS stereo coding of the immediately preceding subband frame
If the state mode is S / R, step S10
And the energy ratio value E_R/ E_LIs the threshold coefficient f_Two
Greater than or equal to (ie E_R> F_Two× E_LAnd
Or not) is determined. Step S10 is NO
If so, the process proceeds to step S12. YES in step S10
If it is, in step S11, the value of the energy ratio
E_R/ E_LIs the threshold coefficient f₁Is greater than (
N, E_R> F₁× E_LOr not) is determined
It When step S11 is YES, step S6
Proceed to step S11 if NO in step S11.
Proceed to 9.

【００５５】次いで、ステップＳ１３において、現在の
サブバンドフレームが入力信号の最後のサブバンドフレ
ームであるか否かが判断される。ステップＳ１３がＮＯ
であるときは、ステップＳ１４でサブバンド０の次のサ
ブバンドフレームを取得し、ステップＳ６，Ｓ９又はＳ
１２で決定された現在のサブバンドフレームのＭＳステ
レオ符号化の状態モードを直前のサブバンドフレームの
ＭＳステレオ符号化の状態モードとして設定し、次い
で、ステップＳ２に戻る。ステップＳ１３がＹＥＳであ
るときは処理を終了する。Then, in step S13, it is determined whether or not the current subband frame is the last subband frame of the input signal. Step S13 is NO
If so, the sub-band frame next to sub-band 0 is acquired in step S14, and steps S6, S9 or S
The MS stereo coding state mode of the current subband frame determined in 12 is set as the MS stereo coding state mode of the immediately preceding subband frame, and then the process returns to step S2. If step S13 is YES, the process ends.

【００５６】上述された本実施形態のＭＳステレオ符号
化の状態モード切り換え処理は、コンピュータ技術の当
業者には明らかになるように、本願明細書の教示すると
ころに従ってプログラミングされた従来の汎用のコンピ
ュータ又はマイクロプロセッサ、もしくはコンピュータ
が組み込まれた符号化装置を用いて実行されることが可
能である。適切なソフトウェアのコーディングは、ソフ
トウェア技術の当業者には明らかになるように、本開示
の教示することに基づいて熟練したプログラマによって
容易に準備されることが可能である。本実施形態の処理
はまた、当業者には直ちに明らかになるように、特定用
途向け集積回路の準備によって、又は従来の構成要素回
路の適切なネットワークを相互に接続することによって
実装されることも可能である。上記コンピュータは、こ
こに開示されたオーディオエンコーダ２を含むだけでな
く、対応するオーディオデコーダ６も含むことが注意さ
れるべきであり、上記オーディオデコーダ６は、オーデ
ィオエンコーダ２によって生成されたＭＳステレオ符号
化されたビットストリーム信号を復号化して、復号化さ
れたステレオチャンネルを含むオーディオ信号を得て出
力するために動作するように実装されることが可能であ
る。The state mode switching process of the MS stereo coding of the present embodiment described above is well known to those skilled in the computer art, and a conventional general purpose computer programmed according to the teachings of the present application. Alternatively, it can be implemented using a microprocessor, or a computer-embedded encoding device. Appropriate software coding can be readily prepared by skilled programmers based on the teachings of the present disclosure, as will be apparent to those skilled in the software art. The processing of this embodiment may also be implemented by the provision of application specific integrated circuits, or by interconnecting appropriate networks of conventional component circuits, as will be readily apparent to those skilled in the art. It is possible. It should be noted that the computer not only includes the audio encoder 2 disclosed herein, but also the corresponding audio decoder 6, which is the MS stereo code generated by the audio encoder 2. It may be implemented to decode the encoded bitstream signal to operate to obtain and output an audio signal containing the decoded stereo channels.

【００５７】本発明は、オーディオエンコーダ２又はオ
ーディオデコーダ６である上記コンピュータを動作させ
るための、コンピュータプログラムとして実施されても
よい。上記コンピュータプログラムは、上述のようにＭ
Ｓステレオ符号化の状態モード切り換え処理を実行する
ための複数の命令を含み、上記コンピュータは上記コン
ピュータプログラムを実行することにより、オーディオ
信号に対してＭＳステレオ符号化の状態モードを決定
し、及び／又はＭＳステレオ符号化されたオーディオ信
号を復号化する。上記コンピュータプログラムは、コン
ピュータプログラムの記録媒体、又はコンピュータプロ
グラムの記録媒体を備えたコンピュータプログラム製品
の形態、すなわちコンピュータプログラムの記録媒体で
提供されるソフトウェアプログラムの形態であってもよ
く、上記コンピュータプログラムの記録媒体は、コンピ
ュータプログラムが記録され、コンピュータにより読み
取り可能な記録媒体を含む。上記コンピュータにより読
み取り可能な記録媒体は、例えば、（フロッピー（登録
商標）ディスクのような）磁気ディスク又は磁気テープ
のような磁気記録媒体、光ディスク、光テープ又は機械
により読み取り可能なメモリ（ＲＡＭ）、又は読出し専
用メモリ（ＲＯＭ）のような光学的記録媒体、又は任意
の別の物理的なデバイス又は媒体を含むが、それらに制
限されない。また、上記コンピュータプログラムは、イ
ンターネット又は無線ＬＡＮのような電気通信回線及び
／又は無線回線を介して、サーバ等のコンピュータから
クライアント装置等のコンピュータに伝送されてもよ
い。The present invention may be implemented as a computer program for operating the above-mentioned computer which is the audio encoder 2 or the audio decoder 6. The computer program is M
A plurality of instructions for performing an S stereo coding state mode switching process, the computer executing the computer program to determine a MS stereo coding state mode for an audio signal, and / or Alternatively, the MS stereo encoded audio signal is decoded. The computer program may be in the form of a computer program recording medium or a computer program product provided with a computer program recording medium, that is, in the form of a software program provided on the computer program recording medium. The recording medium includes a computer-readable recording medium in which a computer program is recorded. The computer-readable recording medium is, for example, a magnetic recording medium such as a magnetic disk or a magnetic tape (such as a floppy (registered trademark) disk), an optical disk, an optical tape, or a machine-readable memory (RAM). Or, including but not limited to, an optical recording medium such as read only memory (ROM), or any other physical device or medium. The computer program may be transmitted from a computer such as a server to a computer such as a client device via an electric communication line and / or a wireless line such as the Internet or a wireless LAN.

【００５８】＜変形例＞本発明に係る実施形態のオーデ
ィオ信号の符号化及び復号化システムは、ミニディスク
記録再生システムとして説明されたが、他の任意の記録
媒体又は伝送媒体であってもよい。本実施形態の記載は
ＡＴＲＡＣ３符号化器を基礎とするものであるが、その
適用可能性は、同様のＭＳステレオ符号化の方法を利用
する他の符号化器に拡張することが可能である。また、
本実施形態では、ＭＳステレオ符号化は、ジョイントス
テレオ符号化器１４−１のみで実行されたが、他のジョ
イントステレオ符号化器１４−２乃至１４−４において
ＭＳステレオ符号化を実行することを制限するものでは
ない。またさらに、本実施形態では、ステレオ信号の各
チャンネルのエネルギーと直前のサブバンドフレームの
ＭＳステレオ符号化の状態モードとに基づいて現在のサ
ブバンドフレームのＭＳステレオ符号化の状態モードが
決定されたが、２つ以上の先行するサブバンドフレーム
に基づいて現在のサブバンドフレームのＭＳステレオ符
号化の状態モードを決定してもよい。また、本実施形態
では、左右のチャンネルの信号を比較するために、各チ
ャンネルのサブバンドフレーム内のすべての時間サンプ
ルの平方の総和（エネルギー）を計算したが、それ以外
の基準、例えば、振幅の絶対値の総和などを用いて左右
のチャンネルの信号を比較してもよい。<Modification> The audio signal encoding / decoding system according to the embodiment of the present invention has been described as the mini-disc recording / reproducing system, but may be any other recording medium or transmission medium. . Although the description of the present embodiment is based on the ATRAC3 encoder, its applicability can be extended to other encoders that utilize the same MS stereo encoding method. Also,
In the present embodiment, the MS stereo coding is performed only by the joint stereo encoder 14-1, but it is possible to perform the MS stereo coding by the other joint stereo encoders 14-2 to 14-4. There is no limit. Furthermore, in this embodiment, the MS stereo coding state mode of the current subband frame is determined based on the energy of each channel of the stereo signal and the MS stereo coding state mode of the immediately preceding subband frame. However, the state mode of MS stereo coding of the current subband frame may be determined based on two or more preceding subband frames. Further, in the present embodiment, in order to compare the signals of the left and right channels, the sum total of squares (energy) of all time samples in the subband frame of each channel is calculated. The signals of the left and right channels may be compared using the sum of the absolute values of.

【００５９】以上説明したように、本発明に係る実施形
態のオーディオ信号の符号化方法によれば、ＭＳステレ
オ符号化の実装が注意深く行われないときに生じること
がある、アーティファクトに関する次の２点の潜在的な
危険を回避することができる。その１つは、ＭＳステレ
オ符号化の状態モードの切り換えが想定されていないと
きに２つの状態モード間で高速にトグルすることであ
り、もう１つは状態モードの切り換えが必要であるとき
にその切り換えに失敗することである。本実施形態は、
こうしたアーティファクトの２つの原因に有効に取り組
むことに成功している。特に、一方のチャンネルのエネ
ルギーが他方のチャンネルのエネルギーに対して急激に
上昇する場合は、他方のＭＳステレオ符号化の状態モー
ドへの高速の切り換えが許容される。最後に、本実施形
態は、その簡単さと非常に少ないメモリ要件とのために
ＬＳＩ上での実装が非常に容易である。As described above, according to the audio signal encoding method of the embodiment of the present invention, the following two points regarding the artifacts which may occur when the MS stereo encoding is not carefully implemented. You can avoid the potential danger of. One is to toggle rapidly between two state modes when MS stereo coding state mode switching is not envisaged, and the other is when state mode switching is needed. It means that switching fails. In this embodiment,
He has been successful in addressing the two causes of these artifacts. In particular, if the energy of one channel rises sharply with respect to the energy of the other, a fast switch to the other MS stereo coding state mode is allowed. Finally, this embodiment is very easy to implement on an LSI due to its simplicity and very few memory requirements.

【００６０】[0060]

【発明の効果】以上詳述したように、本発明に係るオー
ディオ信号の符号化方法によれば、左チャンネルと右チ
ャンネルとからなる１対のステレオチャンネルを含みか
つ各フレームが複数の時間サンプルにてなる連続的なフ
レームに分割されているディジタル化されたオーディオ
信号を、ＭＳステレオ符号化の複数の状態モードのうち
のいずれか１つの状態モードに符号化するためのオーデ
ィオ信号の符号化方法において、上記オーディオ信号の
現在のフレームの左チャンネル及び右チャンネルの各エ
ネルギーを計算するステップと、上記左チャンネルのエ
ネルギーに対する上記右チャンネルのエネルギーの比の
値と、少なくとも１つの先行する直前のフレームのＭＳ
ステレオ符号化の状態モードとに基づいて、現在のフレ
ームのＭＳステレオ符号化の状態モードを決定するステ
ップとを含む。As described above in detail, according to the audio signal coding method of the present invention, each frame includes a pair of stereo channels consisting of a left channel and a right channel, and each frame has a plurality of time samples. A method for encoding an audio signal for encoding a digitized audio signal divided into successive frames into any one of a plurality of state modes of MS stereo encoding Calculating the respective energies of the left and right channels of the current frame of the audio signal, the value of the ratio of the energy of the right channel to the energy of the left channel, and the MS of at least one preceding frame.
Determining the MS stereo coding state mode of the current frame based on the stereo coding state mode.

【００６１】また、本発明に係るオーディオ信号の符号
化装置によれば、左チャンネルと右チャンネルとからな
る１対のステレオチャンネルを含みかつ各フレームが複
数の時間サンプルにてなる連続的なフレームに分割され
ているディジタル化されたオーディオ信号を、ＭＳステ
レオ符号化の複数の状態モードのうちのいずれか１つの
状態モードに符号化するためのオーディオ信号の符号化
装置において、上記オーディオ信号の現在のフレームの
左チャンネル及び右チャンネルの各エネルギーを計算す
る計算手段と、上記左チャンネルのエネルギーに対する
上記右チャンネルのエネルギーの比の値と、少なくとも
１つの先行する直前のフレームのＭＳステレオ符号化の
状態モードとに基づいて、現在のフレームのＭＳステレ
オ符号化の状態モードを決定する決定手段とを備えてい
る。Further, according to the audio signal coding apparatus of the present invention, a continuous frame including a pair of stereo channels consisting of a left channel and a right channel and each frame consisting of a plurality of time samples is formed. A device for encoding an audio signal for encoding a divided digitized audio signal into any one of a plurality of state modes of MS stereo encoding, wherein: Calculating means for calculating the respective energies of the left and right channels of the frame, a value of the ratio of the energy of the right channel to the energy of the left channel, and a state mode of MS stereo coding of at least one preceding frame Based on and, the current frame MS stereo coding state And a determining means for determining the mode.

【００６２】上記オーディオ信号の符号化方法及び装置
によれば、ディジタル化されたオーディオ信号のＭＳス
テレオ符号化に係る状態モードを決定する一方で、アー
ティファクトの発生を最小化するか又はアーティファク
トをまったく発生させないという効果がもたらされる。
また、本発明によれば、上記オーディオ信号の符号化装
置と、オーディオ信号の復号化装置とを備えたオーディ
オ信号の符号化及び復号化システムを提供することがで
きる。さらに、本発明によれば、上記オーディオ信号の
符号化方法に係るステップを含むプログラムと、上記プ
ログラムを記録し、コンピュータが読み取り可能な記録
媒体を提供することができる。According to the above audio signal encoding method and apparatus, the state mode related to the MS stereo encoding of the digitized audio signal is determined, while the generation of the artifact is minimized or the artifact is generated at all. The effect of not letting it occur is brought about.
Further, according to the present invention, it is possible to provide an audio signal encoding / decoding system including the audio signal encoding device and the audio signal decoding device. Further, according to the present invention, it is possible to provide a program including the steps related to the audio signal encoding method and a computer-readable recording medium in which the program is recorded.

【００６３】ＭＳステレオ符号化は、多くのオーディオ
符号化器にとってビットレートを低減させる一般的な方
法であるが、注意深く実装されなければアーティファク
トを発生させやすい。本発明は、ＭＳステレオ符号化の
状態モード間の高速のトグルと、モードの切り換えの遅
れとを防止してアーティファクトの発生を最小にする有
効で効率的なオーディオ信号の符号化方法及び装置を提
供する。MS stereo coding is a common method of reducing the bit rate for many audio coders, but it is prone to artifacts if not implemented carefully. The present invention provides an effective and efficient audio signal encoding method and apparatus for preventing high speed toggle between state modes of MS stereo encoding and delay of mode switching to minimize the occurrence of artifacts. To do.

[Brief description of drawings]

【図１】本発明の一実施形態に係るミニディスク記録
再生システムの構成を示すブロック図である。FIG. 1 is a block diagram showing a configuration of a mini disk recording / reproducing system according to an embodiment of the present invention.

【図２】図１のオーディオエンコーダ２の詳細構成を
示すブロック図である。FIG. 2 is a block diagram showing a detailed configuration of the audio encoder 2 of FIG.

【図３】図１のオーディオデコーダ６の詳細構成を示
すブロック図である。3 is a block diagram showing a detailed configuration of the audio decoder 6 of FIG.

【図４】ステレオ信号３１にＭＳステレオ符号化とＩ
Ｓステレオ符号化とを適用して、ジョイントステレオ信
号３２を生成する実施例を示す図である。FIG. 4 shows MS stereo coding and I for stereo signal 31.
FIG. 7 is a diagram illustrating an example in which S stereo encoding is applied to generate a joint stereo signal 32.

【図５】ＭＳステレオ符号化に係る３つの可能な状態
モードの間の状態遷移図である。FIG. 5 is a state transition diagram between three possible state modes according to MS stereo coding.

【図６】直前のサブバンドフレームのＭＳステレオ符
号化の状態モードに基づいて、現在のサブバンドフレー
ムのＭＳステレオ符号化の状態モードを決定するため条
件を示す図であって、（ａ）は直前のサブバンドフレー
ムのＭＳステレオ符号化の状態モードがＳ／Ｄモードの
ときの条件、（ｂ）は直前のサブバンドフレームのＭＳ
ステレオ符号化の状態モードがＳ／Ｌモードのときの条
件、（ｃ）は直前のサブバンドフレームのＭＳステレオ
符号化の状態モードがＳ／Ｒモードのときの条件を示す
図である。FIG. 6 is a diagram showing conditions for determining the MS stereo coding state mode of the current subband frame based on the MS stereo coding state mode of the immediately preceding subband frame, FIG. The condition when the MS stereo coding state mode of the immediately preceding subband frame is the S / D mode, (b) is the MS of the immediately preceding subband frame
FIG. 6 is a diagram showing conditions when the stereo encoding state mode is the S / L mode, and (c) is a diagram showing conditions when the MS stereo encoding state mode of the immediately preceding subband frame is the S / R mode.

【図７】図２のジョイントステレオ符号化器１４−１
によって実行されるＭＳステレオ符号化の状態モード切
り換え処理を示すフローチャートである。7 is a joint stereo encoder 14-1 of FIG.
6 is a flowchart showing a state mode switching process of MS stereo encoding executed by the.

【図８】Ｌチャンネル及びＲチャンネルにてなる信号
に対して、本発明の実施形態に係るＭＳステレオ符号化
の状態モード切り換え処理を適用した例を示すグラフで
ある。FIG. 8 is a graph showing an example in which the state mode switching process of MS stereo coding according to the embodiment of the present invention is applied to the signals of the L channel and the R channel.

【図９】Ｌチャンネル及びＲチャンネルにてなる信号
に対して、従来技術に係るＭＳステレオ符号化の状態モ
ード切り換え処理を適用した例を示すグラフである。FIG. 9 is a graph showing an example in which a state mode switching process of MS stereo coding according to the related art is applied to signals of L channel and R channel.

[Explanation of symbols]

１…Ａ／Ｄコンバータ、２…オーディオエンコーダ、３…ミニディスク記録装置、４…ミニディスク、５…ミニディスク再生装置、６…オーディオデコーダ、７…Ｄ／Ａコンバータ、１０…ＱＭＦ分割フィルタ、１１…ローパスフィルタ、１２…ハイパスフィルタ、１３−１乃至１３−４…フィルタバンク、１４−１乃至１４−４…ジョイントステレオ符号化器、１５−１乃至１５−４…利得制御部、１６−１乃至１６−４…ＭＤＣＴ処理部、１７−１乃至１７−４…トーン成分符号化器、１８−１乃至１８−４…トーン成分量子化器、１９−１乃至１９−４…スペクトル量子化器、２０…ビットストリームマルチプレクサ、２１…ビットストリームデマルチプレクサ、２２−１乃至２２−４…トーン成分逆量子化器、２３−１乃至２３−４…スペクトル逆量子化器、２４−１乃至２４−４…トーン成分復号化器、２５−１乃至２５−４…逆ＭＤＣＴ処理部、２６−１乃至２６−４…逆利得制御部、２７−１乃至２７−４…ジョイントステレオ復号化器、２８…ＱＭＦ合成フィルタ。 1 ... A / D converter, 2 ... audio encoder, 3 ... Mini disk recording device, 4 ... Mini disc, 5 ... Mini disc player, 6 ... Audio decoder, 7 ... D / A converter, 10 ... QMF division filter, 11 ... Low-pass filter, 12 ... High-pass filter, 13-1 to 13-4 ... Filter bank, 14-1 to 14-4 ... Joint stereo encoder, 15-1 to 15-4 ... Gain control section, 16-1 to 16-4 ... MDCT processing unit, 17-1 to 17-4 ... Tone component encoder, 18-1 to 18-4 ... Tone component quantizer, 19-1 to 19-4 ... Spectral quantizer, 20 ... Bitstream multiplexer, 21 ... Bitstream demultiplexer, 22-1 to 22-4 ... Tone component dequantizer, 23-1 to 23-4 ... Spectral inverse quantizer, 24-1 to 24-4 ... Tone component decoder, 25-1 to 25-4 ... Inverse MDCT processing unit, 26-1 to 26-4 ... Inverse gain control section, 27-1 to 27-4 ... Joint stereo decoder, 28 ... QMF synthesis filter.

───────────────────────────────────────────────────── フロントページの続き (72)発明者スアホン・ネオシンガポール534415シンガポール、タイ・セン・アベニュー、ブロック1022、04− 3530番、タイ・セン・インダストリアル・エステイト、パナソニック・シンガポール研究所株式会社内Ｆターム(参考） 5J064 AA00 BA16 BB07 BC02 BC06 BC07 BC11 BC16 BC25 BC29 BD02 BD03 BD04 ─────────────────────────────────────────────────── ─── Continued front page (72) Inventor Suahone Neo Singapore 534415 Singapore, Thailand Sen Avenue, Block 1022, 04- No. 3530, Thai Sen Industrial Estate, Panasonic Singapore Research Institute Co., Ltd. F-term (reference) 5J064 AA00 BA16 BB07 BC02 BC06 BC07 BC11 BC16 BC25 BC29 BD02 BD03 BD04

Claims

[Claims]

1. A digitized audio signal comprising a pair of stereo channels consisting of a left channel and a right channel, each frame being divided into consecutive frames of a plurality of time samples. A method of encoding an audio signal for encoding one of a plurality of state modes of stereo encoding, comprising calculating respective energies of a left channel and a right channel of a current frame of the audio signal. The mid-side stereo code of the current frame based on the step, the value of the ratio of the energy of the right channel to the energy of the left channel, and the state mode of mid-side stereo coding of at least one preceding frame. Determining the state mode of activation Method of encoding an audio signal, characterized in that.

2. The mid-side stereo coding includes a first state mode including a sum signal and a difference signal of left and right channel signals, and a sum signal of left and right channel signals and a left channel signal. 2. The audio signal encoding method according to claim 1, further comprising a second state mode and a third state mode including a sum signal of left and right channel signals and a right channel signal.

3. The step of determining the state mode of mid-side stereo coding of the current frame comprises the steps of: when the ratio value is greater than a predetermined first threshold value;
Transitioning the mid-side stereo coding state mode of the current frame to the second state mode; and when the value of the ratio is less than the reciprocal of the first threshold, the mid-side of the current frame. The step of transitioning the stereo coding state mode to the third state mode, and the mid-side stereo coding state mode of the immediately preceding preceding frame is in the third state mode, and the value of the ratio is When being less than or equal to the first threshold and greater than a second predetermined threshold that is less than the first threshold, the state mode of midside stereo coding of the current frame is changed to the first mode. And the state mode of the mid-side stereo coding of the preceding immediately preceding frame is in the second state mode. Is greater than or equal to the reciprocal of the first threshold and less than the reciprocal of the second threshold, the mid-side stereo coding state mode of the current frame is changed to the first state mode. The method for encoding an audio signal according to claim 2, further comprising:

4. The step of determining a mid-side stereo coding state mode of the current frame, wherein the mid-side stereo coding state mode of the immediately preceding frame is in the second state mode, When the ratio value is greater than or equal to the reciprocal of the second threshold, holding the mid-side stereo coding state mode of the current frame in the second state mode; When the state mode of midside stereo coding is in the third state mode and the value of the ratio is less than or equal to the second threshold value, the state mode of midside stereo coding of the current frame is set to the third state mode. 4. The method of encoding an audio signal according to claim 3, further comprising the step of holding in three state modes.

5. A digitized audio signal comprising a pair of stereo channels consisting of a left channel and a right channel, each frame being divided into consecutive frames of a plurality of time samples. An apparatus for encoding an audio signal for encoding into any one of a plurality of state modes of stereo encoding, calculating respective energies of a left channel and a right channel of a current frame of the audio signal. The mid-side stereo of the current frame based on the calculation means, the value of the ratio of the energy of the right channel to the energy of the left channel, and the state mode of the mid-side stereo coding of at least one preceding immediately preceding frame. And a determining means for determining the state mode of encoding. Encoding apparatus of an audio signal, characterized in that the.

6. The mid-side stereo coding includes a first state mode including a sum signal and a difference signal of left and right channel signals, and a sum signal of left and right channel signals and a left channel signal. 6. The audio signal encoding apparatus according to claim 5, further comprising a second state mode and a third state mode including a sum signal of left and right channel signals and a right channel signal.

7. The determining means, when the value of the ratio is larger than a predetermined first threshold value,
Transitioning the state mode of mid-side stereo coding of the current frame to the second state mode, and when the value of the ratio is less than the reciprocal of the first threshold, the mid-side stereo code of the current frame. The state mode of encoding is transited to the third state mode, the state mode of the mid-side stereo coding of the preceding immediately preceding frame is in the third state mode, and the value of the ratio is the first When it is less than or equal to the threshold value and is greater than a predetermined second threshold value that is smaller than the first threshold value, the mid-side stereo coding state mode of the current frame is changed to the first state mode. The state mode of mid-side stereo coding of the preceding preceding frame is transited to the second state mode, and the value of the ratio is less than or equal to the reciprocal of the first threshold. 7. And the audio signal according to claim 6, wherein the state mode of midside stereo coding of the current frame is transited to the first state mode when it is less than the reciprocal of the second threshold. Signal coding device.

8. The determination means is characterized in that the state mode of mid-side stereo coding of the immediately preceding frame is in the second state mode, and the value of the ratio is not less than the reciprocal of the second threshold value. , The mid-side stereo coding state mode of the current frame is held in the second state mode, and the mid-side stereo coding state mode of the immediately preceding preceding frame is set to the third state mode. 8. If the ratio value is less than or equal to the second threshold value, the mid-side stereo coding state mode of the current frame is held in the third state mode. Audio signal encoding device.

9. An audio signal encoding and decoding system comprising the audio signal encoding device according to claim 5 and an audio signal decoding device, wherein: An audio signal decoding apparatus decodes the encoded audio signal based on the determined state mode of the midside stereo encoding to obtain an audio signal including a pair of decoded stereo channels. A system for encoding and decoding an audio signal, characterized by comprising means for outputting.

10. A program comprising steps relating to the audio signal encoding method according to claim 1. Description:

11. A computer-readable recording medium in which the program according to claim 10 is recorded.