JP2007183528A

JP2007183528A - Encoding apparatus, encoding method, and encoding program

Info

Publication number: JP2007183528A
Application number: JP2006117345A
Authority: JP
Inventors: Masanao Suzuki; 政直鈴木; Masakiyo Tanaka; 正清田中; Yoshiteru Tsuchinaga; 義照土永; Miyuki Shirakawa; 美由紀白川; Takashi Makiuchi; 孝志牧内
Original assignee: Fujitsu Ltd
Current assignee: Fujitsu Ltd
Priority date: 2005-12-06
Filing date: 2006-04-21
Publication date: 2007-07-19

Abstract

<P>PROBLEM TO BE SOLVED: To perform encoding with less deterioration in sound quality even under a low-bit-rate condition. <P>SOLUTION: An encoding apparatus 400 compresses a stereo signal by using a sum signal and a difference signal of a left component signal and a right component signal of the stereo signal. The encoding apparatus includes: a complexity calculating unit 406 that calculates complexity of the sum signal and complexity of the difference signal; a bit allocation setting unit 407 that sets, based on the complexities calculated by the complexity calculating unit 406, an allocation rate of bits to be allocated in quantizing the sum signal and the difference signal; and a sum signal quantizing unit 408 and a difference signal quantizing unit 409 that quantize the sum signal and the difference signal based on the allocation rate set by the bit allocation setting unit 407. <P>COPYRIGHT: (C)2007,JPO&INPIT

Description

この発明は、音声や音楽などのオーディオ信号を低ビットレートで圧縮するために、左チャネルと右チャネルとからなるステレオ信号を効率良く符号化する符号化装置、符号化方法、および符号化プログラムに関する。 The present invention relates to an encoding apparatus, an encoding method, and an encoding program for efficiently encoding a stereo signal composed of a left channel and a right channel in order to compress an audio signal such as voice or music at a low bit rate. .

従来、音声や音楽などのオーディオ信号を直交変換して得られる周波数スペクトルを符号化する方式として、ＩＳＯ／ＩＥＣ１３８１８−７ＭＰＥＧ−２のオーディオ規格であるＡＡＣ（ＡｄｖａｎｃｅｄＡｕｄｉｏＣｏｄｉｎｇ）方式が知られている。ＡＡＣ方式は、地上波デジタルラジオ放送の音声符号化方式として採用されており、さらに、ＭＳ（ミッド・サイド）ステレオ符号化と呼ばれる技術を用いてステレオ信号の圧縮効率を高めている。 Conventionally, an AAC (Advanced Audio Coding) method, which is an audio standard of ISO / IEC 13818-7 MPEG-2, is known as a method for encoding a frequency spectrum obtained by orthogonally transforming audio signals such as voice and music. Yes. The AAC system is adopted as an audio encoding system for terrestrial digital radio broadcasting, and further, the compression efficiency of stereo signals is increased by using a technique called MS (mid side) stereo encoding.

図１２は、ＭＳステレオ符号化の符号化手順を示すブロック図である。図１２に示したＭＳステレオ符号化装置１２００は、まず左チャネルのオーディオ信号（Ｌ）をＬ直交変換部１２０１によって直交変換し、右チャネルのオーディオ信号（Ｒ）をＲ直交変換部１２０２によって直交変換する。変換後のＬ，Ｒは、ＭＳステレオ変換部１２０３へ入力され、入力されたＬ，Ｒから和信号Ｍ（Ｍ＝（Ｌ＋Ｒ）／２）と、差信号Ｓ（Ｓ＝（Ｌ−Ｒ）／２）とにそれぞれ変換される。さらに、和信号Ｍは、和信号量子化器１２０４によって符号化される（符号語１）。同様に差信号Ｓは、差信号量子化器１２０５によって符号化される（符号語２）。 FIG. 12 is a block diagram illustrating an encoding procedure of MS stereo encoding. The MS stereo encoding apparatus 1200 shown in FIG. 12 first orthogonally transforms the left channel audio signal (L) by the L orthogonal transform unit 1201, and orthogonal transforms the right channel audio signal (R) by the R orthogonal transform unit 1202. To do. The converted L and R are input to the MS stereo conversion unit 1203. From the input L and R, the sum signal M (M = (L + R) / 2) and the difference signal S (S = (LR) / And 2). Furthermore, the sum signal M is encoded by the sum signal quantizer 1204 (codeword 1). Similarly, the difference signal S is encoded by the difference signal quantizer 1205 (codeword 2).

ＭＳステレオ符号化の際、ＭＳステレオ変換部１２０３において、ＬとＲとの相関が高い、つまり類似性が高い場合は、和信号Ｍに比べて差信号Ｓの電力が小さくなる。したがって、差信号Ｓの符号化ビット数を少なくして、和信号Ｍの符号化ビット数を多くすることで、符号化効率を向上させることができる。 At the time of MS stereo encoding, when the correlation between L and R is high in the MS stereo conversion unit 1203, that is, the similarity is high, the power of the difference signal S is smaller than that of the sum signal M. Therefore, by reducing the number of encoded bits of the difference signal S and increasing the number of encoded bits of the sum signal M, the encoding efficiency can be improved.

また、ＭＳステレオ符号化による変換に加え、符号化効率を高める方法として、適応的に差信号Ｓをモノラル化する技術が開示されている（例えば、下記特許文献１参照。）。図１３は、適応モノラル化の原理を示す説明図である。図１３に示した図表１３００を用いて適応モノラル化を説明する。図表１３１０，１３２０は、あるオーディオ信号のＬとＲとの波形を示す図表である。また、図表１３３０，１３４０は、ＬとＲを用いて生成した和信号Ｍと、差信号Ｓの波形を示す図表である。Ｌの波形１３１１と、Ｒの波形１３２１とは、それぞれ和信号Ｍの波形１３３１と、差信号Ｓの波形１３４１とに変換される。 In addition to conversion by MS stereo encoding, a technique for adaptively monauralizing the difference signal S is disclosed as a method for improving encoding efficiency (see, for example, Patent Document 1 below). FIG. 13 is an explanatory diagram showing the principle of adaptive monauralization. The adaptive monauralization will be described with reference to the chart 1300 shown in FIG. Charts 1310 and 1320 are charts showing waveforms of L and R of an audio signal. Charts 1330 and 1340 are charts showing the waveforms of the sum signal M and the difference signal S generated using L and R, respectively. The L waveform 1311 and the R waveform 1321 are converted into a waveform 1331 of the sum signal M and a waveform 1341 of the difference signal S, respectively.

ここで、Ｌ，Ｒから和信号Ｍ、差信号Ｓへの変換に際して周波数ｆの信号に着目する。モノラル化では、ＬとＲとの類似度を求め、ＬとＲとの類似度が高い場合は、差信号Ｓを無音化もしくは小さい振幅を持った信号に変形する。ＬとＲとの類似度が高い場合は差信号Ｓ＝（Ｌ−Ｒ）／２≒０となるので差信号Ｓのビット数を減らして０とする。つまり、差信号Ｓを示す波形１３４１では、周波数ｆの信号が０となり、その分のビットを和信号Ｍを示す波形１３３１の周波数ｆの信号に割り当てている。したがって、和信号Ｍのビット数が増加し、量子化に伴うオーディオ信号の歪みを小さくすることができる。 Here, attention is focused on a signal of frequency f when converting from L and R into a sum signal M and a difference signal S. In monauralization, the similarity between L and R is obtained. If the similarity between L and R is high, the difference signal S is silenced or transformed into a signal having a small amplitude. When the similarity between L and R is high, the difference signal S = (LR) / 2≈0, so the number of bits of the difference signal S is reduced to zero. That is, in the waveform 1341 indicating the difference signal S, the signal of the frequency f is 0, and the corresponding bit is assigned to the signal of the frequency f of the waveform 1331 indicating the sum signal M. Therefore, the number of bits of the sum signal M increases, and the distortion of the audio signal accompanying quantization can be reduced.

特開２００１−２５５８９２号公報Japanese Patent Laid-Open No. 2001-255892

しかしながら、地上波デジタルラジオ放送では、ＣＤ並みの高品質音声（音楽）と映像を合計３３０ｋｂｐｓ程度で実現するため、音声に割り当てられるビットレートは３２ｋｂｐｓ〜６４ｋｂｐｓと非常に小さい。したがって、従来のＭＳステレオ符号化技術では量子化ビット数の不足による音質劣化が避けられないという問題があった。 However, in terrestrial digital radio broadcasting, high-quality audio (music) and video equivalent to CD are realized at a total of about 330 kbps, and therefore the bit rate assigned to audio is as very low as 32 kbps to 64 kbps. Therefore, the conventional MS stereo encoding technique has a problem that sound quality deterioration due to an insufficient number of quantization bits cannot be avoided.

また、上記の特許文献１に記載の適応モノラル化を用いた場合であっても、差信号Ｓが０の帯域、つまり、モノラル化された帯域においては差信号Ｓの量子化ビット数を減らすことができるが、モノラル化できない帯域では差信号Ｓの量子化ビット数を減らすことができないため、低ビットレート条件では十分な音質が得られないという問題があった。 Even when the adaptive monauralization described in Patent Document 1 is used, the number of quantization bits of the difference signal S is reduced in the band where the difference signal S is 0, that is, in the monaural band. However, since the number of quantized bits of the difference signal S cannot be reduced in a band that cannot be monaural, there is a problem that sufficient sound quality cannot be obtained under low bit rate conditions.

この発明は、上述した従来技術による問題点を解消するため、低ビットレート条件でも音質劣化の少ない符号化を実現する符号化装置、符号化方法、および符号化プログラムを提供することを目的とする。 An object of the present invention is to provide an encoding device, an encoding method, and an encoding program that can realize encoding with little deterioration in sound quality even under a low bit rate condition in order to eliminate the above-described problems caused by the prior art. .

上述した課題を解決し、目的を達成するため、この発明にかかる符号化装置は、ステレオ信号の左成分信号と右成分信号との和信号と、差信号とを用いてステレオ信号を圧縮する符号化装置において、前記和信号の複雑度と、前記差信号の複雑度とをそれぞれ求める複雑度算出手段と、前記複雑度算出手段によって求めた複雑度に応じて前記和信号と、前記差信号とをそれぞれ量子化する際のビット数の分配割合を設定するビット数設定手段と、前記ビット数設定手段によって決定された前記分配割合に応じて前記和信号と、前記差信号をそれぞれ量子化する量子化手段と、を備えることを特徴とする。 In order to solve the above-described problems and achieve the object, an encoding apparatus according to the present invention is a code that compresses a stereo signal using a sum signal of a left component signal and a right component signal of a stereo signal and a difference signal. In the encoding apparatus, complexity calculation means for obtaining the complexity of the sum signal and the complexity of the difference signal, respectively, the sum signal according to the complexity obtained by the complexity calculation means, and the difference signal A bit number setting means for setting a distribution ratio of the number of bits when each of the signals is quantized, and a quantizer for respectively quantizing the sum signal and the difference signal according to the distribution ratio determined by the bit number setting means And a converting means.

この発明によれば、類似度に基づいて差信号をモノラル化することにより量子化ビットを削減することができる。また、和信号Ｍと修正差信号Ｓ’の複雑度に応じて量子化ビット数を適応的に配分することができるため、従来技術に比べて効率のよい符号化を図ることができる。 According to the present invention, it is possible to reduce the number of quantization bits by making the difference signal monaural based on the similarity. In addition, since the number of quantization bits can be adaptively allocated according to the complexity of the sum signal M and the modified difference signal S ′, more efficient encoding can be achieved compared to the prior art.

本発明にかかる符号化装置、符号化方法、および符号化プログラムによれば、低ビットレート条件であっても音質劣化の少ない、高音質な音声（音楽）として再生することができるという効果を奏する。 According to the encoding device, encoding method, and encoding program of the present invention, it is possible to reproduce high-quality sound (music) with little deterioration in sound quality even under low bit rate conditions. .

以下に添付図面を参照して、この発明にかかる符号化装置、符号化方法、および符号化プログラムの好適な実施の形態を詳細に説明する。 Exemplary embodiments of an encoding device, an encoding method, and an encoding program according to the present invention will be explained below in detail with reference to the accompanying drawings.

（符号化の原理）
まず、図１〜図３を用いて本発明にかかる符号化方法の原理を説明する。図１は、通常のモノラル化を示す説明図である。図１に示した図表１００は、差信号Ｓの電力を示す図表１１０と、和信号Ｍのビット数を示す図表１２０と、和信号Ｍの複雑度を示す図表１３０とを表している。 (Principle of encoding)
First, the principle of the encoding method according to the present invention will be described with reference to FIGS. FIG. 1 is an explanatory diagram showing normal monauralization. A chart 100 shown in FIG. 1 represents a chart 110 showing the power of the difference signal S, a chart 120 showing the number of bits of the sum signal M, and a chart 130 showing the complexity of the sum signal M.

図表１００に示した周波数ｆ１の信号に注目して通常のモノラル化の手順を説明する。図表１１０は、横軸が周波数を表し、縦軸が電力を表すことで、差信号Ｓの周波数ごとの電力を表している。周波数ｆ１の差信号Ｓは、モノラル化によって電力が０に変換される。この変換によって差信号Ｓは、ビット数が削減される（図表１１０の例では−５０ｂｉｔ）。 A normal monaural procedure will be described by paying attention to the signal of frequency f1 shown in chart 100. The chart 110 represents the power for each frequency of the difference signal S by representing the frequency on the horizontal axis and representing the power on the vertical axis. The difference signal S having the frequency f1 is converted into 0 by monauralization. The number of bits of the difference signal S is reduced by this conversion (−50 bits in the example of the chart 110).

図表１２０は、横軸が周波数を表し、縦軸が量子化した際のビット数を表すことで、和信号Ｍの周波数ごとの量子化ビット数を表している。図表１２０においては、図表１１０に示したモノラル化によって削減された差信号Ｓのビット（−５０ｂｉｔ）が、周波数ｆ１の元のビット数１２１に新たにビット数１２２（＋５０ｂｉｔ）として上乗せされる。 In the chart 120, the horizontal axis represents the frequency, and the vertical axis represents the number of bits when quantized, thereby representing the number of quantization bits for each frequency of the sum signal M. In the chart 120, the bit (−50 bits) of the difference signal S reduced by monauralization shown in the chart 110 is newly added as the number of bits 122 (+50 bits) to the original number of bits 121 of the frequency f1.

図表１３０は、横軸が周波数を表し、縦軸が複雑度を表すことで、和信号Ｍの周波数ごとの複雑度を表している。図表１３０に示した例では、周波数ｆ１の和信号Ｍの複雑度１３１と、周波数ｆ２の和信号Ｍの複雑度１３２が高いことがわかる。周波数ｆ１の和信号Ｍは、図表１２０で説明したように、周波数ｆ１の差信号Ｓの削減部分のビット数１２２が上乗せされている。したがって、周波数ｆ１の和信号Ｍは、量子化誤差を小さくすることができ、音質の向上が期待できる。 In the chart 130, the horizontal axis represents the frequency, and the vertical axis represents the complexity, so that the complexity for each frequency of the sum signal M is represented. In the example shown in the chart 130, it can be seen that the complexity 131 of the sum signal M at the frequency f1 and the complexity 132 of the sum signal M at the frequency f2 are high. As described in the chart 120, the sum signal M of the frequency f1 is added with the number of bits 122 of the reduced portion of the difference signal S of the frequency f1. Therefore, the sum signal M having the frequency f1 can reduce the quantization error and can be expected to improve the sound quality.

しかしながら、通常のモノラル化の場合、ビット数が上乗せされるのは差信号Ｓのビット数が削減された周波数の信号に限られている。周波数ｆ１と同様に複雑度が高い周波数ｆ２の和信号Ｍのビット数１２３には、新たなビット数の上乗せ（例えば、破線で示したビット数１２４）は、行われない。したがって、周波数ｆ２の和信号Ｍは、量子化誤差を小さくできず、音質を向上させることができない。 However, in the case of normal monauralization, the number of bits is added only to a signal having a frequency in which the number of bits of the difference signal S is reduced. Similar to the frequency f1, the number of bits 123 of the sum signal M of the frequency f2 having high complexity is not added with a new number of bits (for example, the number of bits 124 indicated by a broken line). Therefore, the sum signal M of the frequency f2 cannot reduce the quantization error and cannot improve the sound quality.

本発明では、差信号Ｓのモノラル化によって削減されたビット数を、周波数に関係なく、同じフレーム内の各信号の複雑度に応じて振り分ける。具体的な振り分け方法としては、和信号Ｍの複雑度に応じてビット数を振り分ける方法と、差信号Ｓの複雑度に応じてビット数を振り分ける方法とを用いる。以下、図２，３を用いてそれぞれの振り分け方法について説明する。 In the present invention, the number of bits reduced by making the difference signal S monaural is distributed according to the complexity of each signal in the same frame regardless of the frequency. As a specific distribution method, a method of distributing the number of bits according to the complexity of the sum signal M and a method of distributing the number of bits according to the complexity of the difference signal S are used. Hereinafter, each distribution method will be described with reference to FIGS.

まず、図２は、和信号Ｍの複雑度に応じてビット数を振り分ける方法を示す説明図である。ここでは、和信号Ｍの複雑度を調べ、差信号Ｓで削減したビットを、和信号Ｍを表すビットのうち、複雑な周波数のビットに振り分ける方法について説明する。図２に示した図表２００は、差信号Ｓの電力を示す図表２１０と、和信号Ｍのビット数を示す図表２２０と、和信号Ｍの複雑度を示す図表２３０とを表している。 First, FIG. 2 is an explanatory diagram showing a method of distributing the number of bits according to the complexity of the sum signal M. Here, a method of examining the complexity of the sum signal M and allocating the bits reduced by the difference signal S to bits of a complex frequency among the bits representing the sum signal M will be described. The chart 200 shown in FIG. 2 represents a chart 210 showing the power of the difference signal S, a chart 220 showing the number of bits of the sum signal M, and a chart 230 showing the complexity of the sum signal M.

図表２１０は、横軸が周波数を表し、縦軸が電力を表すことで、差信号Ｓの周波数ごとの電力を表している。ここで、周波数ｆ１の差信号Ｓは、モノラル化によって電力が０に変換される。この変換によって差信号Ｓは、ビット数が削減される（図表２１０の例では−５０ｂｉｔ）。 In the chart 210, the horizontal axis represents the frequency, and the vertical axis represents the power, so that the power for each frequency of the difference signal S is represented. Here, the power of the difference signal S of the frequency f1 is converted to 0 by monauralization. This conversion reduces the number of bits of the difference signal S (−50 bits in the example of the chart 210).

図表２２０は、横軸が周波数を表し、縦軸が量子化した際のビット数を表すことで、和信号Ｍの周波数ごとの量子化ビット数を表している。図表２１０に示したように周波数ｆ１の差信号Ｓから削減されたビット数（−５０ｂｉｔ）を、周波数ｆ１の和信号Ｍの元のビット数２２１と、周波数ｆ２の和信号Ｍの元のビット数２２４とにそれぞれ振り分け、上乗せする。図表２２０の例では、周波数ｆ１の和信号Ｍには、＋２０ｂｉｔのビット数２２２が上乗せされて、周波数ｆ２の和信号Ｍには、＋３０ｂｉｔのビット数２２３が上乗せされている。 In the chart 220, the horizontal axis represents frequency, and the vertical axis represents the number of bits when quantized, so that the number of quantization bits for each frequency of the sum signal M is represented. As shown in FIG. 210, the number of bits (−50 bits) reduced from the difference signal S of the frequency f1 is changed to the original number of bits 221 of the sum signal M of the frequency f1 and the original number of bits of the sum signal M of the frequency f2. 224 and add to each. In the example of the chart 220, the sum signal M of the frequency f1 is added with a bit number 222 of +20 bits, and the sum signal M of the frequency f2 is added with a bit number 223 of +30 bits.

図表２３０は、横軸が周波数を表し、縦軸が複雑度を表すことで、和信号Ｍの周波数ごとの複雑度を表している。図表２２０に示したような和信号Ｍへのビット数の上乗せは、図表２３０に示した和信号Ｍの周波数ごとの複雑度に応じて決定する。したがって、周波数ｆ１の和信号Ｍの複雑度２３１と、周波数ｆ２の和信号Ｍの複雑度２３２とを、図表２２０によって振り分けられたビット数２２２，２２３に対応させている。 In the chart 230, the horizontal axis represents the frequency, and the vertical axis represents the complexity, so that the complexity of each frequency of the sum signal M is represented. The addition of the number of bits to the sum signal M as shown in the chart 220 is determined according to the complexity of each frequency of the sum signal M shown in the chart 230. Therefore, the complexity 231 of the sum signal M at the frequency f1 and the complexity 232 of the sum signal M at the frequency f2 are made to correspond to the number of bits 222 and 223 distributed by the chart 220.

一方、図３は、差信号Ｓの複雑度に応じてビット数を振り分ける方法を示す説明図である。ここでは、差信号Ｓの複雑度を調べ、差信号Ｓで削減したビットを、差信号Ｓを表すビットのうち、複雑な周波数のビットに振り分ける方法について説明する。図３に示した図表３００は、差信号Ｓの電力を示す図表３１０と、差信号Ｓのビット数を示す図表３２０と、差信号Ｓの複雑度を示す図表３３０とを表している。 On the other hand, FIG. 3 is an explanatory diagram showing a method of distributing the number of bits according to the complexity of the difference signal S. Here, a method of examining the complexity of the difference signal S and allocating the bits reduced by the difference signal S to bits having a complex frequency among the bits representing the difference signal S will be described. The chart 300 shown in FIG. 3 represents a chart 310 that shows the power of the difference signal S, a chart 320 that shows the number of bits of the difference signal S, and a chart 330 that shows the complexity of the difference signal S.

図表３１０は、横軸が周波数を表し、縦軸が電力を表すことで、差信号Ｓの周波数ごとの電力を表している。ここで、周波数ｆ１の差信号Ｓは、モノラル化によって電力が０に変換される。この変換によって差信号Ｓは、ビット数が削減される（図表３１０の例では−５０ｂｉｔ）。 In the chart 310, the horizontal axis represents the frequency, and the vertical axis represents the power, so that the power for each frequency of the difference signal S is represented. Here, the power of the difference signal S of the frequency f1 is converted to 0 by monauralization. The number of bits of the difference signal S is reduced by this conversion (−50 bits in the example of the chart 310).

図表３２０は、横軸が周波数を表し、縦軸が量子化した際のビット数を表すことで、差信号Ｓの周波数ごとの量子化ビット数を表している。図表３１０に示したように周波数ｆ１の差信号Ｓから削減されたビット数（−５０ｂｉｔ）３２１を、周波数ｆ０の差信号Ｓの元のビット数３２２と、周波数ｆ２の差信号Ｓの元のビット数３２４とにそれぞれ振り分け、上乗せする。差信号Ｓにビットを上乗せする場合は、図表３１０に示したように周波数ｆ１の差信号Ｓは、０に変換されているため、ビット数３２１を必要としない。したがって、差信号Ｓの複雑度に応じて、周波数ｆ０と周波数ｆ２のそれぞれの差信号Ｓは、ビット数（図３の例ではビット数３２３，３２５）の上乗せによりビット数が増加し、量子化誤差が減少する。 In the chart 320, the horizontal axis represents the frequency, and the vertical axis represents the number of bits when quantized, thereby representing the number of quantization bits for each frequency of the difference signal S. As shown in the chart 310, the number of bits (−50 bits) 321 reduced from the difference signal S of the frequency f1 is changed to the original number of bits 322 of the difference signal S of the frequency f0 and the original bits of the difference signal S of the frequency f2. Each is assigned to Formula 324 and added. When a bit is added to the difference signal S, the difference signal S of the frequency f1 is converted to 0 as shown in the chart 310, so that the number of bits 321 is not required. Therefore, according to the complexity of the difference signal S, the difference signal S of each of the frequency f0 and the frequency f2 is increased in the number of bits due to the addition of the number of bits (the number of bits 323 and 325 in the example of FIG. 3). The error is reduced.

図表３３０は、横軸が周波数を表し、縦軸が複雑度を表すことで、差信号Ｓの周波数ごとの複雑度を表している。図表３３０に示したように、周波数ｆ０の差信号Ｓの複雑度３３２と、周波数ｆ２の差信号Ｓの複雑度３３３とが高いため、図表３２０に示したようなビット数の割り当てに反映されている。なお、周波数ｆ１の差信号Ｓは、ビット数が０であるにも拘わらず複雑度３３１を示しているが、これは、モノラル化され０に変換される前の周波数ｆ１の差信号Ｓの複雑度を示しているためである。 In the chart 330, the horizontal axis represents frequency, and the vertical axis represents complexity, so that the complexity of the difference signal S for each frequency is represented. As shown in the chart 330, since the complexity 332 of the difference signal S at the frequency f0 and the complexity 333 of the difference signal S at the frequency f2 are high, it is reflected in the allocation of the number of bits as shown in the chart 320. Yes. Note that the difference signal S of the frequency f1 indicates the complexity 331 even though the number of bits is 0, but this is the complexity of the difference signal S of the frequency f1 before being converted to monaural and converted to 0. This is because the degree is shown.

以上、図１〜図３を用いて説明したように、本発明では、モノラル化によって削減された差信号Ｓのビット数を複雑度に応じて和信号Ｍもしくは差信号Ｓの複雑度の高い信号に振り分ける。ビット数の振り分けの際には、和信号Ｍと差信号Ｓとを含む全体の複雑度を求め、重要な信号を抽出する。具体的には、差信号Ｓよりも和信号Ｍの複雑度が大きい際には、和信号Ｍに多くのビット数を割り振る。反対に、和信号Ｍよりも差信号Ｓの複雑度が大きい場合は、差信号Ｓに多くのビット数を割り振る。以下に説明する符号化装置は、説明した原理に基づいて符号化を実現する。 As described above with reference to FIGS. 1 to 3, in the present invention, the number of bits of the difference signal S reduced by monauralization is changed to a sum signal M or a signal having a high complexity of the difference signal S according to the complexity. Sort out. When distributing the number of bits, the overall complexity including the sum signal M and the difference signal S is obtained, and important signals are extracted. Specifically, when the complexity of the sum signal M is larger than the difference signal S, a larger number of bits is assigned to the sum signal M. Conversely, when the complexity of the difference signal S is larger than the sum signal M, a larger number of bits is assigned to the difference signal S. The encoding apparatus described below realizes encoding based on the described principle.

（符号化装置の基本構成）
つぎに、本発明にかかる符号化装置の基本構成を説明する。図４は、本発明にかかる符号化装置の基本構成を示すブロック図である。符号化装置４００は、上述した符号化の原理に基づいて符号化を行う。符号化装置４００は、Ｌ直交変換部４０１と、Ｒ直交変換部４０２と、ＭＳステレオ変換部４０３と、比較手段としての類似度計算部４０４と、修正手段としての差信号修正部４０５と、複雑度算出手段としての複雑度計算部４０６と、ビット数設定手段としてのビット割り当て決定部４０７と、量子化手段としての和信号量子化器４０８、および差信号量子化器４０９とから構成される。 (Basic configuration of encoding device)
Next, the basic configuration of the encoding apparatus according to the present invention will be described. FIG. 4 is a block diagram showing the basic configuration of the encoding apparatus according to the present invention. The encoding device 400 performs encoding based on the above-described encoding principle. The encoding apparatus 400 includes an L orthogonal transform unit 401, an R orthogonal transform unit 402, an MS stereo transform unit 403, a similarity calculation unit 404 as a comparison unit, a difference signal correction unit 405 as a correction unit, A complexity calculation unit 406 as a degree calculation unit, a bit allocation determination unit 407 as a bit number setting unit, a sum signal quantizer 408 and a difference signal quantizer 409 as quantization units.

Ｌ直交変換部４０１は、時間領域の入力信号（左チャネルのステレオ信号Ｌ（ｔ））を直交変換し、スペクトル信号Ｌ（ｆ）を出力する。直交変換とは時間領域ｔの空間座標から、周波数座標ｆへ変換する処理である。同様に、Ｒ直交変換部４０２は、時間領域の入力信号（右チャネルのステレオ信号Ｒ（ｔ））を直交変換し、スペクトル信号Ｒ（ｆ）を出力する。 The L orthogonal transform unit 401 orthogonally transforms the time domain input signal (left channel stereo signal L (t)) and outputs a spectrum signal L (f). Orthogonal transformation is a process of transforming from space coordinates in the time domain t to frequency coordinates f. Similarly, the R orthogonal transform unit 402 performs orthogonal transform on the time domain input signal (right channel stereo signal R (t)) and outputs a spectrum signal R (f).

ＭＳステレオ変換部４０３は、Ｌ直交変換部４０１から入力されたスペクトル信号Ｌ（ｆ）と、Ｒ直交変換部４０２から入力されたスペクトル信号Ｒ（ｆ）とをＭＳステレオ変換し、周波数に応じた値を示すスペクトル信号による和信号Ｍ（ｆ）と差信号Ｓ（ｆ）として出力する。 The MS stereo transform unit 403 performs MS stereo transform on the spectrum signal L (f) input from the L orthogonal transform unit 401 and the spectrum signal R (f) input from the R orthogonal transform unit 402, and according to the frequency The sum signal M (f) and difference signal S (f) are output as a spectrum signal indicating the value.

類似度計算部４０４は、Ｌ直交変換部４０１から入力されたスペクトル信号Ｌ（ｆ）と、Ｒ直交変換部４０２から入力されたスペクトル信号Ｒ（ｆ）との類似度を求める。類似度とはスペクトル信号Ｌ（ｆ）とスペクトル信号Ｒ（ｆ）との相関を数値的に算出した値である。類似度計算の具体的な内容に関しては、実施の形態の記述の際に詳しく説明する。類似度計算部４０４によって計算された類似度は、差信号修正部４０５に入力される。 The similarity calculation unit 404 obtains the similarity between the spectrum signal L (f) input from the L orthogonal transform unit 401 and the spectrum signal R (f) input from the R orthogonal transform unit 402. The similarity is a value obtained by numerically calculating the correlation between the spectrum signal L (f) and the spectrum signal R (f). Specific contents of the similarity calculation will be described in detail when the embodiment is described. The similarity calculated by the similarity calculation unit 404 is input to the difference signal correction unit 405.

差信号修正部４０５は、ＭＳステレオ変換部４０３から入力された差信号Ｓ（ｆ）に、類似度計算部４０４から入力された類似度に基づいて修正を施し、修正差信号Ｓ’（ｆ）を作成する。差信号修正部４０５によって行われる処理は、モノラル化に相当する。具体的な処理内容としては、周波数帯ごとの差信号Ｓの類似度があらかじめ定めた閾値よりも高いか否かの判断を行う。閾値よりも類似度が高い差信号Ｓは、すなわち差が≒０となり、モノラル化により修正差信号Ｓ’（ｆ）＝０として作成される。また、閾値より類似度が低い差信号は、差が大きいため、そのまま、修正差信号Ｓ’（ｆ）＝Ｓ（ｆ）として作成される。 The difference signal correction unit 405 corrects the difference signal S (f) input from the MS stereo conversion unit 403 based on the similarity input from the similarity calculation unit 404, and the correction difference signal S ′ (f). Create The process performed by the difference signal correction unit 405 corresponds to monauralization. As specific processing contents, it is determined whether or not the similarity of the difference signal S for each frequency band is higher than a predetermined threshold. The difference signal S having a similarity higher than the threshold value, that is, the difference becomes ≈0, and is created as a corrected difference signal S ′ (f) = 0 by monauralization. In addition, the difference signal having a similarity lower than the threshold value has a large difference, so that the difference signal S ′ (f) = S (f) is generated as it is.

複雑度計算部４０６は、ＭＳステレオ変換部４０３から入力された和信号Ｍ（ｆ）を用いて和信号Ｍ（ｆ）の複雑度ＰＥ＿ｍ＿ａｖｅを求め、差信号修正部４０５から入力された修正差信号Ｓ’（ｆ）を用いて修正差信号Ｓ’（ｆ）の複雑度ＰＥ＿ｓ＿ａｖｅを求める。さらに、求めた複雑度ＰＥの比を求めビット割り当て決定部４０７へ出力する。 The complexity calculation unit 406 obtains the complexity PE_m_ave of the sum signal M (f) using the sum signal M (f) input from the MS stereo conversion unit 403, and the corrected difference signal input from the difference signal correction unit 405. The complexity PE_s_ave of the corrected difference signal S ′ (f) is obtained using S ′ (f). Further, the ratio of the obtained complexity PE is obtained and output to the bit allocation determining unit 407.

ビット割り当て決定部４０７は、複雑度計算部４０６から入力された複雑度ＰＥの比の値に応じてビット数の分配の割合を決定し、和信号量子化器４０８と、差信号量子化器４０９とにそれぞれビット割り当て情報を出力する。割り当ての際には、複雑度ＰＥの比と閾値との比較に基づいて行う。 The bit allocation determination unit 407 determines the distribution ratio of the number of bits according to the ratio value of the complexity PE input from the complexity calculation unit 406, and performs a sum signal quantizer 408 and a difference signal quantizer 409. And bit allocation information are output respectively. Allocation is performed based on a comparison between the ratio of the complexity PE and a threshold value.

和信号量子化器４０８は、ＭＳステレオ変換部４０３から入力された和信号Ｍ（ｆ）を、ビット割り当て決定部４０７から入力されたビット割り当て情報に基づいて、量子化する。量子化後の和信号Ｍ（ｆ）は、符号語１として出力される。同様に、差信号量子化器４０９は、差信号修正部４０５から入力された修正差信号Ｓ’（ｆ）を、ビット割り当て決定部４０７から入力されたビット割り当て情報に基づいて、量子化する。量子化後の差信号Ｓ（ｆ）は、符号語２として出力される。 The sum signal quantizer 408 quantizes the sum signal M (f) input from the MS stereo conversion unit 403 based on the bit allocation information input from the bit allocation determination unit 407. The quantized sum signal M (f) is output as codeword 1. Similarly, the difference signal quantizer 409 quantizes the modified difference signal S ′ (f) input from the difference signal correcting unit 405 based on the bit allocation information input from the bit allocation determining unit 407. The quantized difference signal S (f) is output as codeword 2.

本発明にかかる符号化装置４００は、以上説明したような基本構成を用いてステレオ信号の符号化を行う。つぎに、各機能部の具体的な構成例とその処理内容について詳しく説明する。ここでは、符号化装置の構成例を実施の形態１〜実施の形態３として説明する。 The encoding apparatus 400 according to the present invention encodes a stereo signal using the basic configuration as described above. Next, a specific configuration example and processing contents of each functional unit will be described in detail. Here, a configuration example of the encoding device will be described as Embodiment 1 to Embodiment 3.

（実施の形態１）
実施の形態１では、図４に示した複雑度計算部４０６に対応する複雑度計算部５１０（図５−１参照）において、和信号Ｍと修正差信号Ｓ’とのそれぞれの心理視聴エントロピー（ＰＥ値）を求め、ＰＥ値の比を複雑度として出力する。また、ビット割り当て決定部４０７では、あらかじめ定めておいた複雑度と修正差信号Ｓ’との対応関係に応じてビット数の分配割合を決定する。 (Embodiment 1)
In the first embodiment, in the complexity calculation unit 510 (see FIG. 5A) corresponding to the complexity calculation unit 406 shown in FIG. 4, the psychological viewing entropy of the sum signal M and the corrected difference signal S ′ ( PE value) is obtained, and the ratio of PE values is output as complexity. Also, the bit allocation determining unit 407 determines the distribution ratio of the number of bits according to the correspondence between the complexity determined in advance and the correction difference signal S ′.

図５−１は、実施の形態１にかかる符号化装置の構成を示すブロック図である。図５−１に示した符号化装置５００は、図４に示した基本構成の具体的な実施例を示す。以下、図４に示した符号化装置４００の類似度計算部４０４、差信号修正部４０５、複雑度計算部４０６、ビット割り当て決定部４０７、和信号量子化器４０８および差信号量子化器４０９の具体的な処理について説明する。 FIG. 5A is a block diagram of the configuration of the encoding apparatus according to the first embodiment. A coding apparatus 500 illustrated in FIG. 5A illustrates a specific example of the basic configuration illustrated in FIG. 4. Hereinafter, the similarity calculation unit 404, the difference signal correction unit 405, the complexity calculation unit 406, the bit allocation determination unit 407, the sum signal quantizer 408, and the difference signal quantizer 409 of the encoding device 400 illustrated in FIG. Specific processing will be described.

図５−２は、実施の形態１にかかる符号化装置の符号化処理の手順を示すフローチャートである。図５−２のフローチャートにおいて、まず、ＭＤＣＴ５０１およびＭＤＣＴ５０２において、左右のステレオ信号Ｌ（ｔ），Ｒ（ｔ）のＭＤＣＴ変換を行う（ステップＳ５２１）。実施の形態１〜実施の形態３ではＬ直交変換部４０１およびＲ直交変換部４０２の処理を実現するために、ＭＤＣＴ（ＭｏｄｉｆｉｅｄＤｉｓｃｒｅｔｅＣｏｓｉｎｅＴｒａｎｓｆｏｒｍ；変形離散コサイン変換）を用いる。ＭＤＣＴは、通常のＤＣＴ演算では成分抽出時にブロック境界部分でブロック歪みが発生するため、ブロック区間長の５０％を隣接ブロックとオーバーラップすることによりブロック歪みを除去する変換処理である。 FIG. 5-2 is a flowchart of an encoding process performed by the encoding apparatus according to the first embodiment. In the flowchart of FIG. 5-2, first, MDCT conversion of the left and right stereo signals L (t) and R (t) is performed in the MDCT 501 and the MDCT 502 (step S521). In Embodiments 1 to 3, MDCT (Modified Discrete Cosine Transform) is used to realize the processing of L orthogonal transform unit 401 and R orthogonal transform unit 402. MDCT is a conversion process that removes block distortion by overlapping 50% of the block section length with an adjacent block because block distortion occurs at the block boundary portion during component extraction in normal DCT calculation.

続いて、ＭＳステレオ変換部４０３によって、左右のスペクトル信号Ｌ（ｆ），Ｒ（ｆ）にＭＳステレオ変換を行う（ステップＳ５２２）。また、類似度計算部４０４においては、スペクトル信号Ｌ（ｆ）とスペクトル信号Ｒ（ｆ）との類似度を計算する（ステップＳ５２３）。ここで、類似度計算部４０４における類似度計算について詳しく説明する。類似度は、スペクトル信号Ｌ（ｆ）とスペクトル信号Ｒ（ｆ）との相関を用いる。 Subsequently, the MS stereo conversion unit 403 performs MS stereo conversion on the left and right spectrum signals L (f) and R (f) (step S522). In addition, the similarity calculation unit 404 calculates the similarity between the spectrum signal L (f) and the spectrum signal R (f) (step S523). Here, the similarity calculation in the similarity calculation unit 404 will be described in detail. For the similarity, the correlation between the spectrum signal L (f) and the spectrum signal R (f) is used.

図６は、信号の帯域の上限と下限の関係を示す図表である。図表６００は、横軸が周波数ｆを表し、縦軸がステレオ信号Ｌの電力を表す。各信号は、複数の周波数帯域（例えば、周波数帯６０１〜６０３で示した帯域ｉ−１，ｉ，ｉ＋１）によって構成されているため、周波数帯域ごとに下記（１）式を用いて相関ｃｏｒ（ｉ）を求める。したがって、類似度計算部４０４から相関ｃｏｒ（ｉ）が差信号修正部４０５へ入力される。 FIG. 6 is a chart showing the relationship between the upper limit and the lower limit of the signal band. In the chart 600, the horizontal axis represents the frequency f, and the vertical axis represents the power of the stereo signal L. Since each signal is composed of a plurality of frequency bands (for example, bands i-1, i, i + 1 shown by frequency bands 601 to 603), the correlation cor ( i) is determined. Therefore, the correlation cor (i) is input from the similarity calculation unit 404 to the difference signal correction unit 405.

その後、相関ｃｏｒ（ｉ）に基づいて、差信号修正部４０５によって、ＭＳステレオ変換部４０３から入力された差信号Ｓ（ｆ）の修正を行う（ステップＳ５２４）。差信号修正部４０５は、差信号Ｓ（ｆ）の帯域ごとに相関ｃｏｒ（ｉ）と閾値との比較を行う。具体的には、相関ｃｏｒ（ｉ）が閾値以上の場合は、帯域ｉ（図６参照）に含まれる全周波数ｆについて修正差信号Ｓ’（ｆ）＝０とする。また、相関ｃｏｒ（ｉ）が閾値以下の場合は、帯域ｉ（図６参照）に含まれる全周波数ｆについて修正差信号Ｓ’（ｆ）＝Ｓ（ｆ）とする。 Thereafter, based on the correlation cor (i), the difference signal correction unit 405 corrects the difference signal S (f) input from the MS stereo conversion unit 403 (step S524). The difference signal correction unit 405 compares the correlation cor (i) with a threshold value for each band of the difference signal S (f). Specifically, when the correlation cor (i) is equal to or greater than the threshold value, the corrected difference signal S ′ (f) = 0 is set for all frequencies f included in the band i (see FIG. 6). When the correlation cor (i) is equal to or smaller than the threshold value, the corrected difference signal S ′ (f) = S (f) is set for all the frequencies f included in the band i (see FIG. 6).

つぎに、複雑度計算部５１０によって行われる複雑度計算の詳細な処理について説明する。複雑度計算部５１０は、許容誤差計算部５０３と、電力計算部５０４と、ＰＥ値計算部５０５と、ＰＥ比計算部５０６とから構成される。複雑度計算部５１０では、まず、許容誤差計算部５０３によって、許容誤差計算を行う（ステップＳ５２５）。 Next, detailed processing of complexity calculation performed by the complexity calculator 510 will be described. The complexity calculation unit 510 includes an allowable error calculation unit 503, a power calculation unit 504, a PE value calculation unit 505, and a PE ratio calculation unit 506. In the complexity calculation unit 510, first, an allowable error calculation is performed by the allowable error calculation unit 503 (step S525).

許容誤差計算部５０３は、ＭＳステレオ変換部４０３から和信号Ｍ（ｆ）が入力され、差信号修正部４０５から修正差信号Ｓ’（ｆ）が入力され、帯域ｉにおける和信号Ｍ（ｆ）の許容誤差電力ｎ＿ｍ（ｉ）と、修正差信号Ｓ’（ｆ）の許容誤差電力ｎ＿ｓ（ｉ）を求める。このステップにおける許容誤差電力の算出としては、例えば、公知の技術である心理視聴モデルにおける許容誤差電力の計算（ＩＳＯ／ＩＥＣ１３８１８−７：２００３，ＡｄｖａｎｃｅｄＡｕｄｕｏＣｏｄｉｎｇ）を用いることができる。 The allowable error calculation unit 503 receives the sum signal M (f) from the MS stereo conversion unit 403, receives the correction difference signal S ′ (f) from the difference signal correction unit 405, and adds the sum signal M (f) in the band i. The allowable error power n_m (i) and the allowable error power n_s (i) of the corrected difference signal S ′ (f) are obtained. As calculation of allowable error power in this step, for example, calculation of allowable error power in a psychological viewing model (ISO / IEC 13818-7: 2003, Advanced Audio Coding), which is a known technique, can be used.

続いて、電力計算部５０４によって電力計算を行う（ステップＳ５２６）。電力計算部５０４は、ＭＳステレオ変換部４０３から入力された和信号Ｍ（ｆ）の帯域ｉにおける電力ｅ＿ｍ（ｉ）と、差信号修正部４０５から入力された修正差信号Ｓ’（ｆ）の帯域ｉにおける電力ｅ＿ｓ（ｉ）を下記の（２），（３）式から求める。 Subsequently, power calculation is performed by the power calculation unit 504 (step S526). The power calculation unit 504 includes the power e_m (i) in the band i of the sum signal M (f) input from the MS stereo conversion unit 403 and the corrected difference signal S ′ (f) input from the difference signal correction unit 405. The power e_s (i) in the band i is obtained from the following equations (2) and (3).

続いて、ＰＥ値計算部５０５によって複雑度ＰＥ値計算を行う（ステップＳ５２７）。ＰＥ値計算部５０５には、許容誤差計算部５０３から和信号Ｍの許容誤差電力ｎ＿ｍ（Ｐ１）と、修正差信号Ｓ’の許容誤差電力ｎ＿ｓ（Ｐ２）が入力され、電力計算部５０４から、和信号Ｍの電力ｅ＿ｍ（Ｐ３）と、修正差信号Ｓ’の電力ｅ＿ｓ（Ｐ４）とが入力される。ＰＥ値計算部５０５は、下記（４）式を用いて、和信号Ｍの許容誤差電力ｎ＿ｍと、和信号Ｍの電力ｅ＿ｍとから和信号Ｍの複雑度ＰＥ＿ｍを求める。同様に、下記（５）式を用いて修正差信号Ｓ’の許容誤差電力ｎ＿ｓと、修正差信号Ｓ’の電力ｅ＿ｓとから修正差信号Ｓ’の複雑度ＰＥ＿ｓを求める。なお、下記（４），（５）式のシグマに用いているｎは帯域の個数を表している。 Subsequently, the PE value calculation unit 505 performs complexity PE value calculation (step S527). The PE value calculation unit 505 receives the allowable error power n_m (P1) of the sum signal M and the allowable error power n_s (P2) of the correction difference signal S ′ from the allowable error calculation unit 503, and from the power calculation unit 504, The power e_m (P3) of the sum signal M and the power e_s (P4) of the corrected difference signal S ′ are input. The PE value calculation unit 505 obtains the complexity PE_m of the sum signal M from the allowable error power n_m of the sum signal M and the power e_m of the sum signal M using the following equation (4). Similarly, the complexity PE_s of the corrected difference signal S ′ is obtained from the allowable error power n_s of the corrected difference signal S ′ and the power e_s of the corrected difference signal S ′ using the following equation (5). Note that n used in the sigma of the following equations (4) and (5) represents the number of bands.

つぎに、ＰＥ比計算部５０６によって、ＰＥ比計算を行う（ステップＳ５２８）。ＰＥ比計算部５０６には、ＰＥ値計算部５０５から和信号Ｍの複雑度ＰＥ＿ｍと、修正差信号Ｓ’の複雑度ＰＥ＿ｓが入力される。そして、ＰＥ比計算部５０６は、和信号Ｍの複雑度ＰＥ＿ｍに対する修正差信号Ｓ’の複雑度ＰＥ＿ｓの割合を下記（６）式によって求め、複雑度の比（ＰＥ比）をｐｅ＿ｒａｔｉｏとしてビット割り当て決定部４０７へ出力する。ここまでのステップにより、複雑度計算部５１０の処理が終了する。なお、複雑度計算部５１０は、上述したようなＰＥ比の計算に替わって、ＰＥ差を求め、ビット割り当て決定部４０７に出力してもよい。さらに、ＰＥ比、またはＰＥ差を求める際には、各信号の全周波数帯域のＰＥ値の合計や平均を用いてもよい。 Next, the PE ratio calculation unit 506 performs PE ratio calculation (step S528). The PE ratio calculation unit 506 receives the complexity PE_m of the sum signal M and the complexity PE_s of the correction difference signal S ′ from the PE value calculation unit 505. Then, the PE ratio calculation unit 506 obtains the ratio of the complexity PE_s of the modified difference signal S ′ to the complexity PE_m of the sum signal M by the following equation (6), and assigns the bit as the complexity ratio (PE ratio) as pe_ratio. The data is output to the determination unit 407. With the steps so far, the processing of the complexity calculator 510 ends. The complexity calculation unit 510 may obtain a PE difference and output it to the bit allocation determination unit 407 instead of calculating the PE ratio as described above. Furthermore, when calculating the PE ratio or PE difference, the sum or average of PE values in all frequency bands of each signal may be used.

ｐｅ＿ｒａｔｉｏ＝ＰＥ＿ｓ／ＰＥ＿ｍ …（６） pe_ratio = PE_s / PE_m (6)

続いて、ビット割り当て決定部４０７における処理について説明する。まず、修正差信号Ｓ’（ｆ）の総ビット数を決定し（ステップＳ５２９）、続いて、和信号Ｍ（ｆ）の総ビット数を決定する（ステップＳ５３０）。修正差信号Ｓ’（ｆ）の総ビット数を決定する具体的な手順としては、複雑度比ｐｅ＿ｒａｔｉｏと修正差信号Ｓ’（ｆ）のビット配分量との関係をあらかじめ定めておく手順がある。 Next, processing in the bit allocation determination unit 407 will be described. First, the total number of bits of the modified difference signal S ′ (f) is determined (step S529), and then the total number of bits of the sum signal M (f) is determined (step S530). As a specific procedure for determining the total number of bits of the correction difference signal S ′ (f), there is a procedure for predetermining the relationship between the complexity ratio pe_ratio and the bit allocation amount of the correction difference signal S ′ (f). .

図７は、ＰＥ比とビット分配の関係を示す図表である。図表７００は、横軸が複雑度比ｐｅ＿ｒａｔｉｏを表し、縦軸が修正差信号Ｓ’のビット配分量を表す。また、曲線７０１は、複雑度比ｐｅ＿ｒａｔｉｏとビット配分の関係を示している。ビット割り当て決定部４０７は、図表７００のような、複雑度比ｐｅ＿ｒａｔｉｏとビット配分の関係をあらかじめ定めておく。具体的には、複雑度比ｐｅ＿ｒａｔｉｏの値が大きいときには、修正差信号Ｓ’のビット数分配を多くし、複雑度比ｐｅ＿ｒａｔｉｏの値が小さいときには、修正差信号Ｓ’のビット数分配を少なくする。つまり、修正差信号Ｓ’の複雑度の大きい帯域に多くのビット数を分配するような曲線７０１を設定しておく。 FIG. 7 is a chart showing the relationship between PE ratio and bit distribution. In the chart 700, the horizontal axis represents the complexity ratio pe_ratio, and the vertical axis represents the bit allocation amount of the correction difference signal S '. A curve 701 indicates the relationship between the complexity ratio pe_ratio and bit allocation. The bit allocation determining unit 407 previously determines the relationship between the complexity ratio pe_ratio and the bit allocation as shown in the chart 700. Specifically, when the value of the complexity ratio pe_ratio is large, the bit number distribution of the correction difference signal S ′ is increased. When the value of the complexity ratio pe_ratio is small, the bit number distribution of the correction difference signal S ′ is decreased. . That is, a curve 701 is set so that a large number of bits are distributed to a band with a high complexity of the correction difference signal S ′.

和信号Ｍのビット数は、ステップＳ５２９によって決定された修正差信号Ｓ’（ｆ）のビット数の分配に基づいて決定される。具体的には、１フレーム当たりの量子化ビット数をｂｉｔ＿ｔｏｔａｌとすると、図７の曲線７０１によって修正差信号Ｓ’のビット数ｂｉｔ＿ｓを求めｂｉｔ＿ｔｏｔａｌから修正差信号Ｓ’のビット数ｂｉｔ＿ｓを引き、和信号Ｍのビット数ｂｉｔ＿ｍを求める（ｂｉｔ＿ｍ＝ｂｉｔ＿ｔｏｔａｌ−ｂｉｔ＿ｓ）。 The number of bits of the sum signal M is determined based on the distribution of the number of bits of the modified difference signal S ′ (f) determined in step S529. Specifically, assuming that the number of quantization bits per frame is bit_total, the number of bits bit_s of the modified difference signal S ′ is obtained from the curve 701 in FIG. 7, and the number of bits bit_s of the modified difference signal S ′ is subtracted from the bit_total. The number of bits bit_m of the signal M is obtained (bit_m = bit_total−bit_s).

以上のようにして求めたビット数に応じて、和信号量子化器４０８ではビット数ｂｉｔ＿ｍで和信号Ｍ（ｆ）の量子化を行い（ステップＳ５３１）、差信号量子化器４０９ではビット数ｂｉｔ＿ｓで修正差信号Ｓ’（ｆ）の量子化を行い（ステップＳ５３２）、一連の処理を終了する。 In accordance with the number of bits obtained as described above, the sum signal quantizer 408 quantizes the sum signal M (f) with the bit number bit_m (step S531), and the difference signal quantizer 409 performs the bit number bit_s. Then, the corrected difference signal S ′ (f) is quantized (step S532), and the series of processes is terminated.

（実施の形態２）
実施の形態２は、複雑度計算部８１０における複雑度の計算の際に実施の形態１と異なる方法を用いる。また、ビット割り当て決定部４０７におけるビット割り当ての際には、ＰＥ値の重み係数に応じてビット数の分配を行う。 (Embodiment 2)
In the second embodiment, a method different from that in the first embodiment is used when the complexity calculation unit 810 calculates the complexity. Further, when bit allocation is performed by the bit allocation determination unit 407, the number of bits is distributed according to the weight coefficient of the PE value.

図８−１は、実施の形態２にかかる符号化装置の構成を示すブロック図である。実施の形態２にかかる符号化装置８００は、実施の形態１の符号化装置５００と同じ構成によって符号化を行うが、複雑度計算部８１０の処理内容が異なり、それに伴いビット割り当て決定部４０７におけるビット割り当て方法にも変化が生じる。したがって、符号化装置８００の特徴となる、ＰＥ値計算部５０５と、ＰＥ比計算部５０６と、ビット割り当て決定部４０７とについて詳しく説明する。また、他の構成は、符号化装置５００と同じであるため、同一の符号を付して説明を省略する。 FIG. 8A is a block diagram of the configuration of the encoding apparatus according to the second embodiment. The encoding apparatus 800 according to the second embodiment performs encoding with the same configuration as the encoding apparatus 500 of the first embodiment, but the processing content of the complexity calculation unit 810 is different, and accordingly the bit allocation determination unit 407 There is also a change in the bit allocation method. Therefore, the PE value calculation unit 505, the PE ratio calculation unit 506, and the bit allocation determination unit 407, which are features of the encoding apparatus 800, will be described in detail. In addition, since the other configuration is the same as that of the encoding apparatus 500, the same reference numerals are given and description thereof is omitted.

図８−２は、実施の形態２にかかる符号化装置の符号化処理の手順を示すフローチャートである。図８−２のフローチャートにおいて、ステップＳ８２１〜ステップＳ８２４は、図５−２に示したフローチャートのステップＳ５２１〜ステップＳ５２４と同様の処理を行う。つぎに、複雑度計算部８１０における処理を説明する。 FIG. 8-2 is a flowchart of an encoding process performed by the encoding apparatus according to the second embodiment. In the flowchart of FIG. 8B, steps S821 to S824 perform the same processes as steps S521 to S524 of the flowchart shown in FIG. Next, processing in the complexity calculator 810 will be described.

許容量誤差計算部５０３における許容量誤差計算（ステップＳ８２５）と、電力計算部５０４における電力計算（ステップＳ８２６）は、図５−２に示したフローチャートのステップＳ５２５、ステップＳ５２６と同様の処理を行う。つぎに、ＰＥ値計算部５０５によってＰＥ値計算を行う（ステップＳ８２７）。ここで、ＰＥ値計算部５０５には、許容誤差計算部５０３から和信号Ｍの許容誤差電力ｎ＿ｍと、修正差信号Ｓ’の許容誤差電力ｎ＿ｓが入力され、電力計算部５０４から、和信号Ｍの電力ｅ＿ｍと、修正差信号Ｓ’の電力ｅ＿ｓとが入力される。 The tolerance error calculation (step S825) in the tolerance error calculation unit 503 and the power calculation (step S826) in the power calculation unit 504 perform the same processing as steps S525 and S526 in the flowchart shown in FIG. . Next, the PE value calculation unit 505 performs PE value calculation (step S827). Here, the allowable error power n_m of the sum signal M and the allowable error power n_s of the corrected difference signal S ′ are input from the allowable error calculator 503 to the PE value calculator 505, and the sum signal M is input from the power calculator 504. Power e_m and the power e_s of the corrected difference signal S ′ are input.

そして、実施の形態２のＰＥ値計算部５０５は、下記（７）式を用いて、和信号Ｍの許容誤差電力ｎ＿ｍと、和信号Ｍの電力ｅ＿ｍとから和信号Ｍの複雑度ＰＥ＿ｍ（ｉ）を求める。同様に、実施の形態２のＰＥ値計算部５０５は、下記（８）式を用いて修正差信号Ｓ’の許容誤差電力ｎ＿ｓと、修正差信号Ｓ’の電力ｅ＿ｓとから修正差信号Ｓ’の複雑度ＰＥ＿ｓ（ｉ）を求める。 Then, the PE value calculation unit 505 of the second embodiment uses the following equation (7) to calculate the complexity PE_m (i of the sum signal M from the allowable error power n_m of the sum signal M and the power e_m of the sum signal M. ) Similarly, the PE value calculation unit 505 of the second embodiment uses the following equation (8) to calculate the corrected difference signal S ′ from the allowable error power n_s of the corrected difference signal S ′ and the power e_s of the corrected difference signal S ′. The complexity PE_s (i) is obtained.

つぎに、ＰＥ比計算部５０６によって、ＰＥ比計算を行う（ステップＳ８２８）。ＰＥ比計算部５０６には、ＰＥ値計算部５０５から和信号Ｍの複雑度ＰＥ＿ｍ（ｉ）と、修正差信号Ｓ’の複雑度ＰＥ＿ｓ（ｉ）が入力される。そして、ＰＥ比計算部５０６は、和信号Ｍの複雑度ＰＥ＿ｍに対する修正差信号Ｓ’の複雑度ＰＥ＿ｓの割合を下記（９）式によって求め、複雑度の比（ＰＥ比）をｐｅ＿ｒａｔｉｏとしてビット割り当て決定部４０７へ出力する。さらに、ＰＥ値計算部５０５によって求められた和信号Ｍの複雑度ＰＥ＿ｍ（ｉ）をビット割り当て決定部４０７へ出力する。ここまでのステップにより、複雑度計算部８１０の処理が終了する。 Next, the PE ratio calculation unit 506 performs PE ratio calculation (step S828). The PE ratio calculation unit 506 receives the complexity PE_m (i) of the sum signal M and the complexity PE_s (i) of the correction difference signal S ′ from the PE value calculation unit 505. Then, the PE ratio calculation unit 506 obtains the ratio of the complexity PE_s of the modified difference signal S ′ to the complexity PE_m of the sum signal M by the following equation (9), and assigns the bit as the complexity ratio (PE ratio) as pe_ratio. The data is output to the determination unit 407. Further, the complexity PE_m (i) of the sum signal M obtained by the PE value calculation unit 505 is output to the bit allocation determination unit 407. With the steps so far, the processing of the complexity calculation unit 810 ends.

続いて、ビット割り当て決定部４０７における処理について説明する。まず、修正差信号Ｓ’（ｆ）の総ビット数を決定し（ステップＳ８２９）、続いて、和信号Ｍ（ｆ）の総ビット数を決定する（ステップＳ８３０）。修正差信号Ｓ’（ｆ）の総ビット数を決定する具体的な手順としては、実施の形態１と同様に、まず、複雑度ＰＥ＿ｒａｔｉｏに応じて、修正差信号Ｓ’（ｆ）の量子化ビット数ｂｉｔ＿ｓをあらかじめ定めておく。つぎに、１フレームで使用できる量子化ビット数ｂｉｔ＿ｔｏｔａｌからｂｉｔ＿ｓを引いた残りを和信号Ｍの量子化ビット数ｂｉｔ＿ｍとする。ここで、和信号Ｍの各周波数帯域に分配するビット数の上限を決定しておく。 Next, processing in the bit allocation determination unit 407 will be described. First, the total number of bits of the modified difference signal S ′ (f) is determined (step S829), and then the total number of bits of the sum signal M (f) is determined (step S830). As a specific procedure for determining the total number of bits of the corrected difference signal S ′ (f), as in the first embodiment, first, the quantization of the corrected difference signal S ′ (f) is performed according to the complexity PE_ratio. The number of bits bit_s is determined in advance. Next, the remainder obtained by subtracting bit_s from the number of quantization bits bit_total usable in one frame is set as the number of quantization bits bit_m of the sum signal M. Here, the upper limit of the number of bits distributed to each frequency band of the sum signal M is determined.

続いて、重み係数ｗ＿ｍ（ｉ）を決定する（ステップＳ８３１）。図９は、複雑度ＰＥ＿ｍと重み係数ｗ＿ｍの関係を示す図表である。図表９００は、横軸が複雑度ＰＥ＿ｍ（ｉ）を表し、縦軸が重み係数ｗ＿ｍ（ｉ）を表す。また、曲線９０１は、複雑度ＰＥ＿ｍと重み係数ｗ＿ｍの関係を示している。和信号Ｍの各周波数帯域に分配するビット数の上限を決定するには、曲線９０１のような関係をあらかじめ定めておく。各周波数帯域ｉについて複雑度ＰＥ＿ｍ（ｉ）の値と図表９００の関係から重み係数ｗ＿ｍ（ｉ）を決定する。 Subsequently, a weight coefficient w_m (i) is determined (step S831). FIG. 9 is a chart showing the relationship between the complexity PE_m and the weighting factor w_m. In the chart 900, the horizontal axis represents the complexity PE_m (i), and the vertical axis represents the weighting factor w_m (i). A curve 901 indicates the relationship between the complexity PE_m and the weight coefficient w_m. In order to determine the upper limit of the number of bits distributed to each frequency band of the sum signal M, a relationship like a curve 901 is determined in advance. For each frequency band i, the weighting factor w_m (i) is determined from the relationship between the value of the complexity PE_m (i) and the chart 900.

つぎに、重み係数の総和ｓｕｍ＿ｗの算出を行う（ステップＳ８３２）重み係数ｗ＿ｍ（ｉ）の総和ｓｕｍ＿ｗは、下記（１０）式を用いて求める。さらに、重み係数の修正を行うために、下記（１１）式を用いて重み係数ｗ＿ｍ（ｉ）を正規化（ｗ＿ｍ２（ｉ））する。なお、総和で正規化するため、ｗ＿ｍ２の総和は１になる。 Next, the summation sum_w of the weighting coefficients is calculated (step S832). The summation sum_w of the weighting coefficient w_m (i) is obtained using the following equation (10). Further, in order to correct the weighting factor, the weighting factor w_m (i) is normalized (w_m2 (i)) using the following equation (11). Since the sum is normalized by the sum, the sum of w_m2 is 1.

その後、和信号Ｍの各周波数帯域に分配するビット数の上限ｂｉｔ＿ｍ（ｉ）を、下記（１２）式を用いて決定し、ビット割り当て決定部４０７の処理を終了する。 Thereafter, the upper limit bit_m (i) of the number of bits distributed to each frequency band of the sum signal M is determined using the following equation (12), and the processing of the bit allocation determination unit 407 is terminated.

以上のようにして求めたビット数に応じて、和信号量子化器４０８ではビット数ｂｉｔ＿ｍで和信号Ｍ（ｆ）の量子化を行い（ステップＳ８３４）、差信号量子化器４０９ではビット数ｂｉｔ＿ｓで修正差信号Ｓ’（ｆ）の量子化を行い（ステップＳ８３５）、一連の処理を終了する。 In accordance with the number of bits obtained as described above, the sum signal quantizer 408 quantizes the sum signal M (f) with the bit number bit_m (step S834), and the difference signal quantizer 409 performs the bit number bit_s. Then, the corrected difference signal S ′ (f) is quantized (step S835), and the series of processing ends.

（実施の形態３）
実施の形態３は、和信号Ｍ（ｆ）と、修正差信号Ｓ’（ｆ）との電力の比に基づいて和信号Ｍ（ｆ）と、修正差信号Ｓ’（ｆ）とのビット数の分配の割合を決定する。したがって、実施の形態３にかかる符号化装置１０００は、実施の形態１において説明した符号化装置５００の複雑度計算部５１０を簡易化した複雑度計算部１０１０を備えた構成からなる。 (Embodiment 3)
In the third embodiment, the number of bits of the sum signal M (f) and the corrected difference signal S ′ (f) based on the power ratio of the sum signal M (f) and the corrected difference signal S ′ (f). Determine the proportion of distribution. Therefore, the encoding apparatus 1000 according to the third embodiment has a configuration including a complexity calculation unit 1010 obtained by simplifying the complexity calculation unit 510 of the encoding apparatus 500 described in the first embodiment.

図１０−１は、実施の形態３にかかる符号化装置の構成を示すブロック図である。図１０−１に示した符号化装置１０００は、図５−１に示した符号化装置５００の複雑度計算部５１０に代わり、複雑度計算部１０１０を備えている。この複雑度計算部１０１０は、電力計算部５０４と、電力比計算部１００１とによって構成される。なお、符号化装置１０００の他の構成は、符号化装置５００と同じであるため、同一の符号を付して説明を省略する。また、ビット割り当て決定部４０７は、複雑度計算部１０１０によって計算された複雑度に応じてビット割り当てを決定する。 FIG. 10A is a block diagram of the configuration of the encoding apparatus according to the third embodiment. The encoding apparatus 1000 illustrated in FIG. 10A includes a complexity calculation unit 1010 instead of the complexity calculation unit 510 of the encoding apparatus 500 illustrated in FIG. The complexity calculation unit 1010 includes a power calculation unit 504 and a power ratio calculation unit 1001. Since the other configuration of the encoding apparatus 1000 is the same as that of the encoding apparatus 500, the same reference numerals are given and description thereof is omitted. Also, the bit allocation determination unit 407 determines bit allocation according to the complexity calculated by the complexity calculation unit 1010.

続いて、実施の形態３にかかる符号化装置１０００の符号化処理の手順を説明する。図１０−２は、実施の形態３にかかる符号化装置の符号化処理の手順を示すフローチャートである。図１０−２のフローチャートにおいて、まず、ＭＤＣＴ５０１およびＭＤＣＴ５０２において、左右のステレオ信号Ｌ（ｔ），Ｒ（ｔ）のＭＤＣＴ変換を行う（ステップＳ１０２１）。 Subsequently, the procedure of the encoding process of the encoding apparatus 1000 according to the third embodiment will be described. FIG. 10-2 is a flowchart of an encoding process performed by the encoding apparatus according to the third embodiment. 10-2, first, MDCT conversion of the left and right stereo signals L (t) and R (t) is performed in the MDCT 501 and the MDCT 502 (step S1021).

続いて、ＭＳステレオ変換部４０３によって、左右のスペクトル信号Ｌ（ｆ），Ｒ（ｆ）にＭＳステレオ変換を行う（ステップＳ１０２２）。また、類似度計算部４０４においては、左右のステレオ信号Ｌ（ｔ），Ｒ（ｔ）の相関に基づいて和信号Ｍ（ｆ）と差信号Ｓ（ｆ）との類似度（相関ｃｏｒ（ｉ））を計算する（ステップＳ１０２３）。そして、ステップＳ１０２１において計算された類似度（相関ｃｏｒ（ｉ））に基づいて、差信号修正部４０５によって差信号Ｓ（ｆ）を修正する（ステップＳ１０２４）。 Subsequently, the MS stereo conversion unit 403 performs MS stereo conversion on the left and right spectrum signals L (f) and R (f) (step S1022). Further, the similarity calculation unit 404 determines the similarity (correlation cor (i)) between the sum signal M (f) and the difference signal S (f) based on the correlation between the left and right stereo signals L (t) and R (t). )) Is calculated (step S1023). Then, based on the similarity (correlation cor (i)) calculated in step S1021, the difference signal correcting unit 405 corrects the difference signal S (f) (step S1024).

つぎに、複雑度計算部１０１０における処理について説明する。まず、電力計算部５０４によって、和信号Ｍ（ｆ）と修正差信号Ｓ’（ｆ）との電力計算を行う（ステップＳ１０２５）。電力計算部５０４で計算された和信号Ｍの電力ｅ＿ｍと、修正差信号Ｓ’の電力ｅ＿ｓとは、電力比計算部１００１に出力される。なお、複雑度計算部１０１０は、上述したような電力比の計算に替わって、電力差を求め、ビット割り当て決定部４０７に出力してもよい。さらに、電力比、または電力差を求める際には、各信号の全周波数帯域の電力の合計や平均を用いてもよい。 Next, processing in the complexity calculator 1010 will be described. First, the power calculator 504 calculates the power of the sum signal M (f) and the corrected difference signal S ′ (f) (step S1025). The power e_m of the sum signal M calculated by the power calculation unit 504 and the power e_s of the correction difference signal S ′ are output to the power ratio calculation unit 1001. Note that the complexity calculation unit 1010 may obtain a power difference instead of calculating the power ratio as described above, and output the power difference to the bit allocation determination unit 407. Furthermore, when calculating | requiring a power ratio or a power difference, you may use the sum total and average of the electric power of all the frequency bands of each signal.

続いて、電力比計算部１００１によって、和信号Ｍの電力ｅ＿ｍと、修正差信号Ｓ’の電力ｅ＿ｓとの電力比を計算する（ステップＳ１０２６）。和信号Ｍと、修正差信号Ｓ’との電力比ｐｏｗ＿ｒａｔｉｏは、ｅ＿ｓ／ｅ＿ｍによって求められる。そして、求められた電力比ｐｏｗ＿ｒａｔｉｏは、ビット割り当て決定部４０７に出力される。 Subsequently, the power ratio calculation unit 1001 calculates the power ratio between the power e_m of the sum signal M and the power e_s of the correction difference signal S ′ (step S1026). The power ratio pow_ratio between the sum signal M and the corrected difference signal S ′ is obtained by e_s / e_m. Then, the obtained power ratio pow_ratio is output to the bit allocation determination unit 407.

つぎに、ビット割り当て決定部４０７における処理について説明する。まず、修正差信号Ｓ’（ｆ）の総ビット数を決定し（ステップＳ１０２７）、続いて、和信号Ｍ（ｆ）の総ビット数を決定する（ステップＳ１０２８）。修正差信号Ｓ’（ｆ）の総ビット数を決定する具体的な手順としては、電力比ｐｏｗ＿ｒａｔｉｏビット数と修正差信号Ｓ’（ｆ）とのビット配分の関係をあらかじめ定めておく手順がある。 Next, processing in the bit allocation determination unit 407 will be described. First, the total number of bits of the modified difference signal S ′ (f) is determined (step S1027), and then the total number of bits of the sum signal M (f) is determined (step S1028). As a specific procedure for determining the total number of bits of the correction difference signal S ′ (f), there is a procedure for predetermining the bit distribution relationship between the power ratio pow_ratio bit number and the correction difference signal S ′ (f). .

図１１は、電力比ｐｏｗ＿ｒａｔｉｏとビット配分の関係の一例を示す図表である。図表１１００は、横軸が電力比ｐｏｗ＿ｒａｔｉｏを表し、縦軸が修正差信号Ｓ’のビット配分量を表す。また、曲線１１０１は、電力比ｐｏｗ＿ｒａｔｉｏとビット配分の関係を示している。ビット割り当て決定部４０７は、図表１１００のような、電力比ｐｏｗ＿ｒａｔｉｏとビット配分の関係をあらかじめ定めておく。具体的には、電力比ｐｏｗ＿ｒａｔｉｏの値が大きいときには、修正差信号Ｓ’のビット数分配を多くし、電力比ｐｏｗ＿ｒａｔｉｏの値が小さいときには、修正差信号Ｓ’のビット数分配を少なくする。つまり、修正差信号Ｓ’の電力の大きい帯域に多くのビット数を分配するような、曲線１１０１を設定しておく。 FIG. 11 is a chart showing an example of the relationship between the power ratio pow_ratio and bit allocation. In the chart 1100, the horizontal axis represents the power ratio pow_ratio, and the vertical axis represents the bit allocation amount of the correction difference signal S '. A curve 1101 shows the relationship between the power ratio pow_ratio and bit allocation. The bit allocation determining unit 407 previously determines the relationship between the power ratio pow_ratio and the bit allocation as shown in the chart 1100. Specifically, when the value of the power ratio pow_ratio is large, the bit number distribution of the correction difference signal S ′ is increased, and when the value of the power ratio pow_ratio is small, the bit number distribution of the correction difference signal S ′ is decreased. That is, the curve 1101 is set so that a large number of bits are distributed to the band where the power of the correction difference signal S ′ is large.

和信号Ｍのビット数は、ステップＳ１０２７によって決定された修正差信号Ｓ’（ｆ）のビット数の分配に基づいて決定される。具体的には、１フレーム当たりの量子化ビット数をｂｉｔ＿ｔｏｔａｌとすると、図１１の曲線１１０１によって修正差信号Ｓ’のビット数ｂｉｔ＿ｓを求め、ｂｉｔ＿ｔｏｔａｌから修正差信号Ｓ’のビット数ｂｉｔ＿ｓを引き、和信号Ｍのビット数ｂｉｔ＿ｍを求める（ｂｉｔ＿ｍ＝ｂｉｔ＿ｔｏｔａｌ−ｂｉｔ＿ｓ）。 The number of bits of the sum signal M is determined based on the distribution of the number of bits of the modified difference signal S ′ (f) determined in step S1027. Specifically, assuming that the number of quantization bits per frame is bit_total, the number of bits bit_s of the modified difference signal S ′ is obtained from the curve 1101 in FIG. 11, and the number of bits bit_s of the modified difference signal S ′ is subtracted from bit_total. The number of bits bit_m of the sum signal M is obtained (bit_m = bit_total−bit_s).

以上のようにして求めたビット数に応じて、和信号量子化器４０８ではビット数ｂｉｔ＿ｍで和信号Ｍ（ｆ）の量子化を行い（ステップＳ１０２９）、差信号量子化器４０９ではビット数ｂｉｔ＿ｓで修正差信号Ｓ’（ｆ）の量子化を行い（ステップＳ１０３０）、一連の処理を終了する。 In accordance with the number of bits obtained as described above, the sum signal quantizer 408 quantizes the sum signal M (f) with the bit number bit_m (step S1029), and the difference signal quantizer 409 performs the bit number bit_s. Then, the corrected difference signal S ′ (f) is quantized (step S1030), and the series of processes is terminated.

以上説明したように、符号化装置、符号化方法、および符号化プログラムによれば、低ビットレート条件であっても音質劣化の少ない、高音質な音声（音楽）として再生することができる。 As described above, according to the encoding device, the encoding method, and the encoding program, it is possible to reproduce high-quality sound (music) with little deterioration in sound quality even under a low bit rate condition.

なお、本実施の形態１〜３で説明した符号化方法は、あらかじめ用意されたプログラムをパーソナル・コンピュータやワークステーションなどのコンピュータで実行することにより実現することができる。このプログラムは、ハードディスク、フレキシブルディスク、ＣＤ−ＲＯＭ、ＭＯ、ＤＶＤなどのコンピュータで読み取り可能な記録媒体に記録され、コンピュータによって記録媒体から読み出されることによって実行される。またこのプログラムは、インターネットなどのネットワークを介して配布することが可能な伝送媒体であってもよい。 Note that the encoding methods described in the first to third embodiments can be realized by executing a program prepared in advance on a computer such as a personal computer or a workstation. This program is recorded on a computer-readable recording medium such as a hard disk, a flexible disk, a CD-ROM, an MO, and a DVD, and is executed by being read from the recording medium by the computer. The program may be a transmission medium that can be distributed via a network such as the Internet.

（付記１）ステレオ信号の左成分信号と右成分信号との和信号と、差信号とを用いてステレオ信号を圧縮する符号化装置において、
前記和信号の複雑度と、前記差信号の複雑度とをそれぞれ求める複雑度算出手段と、
前記複雑度算出手段によって求めた複雑度に応じて前記和信号と、前記差信号とをそれぞれ量子化する際のビット数の分配割合を設定するビット数設定手段と、
前記ビット数設定手段によって決定された前記分配割合に応じて前記和信号と、前記差信号をそれぞれ量子化する量子化手段と、
を備えることを特徴とする符号化装置。 (Supplementary note 1) In an encoding apparatus that compresses a stereo signal using a sum signal of a left component signal and a right component signal of a stereo signal and a difference signal,
Complexity calculation means for respectively determining the complexity of the sum signal and the complexity of the difference signal;
A bit number setting means for setting a distribution ratio of the number of bits when each of the sum signal and the difference signal is quantized according to the complexity obtained by the complexity calculating means;
Quantization means for quantizing each of the sum signal and the difference signal according to the distribution ratio determined by the bit number setting means,
An encoding device comprising:

（付記２）前記複雑度算出手段の前段にモノラル化手段を備え、
前記モノラル化手段は、周波数帯ごとに前記差信号の出力を所定の閾値と比較する比較手段と、
前記比較手段による比較結果における前記差信号が前記閾値よりも低い場合には、前記差信号の値を零に修正する修正手段と、
を備えることを特徴とする付記１に記載の符号化装置。 (Supplementary Note 2) A monaural unit is provided before the complexity calculating unit.
The monaural unit includes a comparing unit that compares the output of the difference signal with a predetermined threshold value for each frequency band;
A correction means for correcting the value of the difference signal to zero when the difference signal in the comparison result by the comparison means is lower than the threshold;
The encoding apparatus according to Supplementary Note 1, further comprising:

（付記３）前記ビット数設定手段は、所定のビット数を、所定の間隔で時分割されたフレームごとの前記和信号および前記差信号に分配することを特徴とする付記１または２に記載の符号化装置。 (Supplementary note 3) The supplementary note 1 or 2, wherein the bit number setting means distributes a predetermined number of bits to the sum signal and the difference signal for each frame time-divided at a predetermined interval. Encoding device.

（付記４）前記ビット数設定手段は、前記量子化手段によって量子化を行う際に、複雑度の低い信号にはビット数の分配割合を低くし、複雑度の高い信号にはビット数の分配割合を高くすることを特徴とする付記１〜３のいずれか一つに記載の符号化装置。 (Supplementary Note 4) When the quantization is performed by the quantization unit, the bit number setting unit reduces the bit number distribution ratio for a low complexity signal and distributes the bit number for a high complexity signal. The encoding device according to any one of appendices 1 to 3, wherein the ratio is increased.

（付記５）前記複雑度算出手段は、前記和信号と、前記差信号とのそれぞれの心理聴覚エントロピー値（ＰＥ値）を求め、前記和信号および前記差信号のＰＥ値の比もしくは差を複雑度とすることを特徴とする付記１〜４のいずれか一つに記載の符号化装置。 (Additional remark 5) The said complexity calculation means calculates | requires each psychoacoustic entropy value (PE value) of the said sum signal and the said difference signal, Complicates the ratio or difference of the PE value of the said sum signal and the said difference signal. The encoding device according to any one of supplementary notes 1 to 4, wherein the encoding device is a degree.

（付記６）前記複雑度算出手段は、前記和信号および前記差信号の全周波数帯域のＰＥ値の平均もしくは合計から前記複雑度を求めることを特徴とする付記５に記載の符号化装置。 (Additional remark 6) The said complexity calculation means calculates | requires the said complexity from the average or the sum total of PE value of all the frequency bands of the said sum signal and the said difference signal, The encoding apparatus of Additional remark 5 characterized by the above-mentioned.

（付記７）前記複雑度算出手段は、前記和信号と、前記差信号とのそれぞれの電力値を求め、前記和信号と、前記差信号との電力値の比もしくは差を複雑度とすることを特徴とする付記１〜４のいずれか一つに記載の符号化装置。 (Supplementary note 7) The complexity calculation means obtains respective power values of the sum signal and the difference signal, and sets the ratio or difference of the power values of the sum signal and the difference signal as complexity. The encoding device according to any one of appendices 1 to 4, characterized by:

（付記８）前記複雑度算出手段は、前記和信号および前記差信号の全周波数帯域の電力値の平均もしくは合計から前記複雑度を求めることを特徴とする付記７に記載の符号化装置。 (Additional remark 8) The said complexity calculation means calculates | requires the said complexity from the average or the sum total of the electric power value of the all frequency band of the said sum signal and the said difference signal, The encoding apparatus of Additional remark 7 characterized by the above-mentioned.

（付記９）前記ビット数設定手段は、あらかじめ定めた前記差信号の複雑度と前記分配割合との対応関係に応じてビット数の分配割合を設定することを特徴とする付記１〜８のいずれか一つに記載の符号化装置。 (Supplementary note 9) Any one of Supplementary notes 1 to 8, wherein the bit number setting means sets a distribution ratio of the number of bits according to a predetermined correspondence relationship between the complexity of the difference signal and the distribution ratio. The encoding device according to claim 1.

（付記１０）前記ビット数設定手段は、あらかじめ定めた前記和信号の複雑度と前記分配割合との対応関係に応じてビット数の分配割合を設定することを特徴とする付記１〜８のいずれか一つに記載の符号化装置。 (Supplementary note 10) Any one of Supplementary notes 1 to 8, wherein the bit number setting means sets a distribution ratio of the number of bits according to a predetermined correspondence relationship between the complexity of the sum signal and the distribution ratio. The encoding device according to claim 1.

（付記１１）ステレオ信号の左成分信号と右成分信号との和信号と、差信号とを用いてステレオ信号を圧縮する符号化方法において、
前記和信号の複雑度と、前記差信号の複雑度とをそれぞれ求める複雑度算出工程と、
前記複雑度算出工程によって求めた複雑度に応じて前記和信号と、前記差信号とをそれぞれ量子化する際のビット数の分配割合を設定するビット数設定工程と、
前記ビット数設定工程によって決定された前記分配割合に応じて前記和信号と、前記差信号をそれぞれ量子化する量子化工程と、
を含むことを特徴とする符号化方法。 (Supplementary note 11) In an encoding method for compressing a stereo signal using a sum signal of a left component signal and a right component signal of a stereo signal and a difference signal,
A complexity calculation step for determining the complexity of the sum signal and the complexity of the difference signal;
A bit number setting step for setting a distribution ratio of the number of bits when each of the sum signal and the difference signal is quantized according to the complexity obtained by the complexity calculation step;
A quantization step of quantizing the sum signal and the difference signal according to the distribution ratio determined by the bit number setting step;
The encoding method characterized by including.

（付記１２）ステレオ信号の左成分信号と右成分信号との和信号と、差信号とを用いてステレオ信号を圧縮する符号化プログラムにおいて、
前記和信号の複雑度と、前記差信号の複雑度とをそれぞれ求める複雑度算出工程と、
前記複雑度算出工程によって求めた複雑度に応じて前記和信号と、前記差信号とをそれぞれ量子化する際のビット数の分配割合を設定するビット数設定工程と、
前記ビット数設定工程によって決定された前記分配割合に応じて前記和信号と、前記差信号をそれぞれ量子化する量子化工程と、
をコンピュータに実行させることを特徴とする符号化プログラム。 (Additional remark 12) In the encoding program which compresses a stereo signal using the sum signal of the left component signal and right component signal of a stereo signal, and a difference signal,
A complexity calculation step for determining the complexity of the sum signal and the complexity of the difference signal;
A bit number setting step of setting a distribution ratio of the number of bits when each of the sum signal and the difference signal is quantized according to the complexity obtained by the complexity calculation step;
A quantization step of quantizing the sum signal and the difference signal according to the distribution ratio determined by the bit number setting step;
An encoding program for causing a computer to execute.

以上のように、本発明にかかる符号化装置、符号化方法、および符号化プログラムは、ステレオ音声データの圧縮に有用であり、特に、低ビットレートの圧縮条件に適している。 As described above, the encoding apparatus, the encoding method, and the encoding program according to the present invention are useful for compressing stereo audio data, and are particularly suitable for low bit rate compression conditions.

通常のモノラル化を示す説明図である。It is explanatory drawing which shows normal monauralization. 和信号Ｍの複雑度に応じてビット数を振り分ける方法を示す説明図である。It is explanatory drawing which shows the method of distributing bit number according to the complexity of the sum signal M. 差信号Ｓの複雑度に応じてビット数を振り分ける方法を示す説明図である。4 is an explanatory diagram showing a method of distributing the number of bits according to the complexity of the difference signal S. FIG. 本発明にかかる符号化装置の基本構成を示すブロック図である。It is a block diagram which shows the basic composition of the encoding apparatus concerning this invention. 実施の形態１にかかる符号化装置の構成を示すブロック図である。1 is a block diagram showing a configuration of an encoding apparatus according to a first embodiment. 実施の形態１にかかる符号化装置の符号化処理の手順を示すフローチャートである。3 is a flowchart illustrating a procedure of encoding processing of the encoding device according to the first exemplary embodiment; 信号の帯域の上限と下限の関係を示す図表である。It is a graph which shows the relationship between the upper limit and the lower limit of the band of a signal. ＰＥ比とビット分配の関係を示す図表である。It is a graph which shows the relationship between PE ratio and bit distribution. 実施の形態２にかかる符号化装置の構成を示すブロック図である。FIG. 3 is a block diagram illustrating a configuration of an encoding apparatus according to a second embodiment. 実施の形態２にかかる符号化装置の符号化処理の手順を示すフローチャートである。10 is a flowchart illustrating a procedure of encoding processing of the encoding device according to the second exemplary embodiment; 複雑度ＰＥ＿ｍと重み係数ｗ＿ｍの関係を示す図表である。It is a graph which shows the relationship between complexity PE_m and weighting factor w_m. 実施の形態３にかかる符号化装置の構成を示すブロック図である。FIG. 6 is a block diagram illustrating a configuration of an encoding apparatus according to a third embodiment. 実施の形態３にかかる符号化装置の符号化処理の手順を示すフローチャートである。10 is a flowchart illustrating a procedure of encoding processing of the encoding device according to the third exemplary embodiment; 電力比ｐｏｗ＿ｒａｔｉｏとビット配分の関係の一例を示す図表である。It is a graph which shows an example of the relationship between power ratio pow_ratio and bit allocation. ＭＳステレオ符号化の符号化手順を示すブロック図である。It is a block diagram which shows the encoding procedure of MS stereo encoding. 適応モノラル化の原理を示す説明図である。It is explanatory drawing which shows the principle of adaptive monauralization.

Explanation of symbols

４００符号化装置
４０１Ｌ直交変換部
４０２Ｒ直交変換部
４０３ＭＳステレオ変換部
４０４類似度計算部
４０５差信号修正部
４０６複雑度計算部
４０７ビット割り当て決定部
４０８和信号量子化器
４０９差信号量子化器 400 Encoder 401 L Orthogonal Transformer 402 R Orthogonal Transformer 403 MS Stereo Transformer 404 Similarity Calculation Unit 405 Difference Signal Correction Unit 406 Complexity Calculation Unit 407 Bit Allocation Determination Unit 408 Sum Signal Quantizer 409 Difference Signal Quantization vessel

Claims

In an encoding device that compresses a stereo signal using a sum signal of a left component signal and a right component signal of a stereo signal and a difference signal,
Complexity calculation means for respectively determining the complexity of the sum signal and the complexity of the difference signal;
A bit number setting means for setting a distribution ratio of the number of bits when each of the sum signal and the difference signal is quantized according to the complexity obtained by the complexity calculating means;
Quantization means for quantizing each of the sum signal and the difference signal according to the distribution ratio determined by the bit number setting means,
An encoding device comprising:

A monaural unit is provided before the complexity calculating unit,
The monaural unit includes a comparison unit that compares the output of the difference signal with a predetermined threshold for each frequency band;
A correction means for correcting the value of the difference signal to zero when the difference signal in the comparison result by the comparison means is lower than the threshold;
The encoding apparatus according to claim 1, further comprising:

3. The encoding apparatus according to claim 1, wherein the bit number setting unit distributes a predetermined number of bits to the sum signal and the difference signal for each frame time-divided at a predetermined interval. .

When the quantization is performed by the quantization unit, the bit number setting unit lowers the bit number distribution ratio for a low complexity signal and increases the bit number distribution ratio for a high complexity signal. The encoding apparatus according to any one of claims 1 to 3, wherein

The complexity calculation means obtains a psychoacoustic entropy value (PE value) of each of the sum signal and the difference signal, and sets a ratio or difference between the sum signal and the PE value of the difference signal as the complexity. The encoding device according to any one of claims 1 to 4, wherein:

6. The encoding apparatus according to claim 5, wherein the complexity calculation unit obtains the complexity from an average or a sum of PE values in all frequency bands of the sum signal and the difference signal.

The complexity calculating means obtains respective power values of the sum signal and the difference signal, and uses a ratio or difference between the power values of the sum signal and the difference signal as complexity. The encoding apparatus as described in any one of Claims 1-4.

The encoding apparatus according to claim 7, wherein the complexity calculation unit obtains the complexity from an average or a sum of power values in all frequency bands of the sum signal and the difference signal.

In an encoding method for compressing a stereo signal using a sum signal of a left component signal and a right component signal of a stereo signal and a difference signal,
A complexity calculation step for determining the complexity of the sum signal and the complexity of the difference signal;
A bit number setting step for setting a distribution ratio of the number of bits when each of the sum signal and the difference signal is quantized according to the complexity obtained by the complexity calculation step;
A quantization step of quantizing the sum signal and the difference signal according to the distribution ratio determined by the bit number setting step;
The encoding method characterized by including.

In an encoding program for compressing a stereo signal using a sum signal of a left component signal and a right component signal of a stereo signal and a difference signal,
A complexity calculation step for determining the complexity of the sum signal and the complexity of the difference signal;
A bit number setting step of setting a distribution ratio of the number of bits when each of the sum signal and the difference signal is quantized according to the complexity obtained by the complexity calculation step;
A quantization step for quantizing the sum signal and the difference signal according to the distribution ratio determined by the bit number setting step;
An encoding program for causing a computer to execute.