TWI281356B - Device and method for generate a coded multi-channels signal and device and method for decode a coded multi-channels signal and recordable medium - Google Patents

Device and method for generate a coded multi-channels signal and device and method for decode a coded multi-channels signal and recordable medium Download PDF

Info

Publication number
TWI281356B
TWI281356B TW94145265A TW94145265A TWI281356B TW I281356 B TWI281356 B TW I281356B TW 94145265 A TW94145265 A TW 94145265A TW 94145265 A TW94145265 A TW 94145265A TW I281356 B TWI281356 B TW I281356B
Authority
TW
Taiwan
Prior art keywords
parameter set
channel
parameter
information
data stream
Prior art date
Application number
TW94145265A
Other languages
Chinese (zh)
Inventor
Juegen Herre
Ralph Sperschneider
Johannes Hilpert
Karsten Linzmeier
Harald Popp
Original Assignee
Fraunhofer Ges Forschung
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Fraunhofer Ges Forschung filed Critical Fraunhofer Ges Forschung
Application granted granted Critical
Publication of TWI281356B publication Critical patent/TWI281356B/en

Links

Landscapes

  • Compression, Expansion, Code Conversion, And Decoders (AREA)

Abstract

In a multi-channel encoder generating several different parameter sets for reconstructing a multi-channel output signal using at least one transmission channel, the data stream is written such that the two parameter sets are decodable independently of each other. Thus, a multi-channel decoder is enabled to skip a parameter set which is marked as optional and/or has a higher version number when reading the data stream and still to perform a valid multi-channel reconstruction using a data set marked as mandatory or a data set having a sufficiently low version number. This achieves a flexible encoder/decoder concept suitable for future updates characterized by backward compatibility and reliability.

Description

1281356 九、發明說明: 【發明所屬之技術領域】 本發明係關於參數音頻之多聲道處理技術,且尤其是 關於當有數種不同參數集要重建時之參數側資訊的有效佈 置。 【先前技術】1281356 IX. Description of the Invention: TECHNICAL FIELD OF THE INVENTION The present invention relates to multi-channel processing techniques for parametric audio, and more particularly to efficient placement of parameter side information when there are several different sets of parameters to be reconstructed. [Prior Art]

除雙立體聲道外,所建議之多聲道環繞表現包含一個 中央聲道C和兩個環繞聲道,即,左環繞聲道Ls和右環繞 聲道Rs,且另外,如果適用的話,含亦稱爲LFE聲道(LFE = 低頻增強)之超低音聲道。這參照聲音格式亦稱爲3/2 (加 LFE)立體聲且最近亦稱爲5」多聲道,其意爲有三個前聲 道’兩個環繞聲道和一個LFE聲道。這所建議之多聲道環 繞表現通常需要五個或六個傳輸聲道。在一重現環境中, 在各別五個位置至少需五個喇叭,以得到離五個正確擺設 喇叭既定距離之最理想的所謂甜美音點。然而,針對其擺 設位置,可以相當自由之方式使用超低音喇叭。 有數種技術用於降低要傳輸多聲道音頻信號所需之資 料量。這此技術亦稱爲聯合立體聲技術。爲此起見,參照 第5圖。第5圖表示一聯合立體聲裝置6〇。這裝置可執行 ,例如’強度立體聲技術(IS技術)或雙聲道提示編碼(BCC) 。這種裝置通常接收至少兩個聲道(CH1,CH2,…,CHn)作 爲輸入信號並輸出至少一個單載波聲道(下行混合)及參數 資料’即一或多個參數集。將參數資料界定成使得該參數 資料可在解碼器中槪算原始聲道(CH1,CH2,…,CHn)。 1281356 正常說來,該載波聲道會包含次頻帶取樣,頻譜係數 或時域取樣等,其提供相當精緻表現之基本信號,而參數 資料及/或參數集不包含任何這種取樣或頻譜係數。取而代 之的是,參數資料包含控制參數’用於控制如藉倍增、時 間位移、頻率位移.....加以權重之既定重建演算法。參In addition to the dual stereo channels, the proposed multi-channel surround performance consists of a center channel C and two surround channels, namely the left surround channel Ls and the right surround channel Rs, and, if applicable, The subwoofer channel for the LFE channel (LFE = low frequency boost). This reference sound format is also known as 3/2 (plus LFE) stereo and has recently been referred to as 5" multi-channel, which means that there are three front channels 'two surround channels and one LFE channel. This proposed multi-channel surround performance typically requires five or six transmission channels. In a reproducible environment, at least five horns are required at each of the five locations to obtain the ideal so-called sweet point from the five correct horns. However, the subwoofer can be used in a fairly free manner for its position. There are several techniques for reducing the amount of data needed to transmit multi-channel audio signals. This technology is also known as joint stereo technology. For this reason, refer to Figure 5. Figure 5 shows a joint stereo device 6〇. This device can be implemented, for example, 'Intensity Stereo Technology (IS Technology) or Two-Channel Cue Code (BCC). Such devices typically receive at least two channels (CH1, CH2, ..., CHn) as input signals and output at least one single carrier channel (downstream mixing) and parameter data', i.e., one or more parameter sets. The parameter data is defined such that the parameter data can be used to calculate the original channels (CH1, CH2, ..., CHn) in the decoder. 1281356 Normally, the carrier channel will contain sub-band samples, spectral coefficients or time-domain samples, etc., which provide a fairly sophisticated representation of the underlying signal, while the parameter data and/or parameter set does not contain any such samples or spectral coefficients. Instead, the parameter data contains control parameters 'for controlling the established reconstruction algorithm such as multiplication, time shift, frequency shift, etc. with weights. Reference

數資料因此僅包含相當粗略表示之信號或關聯聲道。以數 字表示,載波聲道所需之資料量範圍爲60至70kbit/s,而 參數側資訊所需資料量大小爲一個聲道起自1 Jkbit/s。如 以下之說明,參數資料之實例爲已知之縮放比例因子、強 度立體聲資訊或雙聲道提示參數。 強度立體聲編碼技術是說明在1 994年2月,阿姆斯特 丹(AMSTERDAM)、由赫利(J. HERRE)、布蘭登堡(K.h. BRANDENBURG)、利德勒(D· LEDERER)所著標題爲”強度立 體聲編碼’’之AES待出版品3 799號中。通常,強度立體聲 之槪念是根據被應用到兩個立體聲音頻聲道之資料的主軸 轉換。如果將大半資料點圍繞第一主軸擺設,可以藉由旋 轉編碼前之該兩信號一既定角度以達成編碼增益。然而, 這不永遠適用於真正之立體聲重現技術。左右聲道之重建 信號包含相異權重或縮放比例版本之相同傳輸信號。雖然 如此,重建信號之振幅相異,但針對其相位資訊而言,他 們相同。然而,兩原始音頻聲道之能量時間包線的維持是 藉由向來以選頻方式加以操作之選用性縮放比例操作。這 等於高頻下之人聲感覺,其中,主要之空間提示是由能量 包線所決定。 1281356 此外,在實際執行中,所傳輸信號,即載波聲道,是 由左右聲道之加總信號而非旋轉兩成分而形成。而且,這 種處理,即產生強度立體聲參數以實施縮放比例操作,是 以選頻方式加以實施,亦即,與各比例因子頻帶,即各編 碼器頻率隔區彼此無關。較佳的方式是,結合兩個聲道形 成一結合或’’載波”聲道。除該結合聲道外,強度立體聲資 訊是其依第一聲道能量、第二聲道能量及結合或加總聲道 能量而定。The data therefore contains only signals that are fairly coarsely represented or associated channels. In digital terms, the amount of data required for the carrier channel ranges from 60 to 70 kbit/s, while the amount of data required for parameter side information is from 1 Jkbit/s for one channel. As explained below, examples of parameter data are known scaling factors, intensity stereo information, or two-channel cue parameters. The intensity stereo coding technique is described in February 1994, in Amsterdam (AMSTERDAM), by J. HERRE, Kh BRANDENBURG, and D. LEDERER. Stereo encoding ''AES to be published in No. 3, 799. Usually, the intensity stereo is based on the spindle conversion applied to the data of two stereo audio channels. If you put most of the data points around the first spindle, you can The encoding gain is achieved by rotating the two signals at a given angle before encoding. However, this does not always apply to true stereo reproduction techniques. The reconstructed signals of the left and right channels contain identical transmission signals of different weights or scaled versions. Nonetheless, the amplitudes of the reconstructed signals are different, but they are the same for their phase information. However, the energy time envelope of the two original audio channels is maintained by the selective scaling of the frequency-selective operation. Operation. This is equal to the vocal sensation at high frequencies, where the main spatial cues are determined by the energy envelope. 1281356 In addition, In execution, the transmitted signal, ie the carrier channel, is formed by the summed signal of the left and right channels instead of the two components of rotation. Moreover, this processing, that is, generating the intensity stereo parameter to perform the scaling operation, is to select the frequency. The manner is implemented, that is, independent of the scale factor bands, i.e., the encoder frequency partitions. Preferably, a combined or ''carrier' channel is formed in combination with the two channels. In addition to the combined channel, the intensity stereo signal is based on the first channel energy, the second channel energy, and the combined or added channel energy.

BCC是說明在2002年5月,慕尼黑(MUNCHEN)、由 福勒(C. FALLER)、邦迦第(F· BAUMGARTE)所著標題爲 ”應用到立體聲及多聲道音頻壓縮之雙聲道提示編碼”之 AES會議文件5 5 74號中。在BCC編碼中,使用具重疊視 窗之DFT爲基準的轉換將許多音頻輸入聲道轉換成頻譜表 示。將該形成之頻譜分成非重疊隔區。各隔區之頻寬與等 效直角頻寬(ERB)成正比。針對各隔區,即各頻帶及各時框 K,即時間取樣區加以計算所謂的相互聲道位準差(IC LD) 以及所謂的相互聲道時間差(ICTD)。將ICLD和ICDT參數 量化且加以編碼得到BCC位元流。各聲道相對於一參照聲 道有相互聲道位準差和相互聲道時間差。尤其是,依要處 理之特別信號分區而定,根據預定之公式加以計算參\數。 在解碼器端,解碼器接收一單音信號和BCC··位元流, 即相互聲道時間差之第一參數集及相互聲道位準差之第二 參數集。將單音信號轉換成頻域並予輸入至亦接收已解碼 ICLD和ICTD値之合成區。在該合成區或重建區中,使用BCC is a description of the two-channel prompt titled "Applying to Stereo and Multi-Channel Audio Compression" by MUNCHEN, C. FALLER, F. BAUMGARTE in May 2002. Coded AES Conference Paper 5 5 74. In BCC coding, a number of audio input channels are converted to a spectral representation using a DFT-based conversion with overlapping views. The formed spectrum is divided into non-overlapping compartments. The bandwidth of each partition is proportional to the equivalent right-angle bandwidth (ERB). The so-called mutual channel level difference (IC LD) and the so-called mutual channel time difference (ICTD) are calculated for each partition, i.e., each frequency band and time frame K, i.e., the time sampling area. The ICLD and ICDT parameters are quantized and encoded to obtain a BCC bit stream. Each channel has a mutual channel level difference and a mutual channel time difference with respect to a reference channel. In particular, depending on the particular signal partition to be processed, the parameters are calculated according to a predetermined formula. At the decoder end, the decoder receives a single tone signal and a BCC·· bit stream, that is, a first parameter set of mutual channel time differences and a second parameter set of mutual channel level differences. The tone signal is converted to the frequency domain and input to a synthesis zone that also receives the decoded ICLD and ICTD. In the synthesis zone or reconstruction zone, use

1281356 BCC參數(ICLD和ICTD)實施單音信號之權重操作,重建 多聲道信號,在頻率/時間轉換後,該多聲道信號代表原始 多聲道音頻信號之重建。 在B C C之情況中,聯合立體聲模組6 0作用在輸出聲 道側資訊以達參數聲道資料之量化及IC LD和ICTD參數之 編碼,其中可使用其中一原始聲道爲參照聲道供聲道側資 訊編碼用。正常而言,載波聲道由帶有參數資料之原始聲 道的加總所形成。 當然,以上技術僅提供那些只能使載波聲道解碼,但 不能產生用於產生多於一個輸入聲道或更多槪算之參數資 料的解碼器之單音表示。 在美國專利申請案之US 2003/0219130 A1、2003/ 0026441 A1及2003/00355 5 3 A1中進一步說明稱爲BCC技 術之音頻編碼技術。此外,進一步參見福勒和邦迦第之 ”雙聲道提示編碼,第二部:設計及應用’’,1 9 93年1 1月, 第1 1冊,第6號之IEEE :音頻與語音處理期刊。而且, 亦參見2002年5月,福勒和邦迦第所著之”應用於立體聲 及多聲道音頻壓縮之雙聲道提示編碼”,未出版品,第1 1 2 屆之音頻工程學會(AES)會議,及由赫利、福勒、俄特爾(C. ERTEL)、希派特(J. HILPERT)、候哲(A. HOELZER)、史賓 格(C. SPENGER)所著之"MP3環境:多聲道音頻之有效及相 容編碼,2004年柏林(BERLIN)之第116屆AES會議,未出 版品6049號。以下當中,針對第6圖至第8圖有更詳細表 示多聲道編碼之典型通用BCC設計。第6圖表示多聲道音 1281356The 1281356 BCC parameters (ICLD and ICTD) implement the weighting operation of the tone signal to reconstruct the multi-channel signal, which represents the reconstruction of the original multi-channel audio signal after frequency/time conversion. In the case of BCC, the joint stereo module 60 acts on the output channel side information to quantify the parameter channel data and encode the IC LD and ICTD parameters, wherein one of the original channels can be used as the reference channel for sound supply. Road side information coding. Normally, the carrier channel is formed by the sum of the original channels with the parameter data. Of course, the above technique only provides a tone representation of a decoder that can only decode the carrier channel, but does not produce a parameter for generating more than one input channel or more. An audio coding technique known as BCC technology is further described in U.S. Patent Application Publication Nos. 2003/0219130 A1, 2003/ 0026 441 A1 and 2003/00355 5 3 A1. In addition, see Fowler and Bunga's "Two-Channel Cue Code, Part 2: Design and Application'', January 1st, 1994, Volume 1, IEEE No. 6: Audio and Voice Handling of journals. Also, see Fowler and Bongaldi's "Two-Channel Cue Code for Stereo and Multi-Channel Audio Compression", May 2002, unpublished, 1st 2nd Audio Engineering Society (AES) meeting, and by Hurley, Fowler, C. ERTEL, J. HILPERT, A. HOELZER, C. SPENGER The "MP3 environment: effective and compatible encoding of multi-channel audio, the 116th AES conference in Berlin (BERLIN) in 2004, unpublished product 6049. In the following, there are more for the 6th to 8th pictures. Detailed representation of a typical universal BCC design for multichannel coding. Figure 6 shows multichannel tone 1281356

頻信號編碼/傳輸用之通用BCC編碼設計。在BCC編碼器 1 1 2之輸入1 1 0處輸入多聲道音頻輸入信號並使其”下行混 合,,在所謂的下行混合區1 1 4中,即轉換成單加總聲道。在 本實施例中,輸入Π 〇處之信號爲具有一前左聲道和一前 右聲道,一左環繞聲道和一右環繞聲道,及一中央聲道之 5聲道環繞信號。向來,下行混合區藉由將這些五個聲道 簡易添加成一單音信號而產生加總信號。在本技術中己知 的所有其它下行混合設計,利用多聲道輸入信號,在任何 情況下,無論是產生單一聲道或是具有多數下行混合聲道 之下行混合信號,其輸出訊號數量都小於原始輸入聲道之 數量。在本實施例中,如果從五個輸入聲道產生四個載波 聲道,則其下行混合之操作已經達成。將單一輸出聲道及/ 或輸出聲道數量輸出在加總信號線1 1 5上。 將由B C C分析區1 1 6所得到之側資訊輸出在側資訊線 117上。在BCC分析區中可計算ICLD、ICTD或相互聲道 相互關聯値(ICC値)之參數集。因此,在BCC合成區122 中之重建至多有三個不同之參數集(ICLD、ICTD和ICC)。 具有參數集之加總信號及側資訊通常以量化和編碼格 式傳輸至BCC解碼器120。BCC解碼器將所傳輸之加總信 號分成許多次頻帶並實施縮放比例,延遲及進一步之處理 ,產生要重建之數個聲道之次頻帶。實施這種處理使得在 輸出121處所重建多聲道信號之ICLD、ICTD及ICC參數 (提示)類似於在進入BCC解碼器1 12中輸入1 10處之原始 多聲道信號的對應之提示。爲此起見,BCC解碼器120包Generic BCC coding design for frequency signal encoding/transmission. Input the multi-channel audio input signal at the input 1 1 0 of the BCC encoder 1 1 2 and make it "downstream mixing", in the so-called downmixing zone 1 1 4, that is converted into a single plus total channel. In an embodiment, the signal input to the 为 is a front left channel and a front right channel, a left surround channel and a right surround channel, and a center channel 5-channel surround signal. The downmix region produces a summed signal by simply adding these five channels as a single tone signal. All other downstream hybrid designs known in the art utilize multichannel input signals, in any case, whether Producing a single channel or having a mixed signal of a plurality of downlink mixed channels, the number of output signals is smaller than the number of original input channels. In this embodiment, if four carrier channels are generated from five input channels, Then, the operation of the downmixing has been completed. The single output channel and/or the number of output channels are output on the sum signal line 1 15 . The side information obtained by the BCC analysis area 1 16 is output on the side information line 117. On. BCC points The parameter set of ICLD, ICTD or mutual channel correlation (ICC値) can be calculated in the analysis area. Therefore, the reconstruction in the BCC synthesis area 122 has at most three different parameter sets (ICLD, ICTD and ICC). The aggregated sum signal and side information are typically transmitted to the BCC decoder 120 in a quantized and encoded format. The BCC decoder divides the transmitted summed signal into a number of sub-bands and performs scaling, delay and further processing to produce a reconstruction. Sub-bands of several channels. This processing is implemented such that the ICLD, ICTD, and ICC parameters (cue) of the reconstructed multi-channel signal at output 121 are similar to the original multi-sound at input 1 10 in the BCC decoder 1 12 Corresponding hint of the channel signal. For this reason, the BCC decoder 120 pack

1281356 含B C C合成區1 2 2和側資訊處理區1 2 3。 以下將針對第7圖說明BCC合成區122之 將線1 1 5上之加總信號輸入在向來以濾波器組 現之時間/頻率轉換區內。在區125輸出處,有 信號或在特殊情況中,如果音頻濾波器組1 2 5 時域取樣產生N個頻譜係數之轉換時則有一頻f BCC合成區122進一步包含延遲級126, 127,相互關聯處理級128及代表反相濾波器組 。如第6圖中所示,在級1 2 9之輸出處可將在 系統情況中具有例如5聲道之重建多聲道音頻 喇叭124機組上。 第7圖進一步說明利用元件125將輸入信 域或濾波器組域。如節點1 3 0所示,元件1 2 5 號會加以放大使能得到數版相同之信號。原始 數等於在要重建之輸出信號中之輸出聲道數量 之原始信號在節點1 3 0受到既定延遲d !、d2、 ’結果爲在區塊1 2 6輸出處之情況包含相同信 遲不同。以第6圖中之側資訊處理區丨2 3計算 該延遲參數如他們爲B C C分析區1 1 6所決定的 聲道時間差。 同樣適用在倍增參數、a2.....ai、aN, 分析區1 1 6所決定之相互聲道位準差,亦由側 123加以計算。 ICC參數由BCC分析區116加以計算並用 內部結構。 FB125來實 N個次頻帶 實施從N個 I係數區。 位準修飾級 之級IFB 1 29 5聲道環繞 信號輸出在 號轉換成頻 所輸出之信 信號之版本 。如果各版 …、d i、d n 號版,但延 延遲參數且 ,得自相互 其根據BCC 資訊處理區 於控制區塊 (S) -10-1281356 contains B C C synthesis zone 1 2 2 and side information processing zone 1 2 3 . In the following, the sum signal on the line 1 15 of the BCC synthesis area 122 will be input to the time/frequency conversion area in which the filter is grouped in the filter. At the output of zone 125, there is a signal or, in special cases, if the audio filter bank 1 2 5 time domain samples produce a conversion of N spectral coefficients, then a frequency f BCC synthesis region 122 further includes delay stages 126, 127, mutual The processing stage 128 and the representative inverting filter bank are associated. As shown in Fig. 6, at the output of stage 1 29, there can be a reconstituted multi-channel audio horn 124 unit having, for example, 5 channels in the system. Figure 7 further illustrates the use of component 125 to input the input domain or filter bank domain. As indicated by node 1 3 0, component 1 2 5 will be amplified to enable the same number of signals to be obtained. The original signal equal to the number of output channels in the output signal to be reconstructed is subjected to a predetermined delay d !, d2, ' at the node 1 3 6 and the same signal delay is included at the output of block 1 2 6 . The delay parameters such as the channel time difference determined by the B C C analysis area 1 16 are calculated by the side information processing area 丨 2 3 in Fig. 6. The same applies to the multiplier parameter, a2.....ai, aN, and the mutual channel level difference determined by the analysis area 1 16 , which is also calculated by the side 123. The ICC parameters are calculated by the BCC analysis area 116 and used for internal structure. FB125 is implemented by N sub-bands from N I-coefficient regions. The level of the level modification level IFB 1 29 5-channel surround signal output is the version of the signal outputted by the number converted into frequency. If each version ..., d i, d n version, but delay parameters and, from each other according to BCC information processing area in the control block (S) -10-

1281356 1 2 8之功能使得在區塊1 2 8輸出處得到所延遲和位準操縱 信號間之既定相互關聯値。要注意的是級126、127和128 之順序可與第7圖中所表示者相異。 進一步注意到在針對區塊處理之音頻信號中,亦針對 區塊實施B C C分析。而且,亦針對頻率,即選頻方式實施 BCC分析。這意爲,對各頻譜帶而言,有ICLD參數、ICTD 參數及ICC參數。對於橫跨所有頻帶之至少一個聲道之 IC LD參數因此代表IC TD參數集。同樣應用到IC LD參數 集,其對於至少一個輸出聲道之重建,代表所有頻帶之所 有ICLD參數。依序同樣適用於ICC參數集,根據輸入聲 道或加總聲道,對於至少一個輸出聲道之重建其再次包含 各種頻道之數個個別ICC參數。 以下當中,參考第8圖,該圖表示從此可看到決定BCC 參數之情況。正常而言,可在任何聲道配對之間界定ICLD 、:[CTD和ICC參數。然而向來,在一參照聲道和各其它輸 入聲道之間決定ICLD和ICTD參數,使得各輸入聲道有相 異參數集。這在第8B圖中亦有說明。 然而,可相異地界定ICC參數。亦如第8B圖之示意 圖所示,通常在任何聲道配對間之編碼器中可產生ICC參 數。在這情況中,解碼器會實施IC C合成,使得當它位於 任何聲道配對間之原始信號中時大致得到相同結果。然而 ,已有建議在任何時間,即各時框,只計算兩最強聲道間 之ICC參數。這種設計表示在第8C圖中,其表示當中一 次計算並傳輸聲道1和2之間IC C參數,且當中另一次計 1281356 算聲道1和5之間IC C參數之一實例。然後,在解碼器中 合成其中兩個最強聲道間之相互聲道相互關聯並執行進一 步典型之啓發性規則,用於合成剩下聲道配對之相互聲道 的一致性。 針對例如根據所傳輸之ICLD參數,計算倍增參數a! .....aN,參考引用之AES會議文件5 5 74。ICLD參數代 表在原始多聲道信號中之能量分佈。不失一般性,第8A 圖表示有代表所有其它聲道和前左聲道之間能量差之四個The function of 1281356 1 2 8 results in a definite correlation between the delayed and level steering signals at the output of block 128. It is to be noted that the order of stages 126, 127, and 128 can be different from those represented in FIG. It is further noted that in the audio signal processed for the block, the B C C analysis is also performed for the block. Moreover, BCC analysis is also performed for the frequency, that is, the frequency selection method. This means that for each spectrum band, there are ICLD parameters, ICTD parameters and ICC parameters. The IC LD parameter for at least one channel across all frequency bands therefore represents the IC TD parameter set. The same applies to the IC LD parameter set, which for the reconstruction of at least one output channel, represents all ICLD parameters for all bands. The same applies to the ICC parameter set. According to the input channel or the total channel, for the reconstruction of at least one output channel, it again contains several individual ICC parameters of various channels. In the following, referring to Fig. 8, the figure shows the case where the BCC parameter is determined from this. Normally, ICLD can be defined between any channel pairing: [CTD and ICC parameters. However, the ICLD and ICTD parameters have traditionally been determined between a reference channel and each of the other input channels such that each input channel has a different set of parameters. This is also illustrated in Figure 8B. However, ICC parameters can be defined differently. As also shown in the schematic of Figure 8B, ICC parameters are typically generated in an encoder between any of the channel pairs. In this case, the decoder performs an IC C synthesis such that it gets roughly the same result when it is in the original signal between any of the channel pairs. However, it has been suggested to calculate only the ICC parameters between the two strongest channels at any time, i.e., at each time frame. This design is shown in Figure 8C, which shows an example of calculating and transmitting IC C parameters between channels 1 and 2, and another one of 1281356 counting IC C parameters between channels 1 and 5. Then, in the decoder, the mutual channels between the two strongest channels are synthesized and a further typical heuristic rule is performed for synthesizing the consistency of the mutual channels of the remaining channel pairs. The multiplication parameter a! .....aN is calculated, for example, according to the transmitted ICLD parameters, referenced to the AES conference file 5 5 74. The ICLD parameter represents the energy distribution in the original multichannel signal. Without loss of generality, Figure 8A shows four energy differences between all other channels and the front left channel.

IC LD參數。在側資訊處理區123中,倍增參數a!.....aN 是得自ICLD參數,使得所有重建輸出聲道之總能量與現 有已傳輸之加總信號之能量相同或至少與該能量成正比。 要決定這些參數之一種方式爲兩階段程序,在當中之第一 階段中,將左前聲道之倍增因子設成1而將第8 C圖中其它 聲道之倍增因子設成已傳輸之ICLD値。然後,在第二階 段中,計算所有五個聲道之能量並和已傳輸加總信號之能 量比較。然後,縮小所有聲道之比例,亦即,使用對於所 有聲道皆均等之比例因子,其中,選定該比例因子,使得 縮放比例後所有重建輸出聲道之總能量等於已傳輸加總信 號及/或已傳輸加總信號之總能量。 針對從B C C編碼器傳輸至B C C解碼器,作爲進一步參 數集之相互聲道一致性測量方法之I C C,要注意的是一致 性之操縱可藉修飾倍增因子加以實施,如藉介於20 log 1 0_6和2 0 1 〇 g 1 0 6間之隨機數値乘以所有次頻帶之權重因 子。一向選定假隨機序列,使得所有關鍵頻帶之變化大致 -12-IC LD parameters. In the side information processing area 123, the multiplication parameters a!.....aN are derived from the ICLD parameters such that the total energy of all reconstructed output channels is the same as or at least the energy of the existing transmitted summed signal. Just proportional. One way to determine these parameters is a two-stage procedure. In the first phase, the multiplication factor of the left front channel is set to 1 and the multiplication factor of the other channels in the 8th C picture is set to the transmitted ICLD. . Then, in the second stage, the energy of all five channels is calculated and compared to the energy of the transmitted summed signal. Then, reduce the proportion of all channels, that is, use a scale factor that is equal for all channels, wherein the scale factor is selected such that the total energy of all reconstructed output channels after scaling is equal to the transmitted summed signal and / Or the total energy of the summed signal has been transmitted. For the ICC transmitted from the BCC encoder to the BCC decoder as a method of mutual channel consistency measurement for further parameter sets, it should be noted that the manipulation of consistency can be implemented by a modified multiplication factor, such as by 20 log 1 0_6 Multiply the random number between 2 0 1 〇g 1 0 6 by the weighting factor of all sub-bands. The pseudo-random sequence has always been chosen so that all key bands change roughly -12-

1281356 等於零且各關鍵頻帶內之平均値爲零。對各相異音框或區 塊之頻譜係數使用相同序列。因此,藉修飾假隨機序列之 變異量,控制音頻情境寬度。較大之變化產生較大之收聽 寬度。可在具有關鍵頻帶之寬度的個別頻帶中實施變異量 之修飾。這允許在各物體具有相異收聽寬度之收聽情境中 同時存在數個物體。假隨機序列之適當振幅分佈如以專利 公開案2002/02 1 9 1 3 0 A1中之表示,依對數比例呈均勻分 佈。 爲了以相容方式,例如以亦適合正常立體聲解碼器之 位元流格式傳輸五個聲道,可使用1 9 9 2年1 0月,泰爾(G. THEILE)和史鐸(G· STOLL)在舊金山(SAN FRAN CISCO) AES未出版品之”音樂攝錄環繞(MUSICAM SURROUND): 與IS O/IEC 1 1 172-3相容之萬用多聲道編碼系統”中所說明 之所謂的矩陣式技術。 而且,進一步參見1994年2月葛利爾(B· GRILL)、赫 利、布蘭登堡、艾伯頓恩(E· EBERLEIN)、庫勒(J. KOLLER) 、米勒(J· MILLER)在阿姆斯特丹AES末出版品3 8 65號之 發表”增進之MPEG2音頻多聲道編碼”中所說明之多聲道編 碼技術’其中,使用相容性矩陣從原始輸入聲道得到下行 混合聲道。 總而言之,例如,亦如2004年舒晳(E. SCHUIJER)、 布里巴特(J· BREEBAART)、朋海根(H. PURNHAGEN)、英 格德迦在柏林第1 19屆AES會議,末出版品6073號,標 題爲’’低複雜參數立體聲編碼”之專家刊行中之說明,你可 說是B C C技術允許多聲道音頻材料之有效且亦往後相容編1281356 is equal to zero and the average 値 in each critical band is zero. The same sequence is used for the spectral coefficients of each phased box or block. Therefore, the audio context width is controlled by modifying the variation of the pseudo-random sequence. Larger changes result in a larger listening width. Modification of the variation can be performed in individual frequency bands having a width of a critical frequency band. This allows for the simultaneous presence of several objects in a listening context where each object has a different listening width. The appropriate amplitude distribution of the pseudo-random sequence is uniformly distributed in logarithmic scale as indicated in the patent publication 2002/02 1 9 1 3 0 A1. In order to transmit five channels in a compatible manner, for example in a bit stream format suitable for normal stereo decoders, use G. THEILE and G. STOLL in October, 1990. In San Francisco (SAN FRAN CISCO) AES unpublished "Music Video Surrounding (MUSICAM SURROUND): Universal Multi-Channel Coding System Compatible with IS O/IEC 1 1 172-3" Matrix technology. Also, see February 1994, B. GRILL, Hurley, Brandenburg, E. EBERLEIN, J. KOLLER, J. MILLER The multi-channel coding technique described in the publication of "Promoted MPEG2 Audio Multi-Channel Coding", published by Amsterdam AES, No. 3, 85, in which a downlink mixing channel is obtained from the original input channel using a compatibility matrix. All in all, for example, as in 2004, E. SCHUIJER, J. BREEBAART, H. PURNHAGEN, Ingdgga at the 19th AES conference in Berlin, final publication No. 6073, titled ''low complex parameter stereo coding'' in the expert publication, you can say that BCC technology allows multi-channel audio material to be effective and backward compatible

-13- 1281356-13- 1281356

碼。就此而論,亦應提到MPEG-4標準且尤其是擴及參數音 頻技術,其中,該標準部分亦爲標號ISO/IEC 1 4496-3 ·· 200 1 /FDAM2(參數音頻)所知。在這方面,應提到的是,尤 其是標題爲’’ps_data()語法”之MPEG-4標準的表8、9中之 語法。在這實例中,我們應提到語法要素” enable + icc’·和 "enable-ipdopd”,其中,使用這些語法要素,開啓及關閉 傳輸ICC參數及對應於相互聲道時間差之相位。應進一步 提到的是語法要素 n i c c _ d a t a ( )π ’’ i p d _ d a t a () ” 和 π 〇 p d __ d a t a ()" 總而言之,要注意的是,通常使用這種參數多聲道技 術是利用一或數個已傳輸之載波聲道,其中從N個原始聲 道形成Μ個已傳輸聲道,再次重建N個輸出聲道或K個輸 出聲道,其中Κ等於或小於原始聲道數Ν。 在到如今所說明之所有技術中之問題爲如何在例如, BCC解碼器和相異版本之參數側資訊的多聲道解碼之不同 型解碼器之間產生格式相容性。尤其是,當相異多聲道解 碼器存在市場上,而同時市場上有由相異多聲道解碼器所 產生之相異參數集的側資訊且因此爲只具單一解碼器之使 用可用時產生兩個問題。 首先,預期使具有高度計算容量之解碼器在解碼時提 供最佳之多聲道音質。然而,同時將亦有解碼器在有限資 源之條件下運作,如爲行動電話之行動裝置的解碼器。當 然,這種解碼器應提供具有仍儘可能爲良好品質之多聲道 輸出,但亦應只具有之有限的計算能力。這造成之問題爲 (Γ -14-code. In this connection, reference should also be made to the MPEG-4 standard and in particular to the parametric audio technology, which is also known by the reference ISO/IEC 1 4496-3 ·· 200 1 /FDAM2 (parametric audio). In this regard, it should be mentioned, in particular the syntax in Tables 8, 9 of the MPEG-4 standard entitled "'ps_data() syntax". In this example, we should mention the syntax element" enable + icc '· and "enable-ipdopd', where these syntax elements are used to turn on and off the transmission ICC parameters and the phase corresponding to the mutual channel time difference. It should be further mentioned that the syntax element nicc _ data ( ) π '' ipd _ data () ” and π 〇pd __ data ()" In summary, it is important to note that the multichannel technology is usually used with one or several transmitted carrier channels, from which N original sounds are used. The track forms one transmitted channel, and N output channels or K output channels are reconstructed again, where Κ is equal to or smaller than the original channel number Ν. The problem in all of the techniques described so far is how to produce format compatibility between different types of decoders for multi-channel decoding of, for example, BCC decoders and parameter side information of disparate versions. In particular, when dissimilar multi-channel decoders are available on the market, while there are side information on disparate parameter sets generated by disparate multi-channel decoders on the market and therefore available for use with only a single decoder There are two problems. First, it is expected that a decoder with a high computational capacity will provide the best multi-channel quality when decoding. However, there will also be decoders operating under conditions of limited resources, such as decoders for mobile devices. Of course, such a decoder should provide multi-channel output with still as good a quality as possible, but should only have limited computing power. The problem caused by this is (Γ -14-

1281356 支援這種可調節性,即允許以高複雜度且因此爲最佳品質 加以解碼及以減低複雜度以解碼,但亦相對應地減低品質 之空間重建是否能爲具有參數集之位元流格式。 當引進新一代/版本之BCC編碼器且爲這般之BCC位 元流要考慮到之進一步觀點爲如何維持相異版本之BCC位 元流和BCC解碼器間相容性之問題。另言之,預期新BCC 參數集以及更新之舊參數集爲往後相容。因此,當因技術 進展,有多聲道設計時當然預期提供升級路徑給允許引入 新增進之多聲道設計的BCC使用者。另一方面,新BCC 位元流格式在這些位元流和各種(較舊)BCC解碼器版本間 通常造成不相容性。 尤其是,要注意的是在漸增數量之應用領域中使用多 聲道編碼器/解碼器,其中該多聲道編碼器/解碼器未必有 最大計算容量,但亦並非永遠需要全音質。 本發明之目的在提供一種有效及彈性之槪念,即,其 例如,允許整合新參數集或更新舊參數集並同時彈性地使 用在各種不同應用上。 這目的之達成是藉由如申請專利範圍第1項之已編碼 多聲道信號產生用之裝置、如申請專利範圍第1 5項之將已 編碼多聲道信號解碼用之裝置、如申請專利範圍第2 1項已 編碼多聲道信號產生用之方法、如申請專利範圍第2 2項將 已編碼多聲道信號解碼用之方法、或如申請專利範圍第2 3 項之電腦程式。 本發明是根據發現到當將已編碼之多聲道信號寫成資 (8) -15- 1281356 料流時,達成已編碼之多聲道信號的往後相容解碼,除至 少一個傳輸聲道或載波聲道外,該資料流包含至少兩個不 同參數集’其中將該兩個參數集寫成資料流,使得可以小 於至少兩個參數集實施輸出聲道之重建。如本發明,寫入 資料流,使得解碼器可確認需要那一個參數集供重建且選 用性地需要那一參數集供重建。在這情況下,解碼器可只 使用不可或缺(即義務性的)之參數集供重建,且如果外在 情況需要的話就忽略選用之參數集。當只使用必要參數集 供重建時,這結果爲解碼器爲快速並以有限之計算容量加 以s理,然而同時,根據代表已編碼多聲道信號之相同資 料流’另一解碼器可實施高品質多聲道之重建,然而該解 碼器亦需要更多時間及/或更多計算容量及/或,更一般而 言,更多解碼器資源。1281356 supports this adjustability, which allows decoding with high complexity and therefore the best quality and reduces complexity for decoding, but also correspondingly reduces the quality of spatial reconstruction can be a bit stream with parameter sets format. A further consideration when introducing a new generation/version of BCC encoder and such a BCC bit stream is how to maintain the compatibility between the BCC bitstream and the BCC decoder in a different version. In addition, it is expected that the new BCC parameter set and the updated old parameter set will be backwards compatible. Therefore, when there is a multi-channel design due to technological advances, it is of course expected to provide an upgrade path to BCC users who are allowed to introduce a new enhanced multi-channel design. On the other hand, the new BCC bitstream format typically creates incompatibilities between these bitstreams and various (older) BCC decoder versions. In particular, it is important to note that a multi-channel encoder/decoder is used in an increasing number of applications where the multi-channel encoder/decoder does not necessarily have the maximum computational capacity, but does not always require full sound quality. It is an object of the present invention to provide an effective and flexible concept that allows, for example, integration of new parameter sets or updating of old parameter sets while being resiliently used in a variety of different applications. This object is achieved by a device for generating a coded multi-channel signal as claimed in claim 1 of the patent application, a device for decoding a coded multi-channel signal as claimed in claim 15 of the patent application, such as a patent application A method for generating a coded multi-channel signal of the twenty-first aspect, a method for decoding a coded multi-channel signal according to the second aspect of the patent application, or a computer program according to the second aspect of the patent application. The present invention is based on the discovery that when encoding an encoded multi-channel signal into a stream of (8) -15 - 1281356, a backward compatible decoding of the encoded multi-channel signal is achieved, in addition to at least one transmission channel or Outside the carrier channel, the data stream contains at least two different parameter sets 'where the two parameter sets are written as a data stream such that reconstruction of the output channels can be performed less than at least two parameter sets. As in the present invention, the data stream is written so that the decoder can confirm which parameter set is needed for reconstruction and optionally which parameter set is needed for reconstruction. In this case, the decoder can only use the indispensable (i.e., obligatory) set of parameters for reconstruction, and ignore the selected parameter set if needed by the external situation. When only the necessary set of parameters is used for reconstruction, the result is that the decoder is fast and singular with a limited computational capacity, while at the same time, the other decoder can be implemented according to the same data stream representing the encoded multi-channel signal. Quality multi-channel reconstruction, however, the decoder also requires more time and/or more computational capacity and/or, more generally, more decoder resources.

在本發明之較佳實施例中,必要參數集包含相互聲道 位準差。如根據本發明已發現的,在界定所有型式之再生 情況的輸出聲道間之基本多聲道聲音分佈時這些相互聲道 位準差極爲重要。可將該相互聲道時間差歸類爲選用參數 集,因他們之主要有關時是當經由,即,從一個傳輸聲道 之兩個輸出聲道之頭載式耳機加以呈現時,或當多聲道音 響以所謂的相當π乾”音情況,即幾乎不含回音之情況呈現 時。相互聲道時間差可因此已被歸類爲選用參數集。 相互聲道之相互關係値之重要性在提供音源寬度且進 一步產生聆聽者之感受,該聆聽者之他或她是位在具有複 雜音源之劇情中’例如包含許多非相互關聯聲音成份之古 -16-In a preferred embodiment of the invention, the set of necessary parameters includes mutual channel level differences. As has been found in accordance with the present invention, these mutual channel level differences are extremely important in defining the basic multi-channel sound distribution between the output channels of all types of reproduction. The mutual channel time difference can be classified as an optional parameter set, because their main relevant time is when the sound is transmitted through the headphone of the two output channels of one transmission channel, or when multiple sounds are present. The sound of the channel is presented in the so-called equivalent π dry tone case, that is, when there is almost no echo. The mutual channel time difference can therefore be classified as an optional parameter set. The mutual relationship between the channels is important. Width and further the feeling of the listener, who is in a story with complex sound sources, such as the ancient-16- containing many non-interrelated sound components

1281356 參數集取得另一版本編號,使得當解碼器接受無法處理較 高版本編號之參數集時,它就只是使用對應於較低版本編 號之參數集加以重建。 最後’要注意的是代表多聲道信號之資料流未必亦一 定要包含傳輸聲道。反倒是,他們已被產生並分別傳輸, 如在將BCC參數集寫到CD之往後一個對應聲道的情況中 ,其中CD包含Μ個(=等於或大於傳輸聲道。 以下將針對隨圖,詳細說明本發明之較佳實施例,其 中: 【實施方式】 第2a圖表示代表未編碼之多聲道信號的已編碼多聲 道信號產生用裝置的較佳實施圖,該未編碼之多聲道信號 包含被進給至手段2 2之輸入2 0內的1^個原始聲道加以提 供Μ個傳輸聲道及具有至少兩個參數集之參數資訊。尤其 是,在手段22之輸出23處所輸出之傳輸聲道個數Μ小於 原始音頻聲道個數Ν。將一起代表參數資訊供重建κ個輸 出聲道之個別參數集應用至手段22之輸出24a,24b,24c 以便提供。將其中Μ等於或大於1且小於n之Μ個傳輸聲 道供應至手段25,將資料流寫在適用於輸出26之輸出端 ,就像輸出24a,24b,24c處之參數集。 如以上討論,亦可從參數資訊個別傳輸/儲存下行混合 資訊(M個傳輸聲道)。 將寫入代表已編碼多聲道信號之資料流的手段2 5設 計成將Μ個傳輸聲道寫入資料流內且進而將第一、第二和 -18- 1281356 第三參數集寫入資料流內,使得不需使用三個參數集中之 一且最佳是甚至不需使用三個參數集中之至少兩個參數集 可完成K個輸出聲道之重建。關於這方面,標示供應用之 手段22輸出24a至24c處的參數集,使得如第一參數集之 一個參數集絕對需要重建,而將兩個進一步之參數集,即 第二和第三參數集界定成使其只是選擇需要重建。The 1281356 parameter set takes another version number so that when the decoder accepts a parameter set that cannot process a higher version number, it simply rebuilds using the parameter set corresponding to the lower version number. Finally, it should be noted that the data stream representing the multi-channel signal does not necessarily have to include the transmission channel. Instead, they have been generated and transmitted separately, as in the case of writing a BCC parameter set to a corresponding channel in the CD, where the CD contains one (= equal to or greater than the transmission channel. DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENTS A preferred embodiment of the present invention will be described in detail. [Embodiment] FIG. 2a shows a preferred embodiment of an apparatus for generating an encoded multi-channel signal representing an uncoded multi-channel signal, which is not encoded. The channel signal contains 1^ original channels that are fed into the input 20 of the means 2 2 to provide one transmission channel and parameter information having at least two parameter sets. In particular, at the output 23 of the means 22 The number of transmission channels output by the location is smaller than the number of original audio channels. The individual parameter sets representing the parameter information for reconstruction of the κ output channels are applied to the outputs 24a, 24b, 24c of the means 22 for providing.传输 One transmission channel equal to or greater than 1 and less than n is supplied to means 25, and the data stream is written at an output suitable for output 26, just like the parameter set at outputs 24a, 24b, 24c. As discussed above, Available from parameters Individually transmitting/storing downlink mixed information (M transmission channels). The means 25 for writing a data stream representing the encoded multi-channel signal is designed to write one transmission channel into the data stream and thus the first The second and -18-1281356 third parameter sets are written into the data stream, so that it is not necessary to use one of the three parameter sets and optimally, even without using at least two parameter sets of the three parameter sets, K can be completed. Reconstruction of the output channel. In this regard, the set of parameters for the supply means 22 outputs 24a to 24c are marked such that a parameter set such as the first parameter set absolutely requires reconstruction, and two further parameter sets, ie The second and third parameter sets are defined such that they are only selected for reconstruction.

寫入用之手段2 5則將作爲必要參數集之第一參數集 寫入資料流內並如以下之討論,將只作爲選用參數集之第 二參數集和第三參數集寫入資料流中。 將第2a圖輸出26處之資料流進給至第2b圖中所說明 之多聲道解碼器之資料流輸入2 7內。將資料流之資料供應 至手段2 8供讀取資料流用,其中供讀取資料流用之手段 28,就像第2a圖中所示之編碼器,再次包含供抽取自資料 流用之Μ個傳輸聲道用之邏輯輸出29且進而包含供內含 在資料流中之參數集用之邏輯輸出30a, 30b。在本發明之 較佳實施例中,將第一參數集標示爲必要或絕對需要重建 ,供讀取用之手段2 8將經由邏輯輸出3 0 a,提供該第一參 數集給重建用之手段3 1。例如,將讀取用之手段2 8固定 設爲只讀取必要參數集並將他們供應至重建用之手段3 1, 手段28將在輸入27就只略過資料流中之第二參數集,該 第二參數集以符號由第2b圖中之中斷邏輯輸出30b表示。 亦可經由控制輸入3 2將是否從資料流抽取僅有之必 要參數集或亦另爲選用參數集並將其供應至手段31的控 制信號供應至手段2 8,其中,從那裡所得到之資源可用性 -1 9 - C§) 1281356 道重建所需之至少兩參數集加以歸類。將感受最重大’即 已重建之多聲道信號品質之參數集標示爲資料流中之必要 參數集,而只將其它參數集標示爲選用參數集。進而將參 數集分級爲必要,選用,以及例如,亦可實施只是錄音室 重建所需之參數集以達成,例如,三個比例化步驟,而非 只是兩個步驟。要注意的是足以標示義務性的或較佳是選 用之參數集,因爲各別未標示之參數集型式自動由缺乏標 示所造成。The means for writing 2 5 writes the first parameter set as a necessary parameter set into the data stream and, as discussed below, writes only the second parameter set and the third parameter set of the selected parameter set into the data stream. . The data stream at output 26 of Figure 2a is fed into the data stream input 27 of the multi-channel decoder illustrated in Figure 2b. Supplying data to the means for reading data streams, wherein the means for reading the data stream 28, like the encoder shown in Figure 2a, again contains one of the transmitted sounds for the data stream The logic output 29 for the channel and, in turn, includes logic outputs 30a, 30b for the set of parameters contained in the data stream. In a preferred embodiment of the invention, the first set of parameters is marked as necessary or absolutely necessary for reconstruction, and the means for reading 28 will provide the first set of parameters to the means for reconstruction via the logical output 3 0 a 3 1. For example, the means for reading 28 is fixed to read only the necessary parameter sets and supply them to the means for reconstruction 3 1, means 28 will only skip the second set of parameters in the data stream at input 27. The second set of parameters is represented by the interrupt logic output 30b in Figure 2b. It is also possible to supply, via the control input 3 2, whether the necessary parameter set is extracted from the data stream or a control signal which is also selected as a parameter set and supplied to the means 31, to the means 2 8, wherein the resources obtained therefrom Availability-1 9 - C§) At least two parameter sets required for 1281356 channel reconstruction are categorized. The parameter set that feels the most significant', that is, the reconstructed multi-channel signal quality, is marked as the necessary parameter set in the data stream, and only the other parameter sets are marked as the selected parameter set. The parameter set is then graded as necessary, selected, and, for example, a set of parameters needed for studio reconstruction can be implemented to achieve, for example, three scaling steps instead of just two steps. It is important to note that it is sufficient to indicate a mandatory or preferably a parameter set, since the unlabeled parameter set patterns are automatically caused by a lack of indication.

第la圖表示資料流示意圖,在第la圖中所示之實施 例中,該資料流包含編號爲1 1,當中有Μ個傳輸聲道之資 料的第一控制區10,及各參數集之12a,12b,12c區。在本 發明之較佳實施例中,控制區1 0如第1 b圖中之示意說明 ,包含各種資訊片段。因此,控制區1 0之入口 1 〇〇以標題 爲’’ n u m B c c D a t a M a n d π欄發出必要參數集個數之信號。而且 ,欄位1 0 1發出是否有選用參數集之信號。爲了這個目的 起見,使用標示noptBccDataPresent”之欄位。進一步控制 區10欄位更發出具有變數"numBccDataOpt”之選用參數集 個數信號。進一步之103,104,105區發出各參數集之型式 及/或參數集i版本編號之信號。這使用名稱爲"BccDatalcT 之欄位。進一步之選用欄位序列106,107,108付予名爲 ’’Lengthinfo”之選用長度資訊給標示爲選用,即包含在選用 參數集個數中之各參數集。該長度資訊付予相對應有關例 如第i參數集之位元長度。如以下將要討論者,"Lengthinfo’1 可亦包含有關發出長度信號或另一種替代方式是可亦包含 實際長度規格所需之位元數的資訊。 -22-Figure la shows a data flow diagram. In the embodiment shown in Figure la, the data stream includes a first control area 10 numbered 1 1 with data of one of the transmission channels, and a set of parameters. Zone 12a, 12b, 12c. In a preferred embodiment of the invention, control region 10, as schematically illustrated in Figure 1b, contains various pieces of information. Therefore, the entry of the control region 10 信号 signals the number of necessary parameter sets in the column entitled '' n u m B c c D a t a M a n d π column. Moreover, field 1 0 1 signals whether there is a signal of the selected parameter set. For this purpose, the field labeled noptBccDataPresent" is used. Further control field 10 fields are issued with the number of optional parameter sets having the variable "numBccDataOpt". Further, the 103, 104, 105 zone signals the type of each parameter set and/or the parameter set i version number. This uses a field named "BccDatalcT. Further, the selected field sequence 106, 107, 108 is given to the selected length information named ''Lengthinfo' to be selected as the selection, that is, each parameter set included in the selected parameter set number. The length information is paid correspondingly. For example, the bit length of the i-th parameter set. As will be discussed below, "Lengthinfo'1 may also contain information about the number of bits required to issue a length signal or another alternative that may also include the actual length specification. twenty two-

1281356 I CLD — V 1。另外,也可以將資料集寫成使得義務性之資料 集永遠只以一個版本存在資料流中。 第3c圖表示選用參數集之識別。因此,在有關選用參 數集之資訊中,讀進各參數集之第lb圖中之參數集識別碼 1 〇 3至1 0 5,得到有關選用之各參數集的資訊。而且,如第 3c圖中之指令M〇ptChunkLen()’’所示,如果在位元流中傳 輸參數集長度,則讀進各選用參數集之參數集長度。 有關選用參數集長度資訊之決定,參照第3d圖,該圖 說明在本發明較佳實施例中如何從與各選用參數集有關之 資料讀進各參數集之位元長度。 第4 a圖中示意說明解碼器所實施之參數集讀取迴圈 。因此,以BccD at a()讀進在第1圖12a至12c區中之實 際參數集資料。 第4b圖中說明長度資訊之讀取。例如,BccDataLen1281356 I CLD — V 1. Alternatively, the data set can be written such that the obligatory data set is always stored in only one version of the data stream. Figure 3c shows the identification of the selected parameter set. Therefore, in the information about the selected parameter set, the parameter set identification codes 1 〇 3 to 1 0 5 in the lb diagram of each parameter set are read, and information about each parameter set selected is obtained. Moreover, as indicated by the instruction M〇ptChunkLen()'' in Fig. 3c, if the parameter set length is transmitted in the bit stream, the parameter set length of each of the selected parameter sets is read. For the decision to select the parameter set length information, reference is made to Figure 3d, which illustrates how the bit lengths of the various parameter sets are read from the data associated with each of the selected parameter sets in a preferred embodiment of the present invention. The parameter set read loop implemented by the decoder is schematically illustrated in Figure 4a. Therefore, the actual parameter set data in the area of Fig. 1a to Fig. 12c is read in BccD at a(). The reading of the length information is illustrated in Figure 4b. For example, BccDataLen

Bits說明發出一資料塊之實際位元長度信號所需之位元個 數。然後BccD at aLen實際上付予一資料所具有之位元長度 。這兩級之系統一方面有彈性且另一方面節省資料,因爲 尤其是當資料塊具有重大變化之位元長度,其尤其是適用 於非常不同型式且因此長度之參數集時,它是有效的。這 將允許未來資料塊之長度可以任意界定。 第4c圖最後表示參數集之切換,其中評估如第3b圖 中所說明之參數集識別碼,使得參數集和相對應之重建演 算法聯結,故未發生例如採取相互聲道位準差爲相互聲道 時間差,及反之的情況。 -24- (8>Bits specifies the number of bits required to signal the actual bit length of a block. Then BccD at aLen actually pays the bit length of a data. The two-stage system is flexible on the one hand and saves data on the other hand, because it is effective especially when the data block has a significant change in the bit length, especially when applied to a very different type and therefore a length parameter set. . This will allow the length of future data blocks to be arbitrarily defined. Finally, Figure 4c shows the switching of the parameter set, wherein the parameter set identification code as illustrated in Figure 3b is evaluated such that the parameter set is associated with the corresponding reconstruction algorithm, so that no mutual channel difference is taken, for example, Channel time difference, and vice versa. -24- (8>

1281356 第4c圖進一步表示當已確認參數集爲選用且不預期 利用選用參數集之解碼時,略過該參數集之位元數(”略過 並繼_ ”),當已讀進所有必要參數集(或有解碼器未知之資 料,例如,篸數集)時,開始輸出,不需考慮進一步之選用 參數集。(’’停止分析,開始輸出”)。當解碼器已讀進至少一 義務性資料塊且無法分析資料流中進一步之資訊時,這種 解碼器將因此開始輸出。因此,以它所不了解之資料流內 容未促使解碼器朝向完全錯誤之出口。這產生非常有強健 性之解碼器。 以下當中,將根據本發明之較佳實施例,更詳細地說 明本發明之功能性。例如,如I C L D、I C T D、I C C各種型式 之參數資訊和未來可加以界定之其它參數集資訊是被收容 在不同且分開之資料部中,即不同之比例化層中。爲了這 個起見,再次參照第4a圖至第4c圖。將參數集區分爲如 相互聲道位準差參數集之必要或(義務性)參數集,及如相 互聲道時間差參數集和相互聲道相互關係値之參數集的選 用參數集。 提供有關必要參數集個數(numBccDataMand)和存在 (OptBccDataPresent)以及選用參數集個數(numBccDataOpt) 之貝訊°正常而g ’有關必要參數集個數(niimBccData Mand) 之資訊依系統規格而定且因此未必一定要明確地加以傳輸 ’但可固定地將它儲備在編碼器和解碼器之間。對照之下 ’車父佳的是明確地傳輸選用參數集個數(numBccDataOpt) 。當存在參數(OptBccDataPresent)表示選用參數集之存在1281356 Figure 4c further shows that when the confirmed parameter set is selected and is not expected to be decoded using the selected parameter set, the number of bits in the parameter set is skipped ("skip and succeed _"), when all necessary parameters have been read. When the set (or the data that is unknown to the decoder, for example, the set of turns), the output is started, and no further selection of parameter sets is required. (''Stop analysis, start output'). When the decoder has read at least one obligatory data block and cannot analyze further information in the data stream, the decoder will start outputting accordingly. Therefore, it does not understand it. The data stream content does not cause the decoder to face a completely erroneous exit. This produces a very robust decoder. In the following, the functionality of the present invention will be explained in more detail in accordance with a preferred embodiment of the present invention. For example, The parameter information of various types of ICLD, ICTD, ICC and other parameter sets that can be defined in the future are contained in different and separate data sections, ie different proportioning layers. For this reason, refer again to Figure 4a. Up to Fig. 4c. The parameter set is divided into necessary or (obligatory) parameter sets such as mutual channel level difference parameter sets, and parameter parameters such as mutual channel time difference parameter sets and mutual channel correlation parameters. Provides the number of necessary parameter sets (numBccDataMand) and presence (OptBccDataPresent) and the number of selected parameter sets (numBccDataOpt). Normally, the information about the number of necessary parameter sets (niimBccData Mand) depends on the system specifications and therefore does not necessarily have to be explicitly transmitted 'but can be fixedly stored between the encoder and the decoder. 'Car father's is to explicitly transmit the number of selected parameter sets (numBccDataOpt). When there is a parameter (OptBccDataPresent) indicates the existence of the selected parameter set

-25--25-

1281356 時,如第3 a圖中之說明,開始有關 應部位。 本發明之較佳實施例中,進一 碼(Be cD at a Id)。亦如第3b圖中之語 如ICLD、ICTD或ICC之參數集型 法版本的資訊。正常而言,必要參 確地發出,而明確地發出選用參數 在這情況中,必需將它儲備在編碼 例如,碰到之第一參數集爲必要之 情況下,包含,例如,相互聲道位 由規定參數集型式之次序亦可明確J 參數集將最佳是包含參數集長 集長度資訊允許解碼器藉由就只略 數集’甚至不需解碼器知道參數集 了這個起見,參照第4b圖。 在本發明較佳實施例中,反正 理有關任何情況中必要參數集之資 們,因此,必要之參數集不需要包 此’解碼器發現一參數集且該參數 長度資訊時,可能實施解碼器爲假 在既定可用參數集之間,且由於不 實,該參數集爲一必要參數集。 對於選用之參數集,可傳輸參 用情況而定。爲了增進編碼器和解 選用參數集資訊之相對 步設有各參數集之識別 Ϊ明,該識別碼提供有關 式,及/或某一參數集語 數集識別碼信號不會明 之識別碼信號。然而, 器和解碼器之間,使得 參數集,其在固定儲備 準差參數集。另外,藉 早定參數集型式之資訊。 度資訊。提供這種參數 過相關位元,忽略這參 之確實位元流語法。爲 因解碼器必須分析並處 料,而無法就只丟棄他 含參數集長度資訊。因 集不含任何相關進一步 設參數集(例如,ICLD) 含任何相對應資訊之事 數集長度資訊或不依應 碼器間之互通性,簡易 -26- 1281356 規則可爲所有選用參數集包含參數集長度資訊。然而’爲 了節省位元,可不傳輸最後參數集之長度資訊’因爲不再 需要略過這些資料並存取一隨後之參數集’因爲該參數集 反正爲最後參數集。如第la圖中之說明,當資料區爲第彳 個參數集1 2 c所實際上終止時且當隨後,例如’不再有加 總信號及/或剛處理之Μ個傳輸聲道區之控制資訊等時’該 步驟明顯有用。At 1281356, as indicated in Figure 3a, the relevant part is started. In a preferred embodiment of the invention, a code (Be cD at a Id) is entered. Also as in the figure in Figure 3b, such as the information on the version of the parameter set method of ICLD, ICTD or ICC. Normally, it is necessary to issue it arbitrarily and explicitly issue the optional parameter. In this case, it must be stored in the code, for example, if the first parameter set encountered is necessary, including, for example, mutual channel bits. It is also clear from the order of the specified parameter set patterns that the J parameter set will best contain the parameter set long set length information. The decoder allows the decoder to know only the parameter set by even a few sets. 4b picture. In the preferred embodiment of the present invention, the counter-rational information is related to the necessary parameter sets in any case. Therefore, the necessary parameter set does not need to include the decoder when a parameter set is found and the parameter length information is used, and the decoder may be implemented. This parameter set is a necessary parameter set if it is between the set of available parameters and because it is not true. For the selected parameter set, it can be transferred depending on the application. In order to improve the encoder and the solution, the relative step of selecting the parameter set information is provided with an identification of each parameter set, and the identification code provides a correlation code, and/or an identification code signal whose signal set identifier signal is not clear. However, between the device and the decoder, the parameter set is set to a fixed reserve parameter set. In addition, the information of the parameter set type is determined early. Information. This parameter is provided with an overcorrelated bit, ignoring the exact bitstream syntax. In order for the decoder to analyze and process, it is impossible to discard only the parameter length information. Since the set does not contain any relevant further parameter sets (for example, ICLD), the length information of any corresponding information, or the interoperability between the non-compliant encoders, the simple -26-1281356 rule can include parameters for all selected parameter sets. Set length information. However, in order to save the bit, the length information of the last parameter set may not be transmitted because it is no longer necessary to skip the data and access a subsequent parameter set because the parameter set is the last parameter set anyway. As illustrated in FIG. 1a, when the data area is actually terminated by the second parameter set 1 2 c and subsequently, for example, 'there is no more total signal and/or one of the transmission channel areas just processed. This step is obviously useful when controlling information, etc.

例如根據資源可用性資訊32(第2b圖),明確之發信可 爲,如根據第3 d圖中已說明者,利用通知解碼器有關參數 集長度資訊之存在/長度的位元流要素,可由編碼器動態地 發出傳輸參數長度資訊之信號。 以下當中,將討論第2b圖中所示解碼器之解碼程序的 較佳實施例。較佳解碼器首先檢查最佳爲相互聲道位準差 參數集之必要(義務性)參數集的可用性。而且當ICLD參數 集之語法版本編號高於解碼器本身能解碼之版本編號時, 利用第2b圖之重建手段31不可完成重建,其中之解碼器 支援之語法版本例如從1至η。在所有其它情況中,藉由 將必要之參數集加以解碼且當未使用任何選用參數集時, 只使用必要之參數集實施多聲道合成可完成既定形式之正 常解碼程序。 然而,解碼器檢測到選用參數集時,它可使用它或捨 棄其內容,例如依以下討論之情況而定選取兩種可能性之 如果選用參數集之語法版本編號高於這種參數集型之 (§) -27- 1281356 施多聲道之重建。 總而言之,以下再次表示解碼器之基本特性,解碼器 可有利地使用該特性,以低資料率之資料流達成有效及高 品質之解碼。 針對已重建之多聲道信號品質,如果參數集在κ個輸 出聲道重建中比另一梦數集較不重要時則將寫入手段25 設計成寫入資料集’使得不需使用較不重要之資料集加以 重建是可能的。 較佳的是進而將寫入手段25設計成使參數集設有相 關聯之識別碼1 〇 〇至1 0 5,其中參數集之識別碼表示重建 絕對必須使用參數集’或其中另一參數集之識別碼表示重 建可非強制性地只使用參數集。 較佳的是進而將寫入手段25設計成將Μ個傳輸聲道 寫入資料流之資料集的傳輸聲道部U,將第一參數集寫人For example, according to the resource availability information 32 (FIG. 2b), the explicit signaling may be, for example, according to the description in FIG. 3D, using the bit stream element of the notification decoder regarding the existence/length of the parameter set length information, The encoder dynamically signals the transmission of parameter length information. In the following, a preferred embodiment of the decoding procedure of the decoder shown in Fig. 2b will be discussed. The preferred decoder first checks the availability of the necessary (obligatory) parameter sets that are optimal for the mutual channel level difference parameter set. Moreover, when the syntax version number of the ICLD parameter set is higher than the version number that the decoder itself can decode, the reconstruction may not be completed by the reconstruction means 31 of Fig. 2b, wherein the syntax version supported by the decoder is, for example, from 1 to η. In all other cases, the normal decoding procedure in the established form can be accomplished by decoding the necessary set of parameters and when not using any of the optional parameter sets, performing multi-channel synthesis using only the necessary parameter sets. However, when the decoder detects the selected parameter set, it can use it or discard its contents. For example, according to the following discussion, two possibilities are selected. If the grammar version number of the selected parameter set is higher than this parameter set type (§) -27- 1281356 Reconstruction of multi-channel. In summary, the following again represents the basic characteristics of the decoder, which can advantageously be used by the decoder to achieve efficient and high quality decoding with low data rate data streams. For the reconstructed multi-channel signal quality, if the parameter set is less important than the other dream set in the κ output channel reconstruction, then the writing means 25 is designed to write the data set 'so that no need to use less It is possible to rebuild important data sets. Preferably, the writing means 25 is further designed such that the parameter set is provided with an associated identification code 1 1 to 1 0 5, wherein the identification code of the parameter set indicates that the reconstruction must absolutely use the parameter set 'or one of the other parameter sets The identification code indicates that the reconstruction can optionally use only the parameter set. Preferably, the writing means 25 is further designed to write the one transmission channel to the transmission channel portion U of the data set of the data stream, and write the first parameter set to the person

第一參數集部12a中並將第二參數集寫入第二參數集部 1 2 b中,使得解碼器不需讀取和解譯第二參數集部〗2 b可 重建K個輸出聲道。 如果從包含相互聲道位準差、相互聲道時間差、相互 聲道相位差或相互聲道一致性資訊之以下族群選定參數集 ’則將舄入手段2 5 g受s十成將相2聲道位準差參數集標示爲 必須解碼並將族群中至少一其它參數集標示爲選用解碼。 最佳的是將寫入手段2 5 §§;計成使第二參數集設有表 示資料集中多大資料量屬於第二參數集之長度資訊1〇6至 使得解碼器能略過根據長度資訊之資料量,其中長度 -29- 108,The first parameter set portion 12a writes the second parameter set into the second parameter set portion 1 2 b so that the decoder can reconstruct the K output channels without reading and interpreting the second parameter set portion 2 b . If the parameter set is selected from the following groups containing mutual channel bit difference, mutual channel time difference, mutual channel phase difference or mutual channel consistency information, then the intrusion means 2 5 g will be s ten The set of track quasi-parameters is indicated as having to be decoded and at least one other parameter set in the population is marked as optional decoding. Preferably, the writing means 2 5 § § is calculated such that the second parameter set is provided with a length information indicating that the amount of data in the data set belongs to the second parameter set 1 〇 6 to enable the decoder to skip the information according to the length The amount of data, which is -29-108 in length,

1281356 資訊最佳是包含長度欄位之位元長度發信用的第 且其中長度欄位包含付予第二參數集之位元量的 最佳的是將寫入手段2 5進而設計成將個數資| 成表示許多選用參數集之資料流,不需該選用參數 碼器可完成K個輸出聲道之重建。 最佳的是將寫入手段2 5進而設計成使語法版 103至105與參數集聯結,使得僅當語法版本資震J 定狀態時,解碼器將使用相對應之參數集實施重建 最佳的是,第二參數集且進而之選用參數集, 的話,進而只有語法版本資訊。 而且,資料流中之參數集序列中之最後選用| 以不含任何相關聯之長度資訊。 而且,可將寫入手段2 5設計成在資料流中動ί 信號存在及參數集長度資訊長度之信號。 可將提供用之手段22設計成提供資料區序列 傳輸聲道,該傳輸聲道是基於至少一個原始聲道二 樣區序列。 依情況而定,可以硬體或軟體實施產生及/或 明方法。可在數位儲存媒體上完成實施,尤其是 或具有可以電氣方式讀出之控制信號的CD,其可 之電腦系統聯合以達方法之執行。通常,當電腦 在電腦上執行時,本發明因此亦在於具有儲存在 取之載波,供實施該方法之程式碼的電腦程式產 _攔位, :元長度 % 102 寫 :集,解 (本資訊 ^具有預 〇 如適用 =數集可 !地發出 給IV[個 .時間取 〖碼之發 :性磁碟 ί程式化 [式產品 ί器可讀 f。另言 -30- 1281356 之’當電腦程式在電腦上執行時,可因此以具有用於實施 該方法之程式碼的電腦程式使本發明實現。 【圖式簡單說明】 以下將針對隨圖,詳細說明本發明之較佳實施例,其 中: 第1 a圖爲如本發明實施例具有既定資料流語法之已 編碼多聲道信號的槪觀; 第1 b圖爲如本發明實施例之第1 a圖控制區塊的詳細1281356 The best information is the length of the bit length including the length field and the length field contains the amount of bits to be given to the second parameter set. The best way is to write the writing means 25 and then design the number.资|成 indicates that many data streams are selected for the parameter set. It is not necessary to use the parameter coder to complete the reconstruction of K output channels. Preferably, the writing means 25 is further designed to link the grammar versions 103 to 105 with the parameter set such that the decoder will perform the reconstruction optimally using the corresponding parameter set only when the grammar version is in the state of the state. Yes, the second parameter set and, in turn, the parameter set, then only the grammar version information. Moreover, the last selected in the sequence of parameter sets in the data stream does not contain any associated length information. Moreover, the writing means 25 can be designed to signal the presence of the signal and the length of the length of the parameter set in the data stream. The means 22 for providing may be designed to provide a data zone sequence transmission channel based on at least one original channel sample zone sequence. Depending on the situation, the production and/or method may be implemented in hardware or software. The implementation can be carried out on a digital storage medium, in particular or with a control signal that can be electrically read out, which can be combined with the computer system to perform the method. Generally, when the computer is executed on a computer, the present invention is therefore also embodied in a computer program that stores the code stored in the carrier for implementing the method. _Block, : Yuan length % 102 Write: Set, Solution (This information ^ has pre-appreciation = apply to the number set! Can be issued to the IV [time. Time to take the code: the disk is stylized [type product readable device f. Another word -30-1281356' when the computer When the program is executed on a computer, the present invention can be implemented by a computer program having a program code for implementing the method. [Simplified Description of the Drawings] The preferred embodiment of the present invention will be described in detail below with reference to the accompanying drawings. : FIG. 1a is a view of an encoded multi-channel signal having a predetermined data stream syntax according to an embodiment of the present invention; FIG. 1b is a detail of a control block of FIG. 1a according to an embodiment of the present invention;

第2a圖爲如本發明實施例之編碼器的方塊電路圖; 第2b圖爲如本發明實施例之解碼器的方塊電路圖; 第3a圖至第3d圖表示如本發明參數集配置之較佳實 施圖; 第4a圖至第4c圖表示如本發明參數集資料之較佳實 施圖; 第5圖表示多聲道編碼器之一般表示圖;2a is a block circuit diagram of an encoder according to an embodiment of the present invention; FIG. 2b is a block circuit diagram of a decoder according to an embodiment of the present invention; and FIGS. 3a to 3d are diagrams showing a preferred implementation of a parameter set configuration according to the present invention; Figure 4a to 4c show a preferred embodiment of the parameter set data of the present invention; Figure 5 shows a general representation of the multi-channel encoder;

第6圖爲BCC編碼器/BCC解碼器路徑之示意方塊圖 第7圖爲第6圖之BCC合成區之方塊電路圖;以及 第8A圖至第8C圖爲用於計算參數集ICLD、ICTD和 ICC之典型綱要的表示圖。 【主要元件符號說明】 10 控制區 11 Μ個傳輸聲道部. -31- 12813566 is a schematic block diagram of a BCC encoder/BCC decoder path. FIG. 7 is a block circuit diagram of a BCC synthesis area of FIG. 6; and 8A to 8C are diagrams for calculating parameter sets ICLD, ICTD, and ICC. A representation of a typical outline. [Main component symbol description] 10 Control area 11 One transmission channel section. -31- 1281356

12a 第一參數集部 12b 第二參數集部 20 輸入(N輸入聲道) 22 手段 23 輸出(M傳輸聲道) 24a 輸出(第一 參數集)(必要的) 24b 輸出(第二 參數集)(選用的) 24c 輸出(第三 參數集)(選用的) 25 手段 26 輸出 27 輸入 28 手段 29 邏輯輸出 30a 邏輯輸出 30b 邏輯輸出 3 1 手段 32 控制輸入 33 輸出 60 聯合立體聲裝置 110 輸入 112 B C C編碼器 114 下行混合區 115 加總信號線 116 B C C分析區12a first parameter set section 12b second parameter set section 20 input (N input channel) 22 means 23 output (M transmission channel) 24a output (first parameter set) (required) 24b output (second parameter set) (optional) 24c output (third parameter set) (optional) 25 means 26 output 27 input 28 means 29 logic output 30a logic output 30b logic output 3 1 means 32 control input 33 output 60 joint stereo device 110 input 112 BCC code 114 Downstream mixing zone 115 summing signal line 116 BCC analysis zone

Cs) -32- 1281356Cs) -32- 1281356

117 側資訊線 120 B C C解碼器 12 1 輸出 122 BCC合成區 123 側資訊處理區 124 喇叭 125 濾波器組 126 延遲級 127 位準修飾級 128 相互關聯處理級 129 級IFB 130 節點117 side information line 120 B C C decoder 12 1 output 122 BCC synthesis area 123 side information processing area 124 speaker 125 filter bank 126 delay stage 127 level modification level 128 interrelated processing level 129 level IFB 130 node

C8) -33-C8) -33-

Claims (1)

1281356 第94 1 45 265號「已編碼之多聲道信號的產生裝置和方法 、已編碼之多聲;道信號的解碼裝置和方法以及記錄媒體 」專利案 (2006年1 1月修正) 十、申請專利範圍: 1 . 一種已編碼多聲道信號的產生裝置,該信號代表包 含N個原始聲道之非編碼之多聲道信號,其中N等 於或大於2,該裝置包含: 提供手段(22),提供來自Μ個傳輸聲道(2 3)的 參數資訊(24a,24b,24c),用於重建Κ個輸出聲道 ’其中Μ等於或大於1且等於或小於n,其中K大 於Μ且等於或小於Ν,其中該參數資訊包含至少兩 個相異參數集,用於重建一個及相同之輸出聲道; 以及 寫入手段(2 5 ),用於寫入一資料流(2 6 ),其中將 該寫入手段(25)設計成:將該第一及第二參數集寫 入該資料流中,因此,不必使用該第二參數集及該 Μ個傳輸聲道(23)中至少一聲道,就可完成該κ個 輸出聲道中至少一聲道之重建。 2 . —種已編碼多聲道信號的解碼裝置,該信號代表包 含Ν個原始聲道之非編碼多聲道信號,其中,以包 含參數資訊之資料流來代表已編碼之多聲道信號, 而該參數資訊係來自Μ個傳輸聲道且用以重建反個 輸出聲道,其中Μ等於或大於1且等於或小於Ν, 其中Κ大於Μ且等於或小於ν,其中該參數資訊係 1281356 - 包含至少兩個相異參數集,用以重建一個及相同輸 出聲道,且其中將該第一及第二參數集寫入該資料 流中,藉以完成該K個輸出聲道之重建,而不必使 用該第二參數集,該裝置包含: 資料流讀取手段(28),用於讀取資料流,藉以 讀入該第一參數集(30a)且略過第二參數集(30b)。 • 3 .如申請專利範圔第2項之裝置,其中更包含: 重建手段(32),使用該Μ個傳輸聲道及該第一 φ 參數集而不使用該第二參數集來重建該Κ個輸出聲 道。 4 .如申請專利範圍第2或3項之裝置,其中該第一參 數集包含相關語法版本資訊(1 〇3至105),而且 其中將該讀取手段(2 8)設計成讀取該相關語法 版本之資訊並驅動該重建手段(3 1 ),藉此,只有在 已讀取之語法版本資訊與該解碼裝置之指定語法版 本資訊爲相容時,即以該重建手段實施重建。 ® 5.如申請專利範圍第2項之裝置,其中該第二參數集 係包含相關之語法版本資訊(1 0 3至1 0 5 )’而且其中 - 將該讀取手段(2 8 )設計成:當該已讀取之語法版本 ^ 資訊與該解碼裝置之指定語法版本資訊不相容時’ 則略過第二參數集;且當該已5買取之語法版本資曰只 與該指定之語法版本資訊相容時’則讀入該第二參 數集。 6 .如申請專利範圍第2項之裝置’其中該第一參數集 -2-1281356 No. 94 1 45 265 "Implementation apparatus and method for encoded multi-channel signals, multi-acoustic coded; decoding device and method for channel signal and recording medium" Patent case (amended in January 2006) X. Patent application scope: 1. A device for generating a multi-channel signal, the signal representing a non-coded multi-channel signal comprising N original channels, wherein N is equal to or greater than 2, the device comprises: providing means (22 Providing parameter information (24a, 24b, 24c) from one of the transmission channels (23) for reconstructing one of the output channels 'where Μ is equal to or greater than 1 and equal to or less than n, where K is greater than Μ and Equal to or less than Ν, wherein the parameter information includes at least two distinct parameter sets for reconstructing one and the same output channel; and a writing means (2 5 ) for writing a data stream (2 6 ), The writing means (25) is designed to: write the first and second parameter sets into the data stream, so that it is not necessary to use the second parameter set and at least one of the one transmission channels (23) Channel, you can complete the κ output channel Road reconstruction at least soon. 2 . A decoding device for encoding a multi-channel signal, the signal representing a non-coded multi-channel signal comprising a plurality of original channels, wherein the encoded multi-channel signal is represented by a data stream containing parameter information, The parameter information is from one transmission channel and is used to reconstruct the inverse output channel, where Μ is equal to or greater than 1 and equal to or less than Ν, where Κ is greater than Μ and equal to or less than ν, wherein the parameter information is 1281356 - Include at least two different parameter sets for reconstructing one and the same output channel, and wherein the first and second parameter sets are written into the data stream, thereby completing reconstruction of the K output channels without Using the second set of parameters, the apparatus includes: a data stream reading means (28) for reading the data stream, thereby reading the first parameter set (30a) and skipping the second parameter set (30b). • 3. The device of claim 2, further comprising: a reconstruction means (32) for reconstructing the 使用 using the one transmission channel and the first φ parameter set without using the second parameter set Output channels. 4. The apparatus of claim 2, wherein the first parameter set includes related grammar version information (1 〇 3 to 105), and wherein the reading means (28) is designed to read the correlation The information of the grammar version drives the reconstruction means (3 1 ), whereby the reconstruction is performed by the reconstruction means only when the read grammar version information is compatible with the specified grammar version information of the decoding device. ® 5. The device of claim 2, wherein the second parameter set contains relevant grammar version information (1 0 3 to 1 0 5 ) 'and wherein - the reading means (28) is designed to : when the read grammar version ^ information is incompatible with the specified grammar version information of the decoding device', then the second parameter set is skipped; and when the grammatical version of the 5 pirated version is only associated with the specified grammar When the version information is compatible, the second parameter set is read. 6. The device of claim 2, wherein the first parameter set -2- 1281356 係包含指出該相關第二參數集之資料量的長度資訊 ,且其中將讀取手段(2 8)設計成:根據該長度資訊 ,略過由長度資訊所指出之資料集中的資料量,不 必解析該第二參數集之資料。 7.如申請專利範圍第2項之裝置,其中該讀取手段(28) 係可被控制以取得資源可用資訊,以及1281356 includes length information indicating the amount of data of the related second parameter set, and wherein the reading means (28) is designed to: according to the length information, skip the amount of data in the data set indicated by the length information, without Parse the data of the second parameter set. 7. The apparatus of claim 2, wherein the reading means (28) is controllable to obtain resource available information, and 其中更將該讀取手段(2 8)設計成:當該資源可 用資訊指出充足資源時,讀入該第二參數集;且當 資源可用資訊指出不足資訊時,則略過該第二參數 集。 8 .如申請專利範圍第2項之裝置,其中相對於一已重 建多聲道信號之品質,一參數集係比K個輸出聲道 重建中之另一參數集較不重要,且其中將該資料流 讀取手段(2 8)設計成略過較不重要之資料集。 9 .如申請專利範圍第2項之裝置,其中該資料流係包 含具有一相關識別碼(100至105)的參數集,其中一 參數集之識別碼係指出該參數集絕對必須被使用於 重建,或其中另一參數集之識別碼係指出該參數集 僅可被選擇性地使用於重建,其中,將該資料流讀 取手段(2 8)設計成:檢測該識別碼及讀取該必需之 參數集,且根據該已檢測之識別碼來略過選用性之 參數集。 1 〇 .如申請專利範圍第2項之裝置,其中該資料流係包 含第一參數集部(12 a)中之第一參數集及第二參數 1281356 年月、修(更)正替換頁i 集部(12b)中之第二參數集,其中將該資料流讀取手 段設計成:解釋相對於該參數集部之該資料流,讀 入該第一參數集部,並且略過該第二參數集部。 1 1 .如申請專利範圍第2項之裝置,其中參數集係選自 包含下列的群組:交互聲道位準差、交互聲道時間 差、交互聲道相位差或交互聲道一致性資訊,其中 ,在該資料流中,將交互聲道位準差參數集標示爲 絕對需要用於解碼,且其中將該群組中至少一其它 參數集標示爲選擇性地用於解碼,且其中將該資料 流讀取手段(2 8)設計成:讀入該交互聲道位準差參 數集並略過來自該群組當中的另一參數集。 1 2 .如申請專利範圍第2項之裝置,其中該資料流係包 含指出許多選用性之參數集的數量資訊(1 02),而不 需要以該解碼器才可完成之該K個輸出聲道的重建 ,其中將該資料流讀取手段設計成:根據該數量資 訊,來讀入至少一選用參數集。 1 3 .如申請專利範圍第2項之裝置,其中在該資料流中 有相關之語法版本資訊,其用於該第二參數集,適 當的話更可用於選用參數集。 1 4 .如申請專利範圍第1或2項之裝置,其中該資料流 中一連串參數集中之最後選用參數集係不包含任何 相關之長度資訊,其中將該資料流讀取手段(2 8)設 計成:在讀入該最後選用參數集之前,不讀取及解 釋任何長度資訊。 -4- --Η 1281356 1 5 ·如申請專利範圍第2項之裝置,其中在該資料流中 動態地以信號表示參數集長度資訊之存在及長度, 且其中將該資料流讀取手段(2 8)設計成:首先在該 資料流中檢測參數集長度資訊之存在,然後根據一 已檢測到之存在,從該資料流中抽取該參數集長度 資訊之長度。 1 6 .如申請專利範圍第3項之裝置,其中該Μ個輸出聲 道係B C C下行混合聲道,且參數集係包含B C C參數 ,且其中將重建手段(3 2)設計成實施BCC合成。 1 7 . —種已編碼多聲道信號的產生方法,該信號代表包 含Ν個原始聲道之未編碼多聲道信號,其中Ν等於 或大於2,該方法包含: 提供(22)來自 Μ個傳輸聲道(23)的參數資料 (24a,24b,24〇,用於重建Κ個輸出聲道,其中Μ 等於或大於1且等於或小於Ν,其中Κ大於Μ且等 於或小於Ν,其中該參數資訊包含至少二個相異參 數集,用於重建一個和相同輸出聲道;以及 藉由將該第一及第二參數集寫入該資料流中’ 來寫入(2 5)—資料流(26),因此不必使用該第二參數 集與使用Μ個傳輸聲道(23)中至少一個聲道,就可 完成該Κ個輸出聲道中至少一個聲道之重建。 1 8 . —種已編碼多聲道信號的解碼方法,該信號代表包 含Ν個原始聲道之未編碼多聲道信號,其中,以包 含參數資訊之資料流來代表已編碼之多聲道信號’ -5- 27 1281356 而該參數資訊係來自Μ個傳輸聲道且用以重建K個 輸出聲道,其中Μ等於或大於1且等於或小於Ν, 其中Κ大於Μ且等於或小於Ν,其中該參數資訊係 包含至少兩個相異參數集,用於重建一個及相同輸 出聲道,且其中將該第一和第二參數集寫入該資料 流中,藉以完成該Κ個輸出聲道之重建而不必使用 該第二參數集,該方法包含: 讀取(2 8)該資料流,藉以讀入該第一參數集(30a) ,並略過該第二參數集(30b)。 1 9 . 一種儲存電腦程式之電腦記錄媒體,該電腦程式在 電腦上執行時,可用於實施如申請專利範圍第1 7項 或1 8項之方法之程式碼。The reading means (28) is further configured to: when the resource available information indicates sufficient resources, read in the second parameter set; and when the resource available information indicates insufficient information, skip the second parameter set . 8. The apparatus of claim 2, wherein a parameter set is less important than another parameter set of K output channel reconstructions relative to the quality of a reconstructed multi-channel signal, and wherein The data stream reading means (28) is designed to skip the less important data set. 9. The device of claim 2, wherein the data stream comprises a parameter set having a correlation identifier (100 to 105), wherein an identification code of a parameter set indicates that the parameter set must absolutely be used for reconstruction Or the identification code of another parameter set indicates that the parameter set can only be selectively used for reconstruction, wherein the data stream reading means (28) is designed to: detect the identification code and read the necessary a set of parameters, and the optional parameter set is skipped based on the detected identification code. 1 . The device of claim 2, wherein the data flow system comprises a first parameter set in the first parameter set portion (12 a) and a second parameter 1281356 month, repair (more) positive replacement page i a second parameter set in the set (12b), wherein the data stream reading means is designed to: interpret the data stream relative to the parameter set portion, read the first parameter set portion, and skip the second Parameter set section. 1 1. The device of claim 2, wherein the parameter set is selected from the group consisting of: an interactive channel bit difference, an interactive channel time difference, an interactive channel phase difference, or an interactive channel consistency information, Wherein, in the data stream, the set of interactive channel level deviation parameters is marked as absolutely necessary for decoding, and wherein at least one other parameter set in the group is marked as selectively used for decoding, and wherein The data stream reading means (28) is designed to read in the set of interactive channel level deviation parameters and skip the other parameter set from the group. 1 2 . The device of claim 2, wherein the data flow includes quantity information (1 02) indicating a plurality of optional parameter sets, and the K output sounds that can be completed by the decoder are not required. The reconstruction of the track, wherein the data stream reading means is designed to read at least one selected parameter set according to the quantity information. 1 3. The apparatus of claim 2, wherein the grammar version information is used in the data stream for the second parameter set, and if appropriate, the parameter set is used. 1 4. The device of claim 1 or 2, wherein the last selected parameter set in a series of parameter sets in the data stream does not contain any relevant length information, wherein the data stream reading means (28) is designed Cheng: Do not read and interpret any length information until the last selected parameter set is read. -4- -- Η 1281356 1 5 · The device of claim 2, wherein the data stream dynamically signals the existence and length of the parameter set length information, and wherein the data stream reading means 2 8) Designed to: first detect the existence of the parameter set length information in the data stream, and then extract the length of the parameter set length information from the data stream according to a detected existence. 16. The device of claim 3, wherein the one output channel is a B C C downstream mixing channel, and the parameter set comprises a B C C parameter, and wherein the reconstruction means (32) is designed to perform BCC synthesis. 1 7 - A method for generating an encoded multi-channel signal, the signal representing an uncoded multi-channel signal comprising one of the original channels, wherein Ν is equal to or greater than 2, the method comprising: providing (22) from The parameter data (24a, 24b, 24〇) of the transmission channel (23) is used to reconstruct one output channel, where Μ is equal to or greater than 1 and equal to or less than Ν, where Κ is greater than Μ and equal to or less than Ν, where The parameter information includes at least two distinct parameter sets for reconstructing one and the same output channel; and writing (2 5) - data stream by writing the first and second parameter sets into the data stream (26), so that it is not necessary to use the second parameter set and use at least one of the one transmission channels (23) to complete the reconstruction of at least one of the one of the output channels. A decoding method of a coded multi-channel signal representing an uncoded multi-channel signal comprising a plurality of original channels, wherein the encoded multi-channel signal is represented by a data stream containing parameter information ' -5 - 27 1281356 and the parameter information is from one transmission Channel and used to reconstruct K output channels, where Μ is equal to or greater than 1 and equal to or less than Ν, where Κ is greater than Μ and equal to or less than Ν, wherein the parameter information includes at least two distinct parameter sets for Reconstructing one and the same output channel, and wherein the first and second parameter sets are written into the data stream, thereby completing the reconstruction of the one output channel without using the second parameter set, the method comprising: reading Taking the data stream (2 8) to read the first parameter set (30a) and skipping the second parameter set (30b). 1 9. A computer recording medium for storing a computer program, the computer program is on the computer When executed, it can be used to implement the code of the method of claim 17 or 18. -6- 1281356^^一、圖式·· 957X1727 — 年月曰修(更)正替換頁 10 11 12a 12b 12c 控制區 M個麵聲道 第一 第二 第i個 參數集 參數集 參數集 圖 Aml-6- 1281356^^一,图·· 957X1727 — Year of the month repair (more) replacement page 10 11 12a 12b 12c Control area M face channels first and second i-th parameter set parameter set parameter set diagram Aml 00, 01, 02 03 04, 05, 97 98 BCCDaials Bcsataol nu 彐 BCCDataopt oPSGCDatapreses numDDCGD3t3Mand numBccDataMand: optBccDataPresent: numBccDataOpt: BccDatalDi: Lengthlnfo optional i; BCCDatao Lengthlnfo optional 1 Lengthlnfo optional 2 Lengsnfo oPHOnal 3 號 編度 本長 ? 版元 集及位 數數數式之 個參個型集 集用集之數 數選數I參 參有參集個 要否用數i 必是選參第 第IB圖 *'· Cfy^f 1281356 20、 編碼器 N聲道音頻 (N>2) 22 25 夕Μ個傳輸聲道 提供手段 第一參數集(必要的)t 用以將比例化參 數集寫入資料流 (第二參數集(選用的)〔 第三參數集(選用的、 24a 24b 24c 26、 具比例化參數集 之資料流 第2A圖00, 01, 02 03 04, 05, 97 98 BCCDaials Bcsataol nu 彐BCCDataopt oPSGCDatapreses numDDCGD3t3Mand numBccDataMand: optBccDataPresent: numBccDataOpt: BccDatalDi: Lengthlnfo optional i; BCCDatao Lengthlnfo optional 1 Lengthlnfo optional 2 Lengsnfo oPHOnal No. 3 editorial length? And the number of digits of the number of types of sets of sets of sets of numbers, the number of choices, the number of parameters, the reference number, the number of items, the number of items, i must be selected, the first IB figure *'· Cfy^f 1281356 20, the encoder N-channel audio (N>2) 22 25 传输 传输 a transmission channel provides means for the first parameter set (required) t to write the proportional parameter set to the data stream (second parameter set (optional) [ Three-parameter set (optional, 24a 24b 24c 26, data flow with scaled parameter set, Figure 2A) 資源可用性 資訊 第2B圖 1281356 年月日修(更)正替換頁丨 選用參數集之存在與個數 numBccData = numBccDataMand; if (optBccDataPresent) { 1 uimsbf numBccData += numBccDataOptMl 3 uimsbf + 1: OptChunklnfoQ;Resource availability Information Figure 2B Figure 1281356 Repair (more) replacement page 选用 Select the existence and number of parameter sets numBccData = numBccDataMand; if (optBccDataPresent) { 1 uimsbf numBccData += numBccDataOptMl 3 uimsbf + 1: OptChunklnfoQ; 第3A圖 參數集識別碼 BccDatalD Symbolic name 0 (reserved) 1 ICLD V1 2 ICTD V1 3 ICC V1 4 ICLDJ/2 > =5 Undefined (reserved) V1.V2: 語法 版本資訊Figure 3A Parameter Set Identification Code BccDatalD Symbolic name 0 (reserved) 1 ICLD V1 2 ICTD V1 3 ICC V1 4 ICLDJ/2 > =5 Undefined (reserved) V1.V2: Syntax Version Information ICLD:相互聲道位準差 ICTD:相互聲道時間差 ICC:相互聲道一致性資訊 第3B圖 1281356ICLD: mutual channel level difference ICTD: mutual channel time difference ICC: mutual channel consistency information 3B Figure 1281356 選用參數集之識別 OptChunklnfo() { for (bccData = numBccDataMand; bccData< numBccData; bccData++}{Use parameter set identification OptChunklnfo() { for (bccData = numBccDataMand; bccData<numBccData; bccData++}{ bccDataOptional [bccData] = 1; bccDatalD[bccData]; 6 uimsbf GetOptChunkLenQ; 第3C圖 選用參數集之長度資訊 OptChunkLen() { if (bccData != numBccData -1) { if (bccDataLenBitsPresent[bccData]) { 1 bccDataLenBits[bccData] = 4 uimsbf bccDataLenBitsMI [bccData] + 1; } } else { bccDataLenBitsPresent[bccData] = 0; 第3D圖 1281356 祖,丄f ? 年月日修(更试樣換:㈣1 參數集讀取迴圈 for (bccData = 0; bccData < numBccDaia; bccData十+) { ChunkLenQ; BccDataQ;bccDataOptional [bccData] = 1; bccDatalD[bccData]; 6 uimsbf GetOptChunkLenQ; 3C map selects the length of the parameter set OptChunkLen() { if (bccData != numBccData -1) { if (bccDataLenBitsPresent[bccData]) { 1 bccDataLenBits[ bccData] = 4 uimsbf bccDataLenBitsMI [bccData] + 1; } } else { bccDataLenBitsPresent[bccData] = 0; 3D Figure 1281356 ancestor, 丄f ? Year and month repair (more sample change: (four) 1 parameter set read loop for (bccData = 0; bccData <numBccDaia; bccData ten +) { ChunkLenQ; BccDataQ; 第4A圖 長度資訊讀取 ChunkLenQ { if( bccDataLenBitsPresent[bccData]) {Figure 4A Length information read ChunkLenQ { if( bccDataLenBitsPresent[bccData]) { bccDataLen[bccData] bccDataLenBits uimsbf 第4B圖 1281356 95^ ll 27 — 年月曰修(更j正替換頁 參數集切換 BccData() { switch (bccDatalD[BccData]){bccDataLen[bccData] bccDataLenBits uimsbf Figure 4B 1281356 95^ ll 27 — Year and month repair (more j is replacing page parameter set switching BccData() { switch (bccDatalD[BccData]){ case ICLD—V1: lcldVIData(numBccBlocks); break; case ICTD一V1: IctdVIDatai); break; case ICCJ/1: lccV1Data(); break; case ICLD_V2: lcldV2Data(); break; default: if (bccDataOptional[BccData]){ if (bccDataLenBitsPresent[bccData]) Γ skip and continue 7; else Γ stop parsing, start output 7 elseT exit 7;Case ICLD—V1: lcldVIData(numBccBlocks); break; case ICTD-V1: IctdVIDatai); break; case ICCJ/1: lccV1Data(); break; case ICLD_V2: lcldV2Data(); break; default: if (bccDataOptional[BccData] ) { if (bccDataLenBitsPresent[bccData]) Γ skip and continue 7; else Γ stop parsing, start output 7 elseT exit 7; 第4C圖 1281356 95.11 27 年月曰修(更;正聲換頁i4C Figure 1281356 95.11 27 years of repair (more; 60 CH1 -^ CH2 IS或BCC裝置 CHN 載波聲道 載波聲道之參數資料60 CH1 -^ CH2 IS or BCC device CHN carrier channel Parameter data of carrier channel 第5圖 •1281356 95,11 27 ,·Figure 5 • 1281356 95, 11 27 , · JJ 第6圖Figure 6 127127 第7圖 1281356 95.11 27 "…....... “二' Ji 中央Figure 7 1281356 95.11 27 "........ "Two ' Ji Central 第8圖Figure 8
TW94145265A 2004-09-03 2005-12-20 Device and method for generate a coded multi-channels signal and device and method for decode a coded multi-channels signal and recordable medium TWI281356B (en)

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
DE10404281 2004-09-03

Publications (1)

Publication Number Publication Date
TWI281356B true TWI281356B (en) 2007-05-11

Family

ID=38741712

Family Applications (1)

Application Number Title Priority Date Filing Date
TW94145265A TWI281356B (en) 2004-09-03 2005-12-20 Device and method for generate a coded multi-channels signal and device and method for decode a coded multi-channels signal and recordable medium

Country Status (1)

Country Link
TW (1) TWI281356B (en)

Similar Documents

Publication Publication Date Title
US11343631B2 (en) Compatible multi-channel coding/decoding
JP4519919B2 (en) Multi-channel hierarchical audio coding using compact side information
AU2005281937B2 (en) Generation of a multichannel encoded signal and decoding of a multichannel encoded signal
US8296155B2 (en) Method and apparatus for decoding a signal
JP5883561B2 (en) Speech encoder using upmix
CN101553865B (en) A method and an apparatus for processing an audio signal
CN101479785B (en) Method for encoding and decoding object-based audio signal and apparatus thereof
JP4601669B2 (en) Apparatus and method for generating a multi-channel signal or parameter data set
KR101227932B1 (en) System for multi channel multi track audio and audio processing method thereof
Breebaart et al. Multi-channel goes mobile: MPEG Surround binaural rendering
TW200921642A (en) Methods and apparatuses for encoding and decoding object-based audio signals
CN104756186A (en) Decoder and method for multi-instance spatial-audio-object-coding employing a parametric concept for multichannel downmix/upmix cases
TWI281356B (en) Device and method for generate a coded multi-channels signal and device and method for decode a coded multi-channels signal and recordable medium
CN1934640B (en) Device and method for writing on an audio CD, and audio CD
KR20080049747A (en) Apparatus for encoding and decoding audio signal and method thereof
Grill et al. Closing the gap between the multi-channel and the stereo audio world: Recent MP3 surround extensions
Plogsties et al. MPEG Sorround binaural rendering-Sorround sound for mobile devices (Binaurale Wiedergabe mit MPEG Sorround-Sorround sound fuer mobile Geraete)