CN101385077B - Apparatus and method for encoding/decoding signal - Google Patents

Publication number
CN101385077B
Authority
CN
China
Application number
CN 200780004527
Other languages
Chinese (zh)
Other versions
CN101385077A (en)
Inventor
吴贤午
房熙锡
林宰显
郑亮源
金东秀
Original Assignee
Lg电子株式会社
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Priority to US 60/765,747
Priority to US 60/771,471
Priority to US 60/773,337
Priority to US 60/775,775
Priority to US 60/781,750
Priority to US 60/782,519
Priority to US 60/792,329
Priority to US 60/793,653
Application filed by Lg电子株式会社
Priority to PCT/KR2007/000672 (WO2007091845A1)
Publication of CN101385077A
Application granted
Publication of CN101385077B

Abstract

An encoding method and apparatus and a decoding method and apparatus are provided. The decoding method includes extracting a three-dimensional (3D) down-mix signal and spatial information from an input bitstream, removing 3D effects from the 3D down-mix signal by performing a 3D rendering operation on the 3D down-mix signal, and generating a multi-channel signal using the spatial information and a down-mix signal obtained by the removal. Accordingly, it is possible to efficiently encode multi-channel signals with 3D effects and to adaptively restore and reproduce audio signals with optimum sound quality according to the characteristics of a reproduction environment.

Description

Apparatus and method for encoding/decoding a signal

Technical Field

[0001] The present invention relates to an encoding/decoding method and an encoding/decoding apparatus, and more particularly to an encoding/decoding apparatus that can process an audio signal so that three-dimensional (3D) sound effects can be created, and to an encoding/decoding method using the encoding/decoding apparatus.

Background Art

[0002] An encoding apparatus down-mixes a multi-channel signal into a signal with fewer channels and transmits the down-mixed signal to a decoding apparatus. The decoding apparatus then restores the multi-channel signal from the down-mixed signal and reproduces the restored multi-channel signal using three or more speakers, for example 5.1-channel speakers.

[0003] Multi-channel signals may also be reproduced by 2-channel speakers such as headphones. In this case, in order for a user to perceive the sound output by the 2-channel speakers as if it were reproduced from three or more sound sources, it is necessary to develop three-dimensional (3D) processing techniques capable of encoding or decoding multi-channel signals so that 3D effects can be created.

Summary of the Invention

[0004] Technical Problem

[0005] The present invention provides an encoding/decoding apparatus and an encoding/decoding method that can reproduce multi-channel signals in various reproduction environments by efficiently processing signals with 3D effects.

[0006] Technical Solution

[0007] According to an aspect of the present invention, there is provided a decoding method of decoding an audio signal, the decoding method including: extracting a three-dimensional (3D) down-mix signal from an input bitstream; generating a down-mix signal from which 3D effects are removed by performing a 3D rendering operation on the extracted 3D down-mix signal; and generating a 3D down-mix signal with 3D effects by performing a 3D rendering operation on the generated down-mix signal.

[0008] According to another aspect of the present invention, there is provided a decoding apparatus for decoding an audio signal, the decoding apparatus including: a bit unpacking unit which extracts a 3D down-mix signal from an input bitstream; a first 3D rendering unit which generates a down-mix signal from which 3D effects are removed by performing a 3D rendering operation on the extracted 3D down-mix signal; and a second 3D rendering unit which generates a 3D down-mix signal with 3D effects by performing a 3D rendering operation on the down-mix signal generated by the first 3D rendering unit.

[0009] According to another aspect of the present invention, there is provided a computer-readable recording medium having recorded thereon a computer program for executing the above decoding method.

[0010] Advantageous Effects

[0011] According to the present invention, it is possible to efficiently encode multi-channel signals with 3D effects and to adaptively restore and reproduce audio signals with optimum sound quality according to the characteristics of a reproduction environment.

[0012] Brief Description of the Drawings

[0013] FIG. 1 is a block diagram of an encoding/decoding apparatus according to an embodiment of the present invention;

[0014] FIG. 2 is a block diagram of an encoding apparatus according to an embodiment of the present invention;

[0015] FIG. 3 is a block diagram of a decoding apparatus according to an embodiment of the present invention;

[0016] FIG. 4 is a block diagram of an encoding apparatus according to another embodiment of the present invention;

[0017] FIG. 5 is a block diagram of a decoding apparatus according to another embodiment of the present invention;

[0018] FIG. 6 is a block diagram of a decoding apparatus according to another embodiment of the present invention;

[0019] FIG. 7 is a block diagram of a three-dimensional (3D) rendering apparatus according to an embodiment of the present invention;

[0020] FIGS. 8 through 11 illustrate bitstreams according to embodiments of the present invention;

[0021] FIG. 12 is a block diagram of an encoding/decoding apparatus for processing an arbitrary down-mix signal according to an embodiment of the present invention;

[0022] FIG. 13 is a block diagram of an arbitrary down-mix signal compensation/3D rendering unit according to an embodiment of the present invention;

[0023] FIG. 14 is a block diagram of a decoding apparatus for processing a compatible down-mix signal according to an embodiment of the present invention;

[0024] FIG. 15 is a block diagram of a down-mix compatibility processing/3D rendering unit according to an embodiment of the present invention; and

[0025] FIG. 16 is a block diagram of a decoding apparatus for canceling crosstalk according to an embodiment of the present invention.

[0026] Best Mode for Carrying Out the Invention

[0027] The present invention will hereinafter be described more fully with reference to the accompanying drawings, in which exemplary embodiments of the invention are shown. FIG. 1 is a block diagram of an encoding/decoding apparatus according to an embodiment of the present invention. Referring to FIG. 1, an encoding unit 100 includes a multi-channel encoder 110, a three-dimensional (3D) rendering unit 120, a down-mix encoder 130, and a bit packing unit 140.

[0028] The multi-channel encoder 110 down-mixes a multi-channel signal with a plurality of channels into a down-mix signal such as a stereo or mono signal, and generates spatial information regarding the channels of the multi-channel signal. The spatial information is needed to restore the multi-channel signal from the down-mix signal.
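The down-mixing step can be illustrated with a minimal Python sketch. The channel names, the 1/sqrt(2) gains, and the omission of the low-frequency channel are assumptions made for illustration only; the description does not prescribe any particular down-mix matrix.

```python
import math

# Hypothetical gain (an assumption, not from the description): centre and
# surround channels are attenuated by 1/sqrt(2) before being mixed in.
G = 1.0 / math.sqrt(2.0)

def downmix_to_stereo(frame):
    """Down-mix one multi-channel sample frame to a stereo pair.

    `frame` maps the channel names L, R, C, Ls, Rs to sample values.
    """
    left = frame["L"] + G * frame["C"] + G * frame["Ls"]
    right = frame["R"] + G * frame["C"] + G * frame["Rs"]
    return left, right

def downmix_to_mono(frame):
    """Down-mix the stereo pair further to a single channel."""
    left, right = downmix_to_stereo(frame)
    return 0.5 * (left + right)
```

A real encoder would apply this to every sample (or sub-band sample) and would also compute the spatial information needed to undo the down-mix.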

[0029] Examples of the spatial information include a channel level difference (CLD), which indicates the difference between the energy levels of a pair of channels; a channel prediction coefficient (CPC), which is a prediction coefficient used to generate a 3-channel signal from a 2-channel signal; an inter-channel correlation (ICC), which indicates the correlation between a pair of channels; and a channel time difference (CTD), which is the time interval between a pair of channels.
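The following Python sketch shows one plausible way to compute three of these parameters for a pair of channels. The exact formulas are assumptions: the CLD is expressed here as an energy ratio in decibels, the ICC as a normalised cross-correlation at lag zero, and the CTD as the lag (in samples) that maximises the cross-correlation. The description itself only defines the parameters in words.

```python
import math

def channel_level_difference(x, y, eps=1e-12):
    """CLD in dB: energy of channel x relative to channel y."""
    ex = sum(s * s for s in x)
    ey = sum(s * s for s in y)
    return 10.0 * math.log10((ex + eps) / (ey + eps))

def inter_channel_correlation(x, y, eps=1e-12):
    """ICC: normalised cross-correlation of x and y at lag zero."""
    ex = sum(s * s for s in x)
    ey = sum(s * s for s in y)
    cross = sum(a * b for a, b in zip(x, y))
    return cross / math.sqrt(ex * ey + eps)

def channel_time_difference(x, y, max_lag):
    """CTD in samples: lag of y relative to x with the largest correlation."""
    def corr(lag):
        return sum(x[n] * y[n + lag]
                   for n in range(len(x)) if 0 <= n + lag < len(y))
    return max(range(-max_lag, max_lag + 1), key=corr)
```

For example, if `y` is a copy of `x` delayed by one sample and doubled in amplitude, the CLD is about -6 dB and the CTD is 1.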

[0030] The 3D rendering unit 120 generates a 3D down-mix signal based on the down-mix signal. The 3D down-mix signal may be a 2-channel signal with three or more directivities, and can thus be reproduced with 3D effects by 2-channel speakers such as headphones. In other words, the 3D down-mix signal may be reproduced by 2-channel speakers so that a user perceives it as if it were reproduced from a sound source with three or more channels. The direction of a sound source may be determined based on at least one of the difference between the intensities of the two sounds respectively input to both ears, the time interval between the two sounds, and the difference between the phases of the two sounds. Therefore, the 3D rendering unit 120 can convert the down-mix signal into the 3D down-mix signal based on how humans determine the 3D location of a sound source with their sense of hearing.

[0031] The 3D rendering unit 120 may generate the 3D down-mix signal by filtering the down-mix signal with a filter. In this case, filter-related information, for example filter coefficients, may be input to the 3D rendering unit 120 by an external source. The 3D rendering unit 120 may use the spatial information provided by the multi-channel encoder 110 to generate the 3D down-mix signal based on the down-mix signal. More specifically, the 3D rendering unit 120 may convert the down-mix signal into the 3D down-mix signal by using the spatial information to convert the down-mix signal into an imaginary multi-channel signal and then filtering the imaginary multi-channel signal.

[0032] The 3D rendering unit 120 may generate the 3D down-mix signal by filtering the down-mix signal with a head-related transfer function (HRTF) filter.

[0033] An HRTF is a transfer function that describes the transmission of sound waves between a sound source at an arbitrary location and the eardrum, and returns a value that varies according to the direction and altitude of the sound source. If a signal with no directivity is filtered using the HRTF, the signal is heard as if it were reproduced from a certain direction.
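As a rough illustration of HRTF-based filtering, the Python sketch below convolves a signal with no directivity with a pair of impulse responses, one per ear. The impulse responses are made-up toy values, not measured HRTFs: they merely give the right ear a delayed, quieter copy of the sound, which mimics the intensity and time cues for a source on the listener's left.

```python
def convolve(signal, impulse_response):
    """Plain time-domain convolution (full length)."""
    out = [0.0] * (len(signal) + len(impulse_response) - 1)
    for n, s in enumerate(signal):
        for k, h in enumerate(impulse_response):
            out[n + k] += s * h
    return out

def render_3d(mono, hrir_left, hrir_right):
    """Filter a non-directional signal into a 2-channel signal."""
    return convolve(mono, hrir_left), convolve(mono, hrir_right)

# Toy impulse responses (hypothetical): the left ear hears the sound at once,
# the right ear hears it two samples later at half the amplitude.
HRIR_LEFT = [1.0, 0.0, 0.0]
HRIR_RIGHT = [0.0, 0.0, 0.5]
```

Calling `render_3d([1.0, 2.0], HRIR_LEFT, HRIR_RIGHT)` yields a left channel that starts immediately and a right channel that is delayed and attenuated.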

[0034] The 3D rendering unit 120 may perform a 3D rendering operation in a frequency domain, for example a discrete Fourier transform (DFT) domain or a fast Fourier transform (FFT) domain. In this case, the 3D rendering unit 120 may perform DFT or FFT before the 3D rendering operation, or may perform inverse DFT (IDFT) or inverse FFT (IFFT) after the 3D rendering operation.
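A frequency-domain rendering path can be sketched in Python as follows: transform, multiply bin by bin, and transform back. The naive O(N^2) DFT and the zero-padding strategy are illustrative choices, not something the description specifies; a practical implementation would use an FFT and block-wise processing.

```python
import cmath

def dft(x):
    """Naive discrete Fourier transform."""
    n = len(x)
    return [sum(x[t] * cmath.exp(-2j * cmath.pi * k * t / n) for t in range(n))
            for k in range(n)]

def idft(spectrum):
    """Naive inverse discrete Fourier transform."""
    n = len(spectrum)
    return [sum(spectrum[k] * cmath.exp(2j * cmath.pi * k * t / n)
                for k in range(n)) / n
            for t in range(n)]

def render_in_dft_domain(signal, impulse_response):
    """Filter by bin-wise multiplication in the DFT domain."""
    # Zero-pad both inputs so circular convolution equals linear convolution.
    n = len(signal) + len(impulse_response) - 1
    s = dft(list(signal) + [0.0] * (n - len(signal)))
    h = dft(list(impulse_response) + [0.0] * (n - len(impulse_response)))
    y = idft([a * b for a, b in zip(s, h)])
    return [v.real for v in y]
```

Up to rounding error, the result matches direct time-domain convolution of the same inputs.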

[0035] The 3D rendering unit 120 may perform a 3D rendering operation in a quadrature mirror filter (QMF)/hybrid domain. In this case, the 3D rendering unit 120 may perform QMF/hybrid analysis and synthesis operations before or after the 3D rendering operation.

[0036] The 3D rendering unit 120 may perform a 3D rendering operation in a time domain. The 3D rendering unit 120 may determine in which domain the 3D rendering operation is to be performed according to the required sound quality and the operational capacity of the encoding/decoding apparatus.

[0037] The down-mix encoder 130 encodes the down-mix signal output by the multi-channel encoder 110 or the 3D down-mix signal output by the 3D rendering unit 120. The down-mix encoder 130 may do so using an audio encoding method such as advanced audio coding (AAC), MPEG layer 3 (MP3), or bit-sliced arithmetic coding (BSAC).

[0038] The down-mix encoder 130 may encode a non-3D down-mix signal or a 3D down-mix signal. In this case, both the encoded non-3D down-mix signal and the encoded 3D down-mix signal may be included in a bitstream to be transmitted.

[0039] The bit packing unit 140 generates a bitstream based on the spatial information and either the encoded non-3D down-mix signal or the encoded 3D down-mix signal.

[0040] The bitstream generated by the bit packing unit 140 may include the spatial information, down-mix identification information indicating whether the down-mix signal included in the bitstream is a non-3D down-mix signal or a 3D down-mix signal, and information identifying the filter used by the 3D rendering unit 120 (for example, HRTF coefficient information).

[0041] In other words, the bitstream generated by the bit packing unit 140 may include at least one of a non-3D down-mix signal that has not yet been 3D-processed and an encoder 3D down-mix signal obtained by a 3D processing operation performed by an encoding apparatus, together with down-mix identification information identifying the type of down-mix signal included in the bitstream.

[0042] Which of the non-3D down-mix signal and the encoder 3D down-mix signal is to be included in the bitstream generated by the bit packing unit 140 may be determined by the user's choice or according to the capabilities of the encoding/decoding apparatus illustrated in FIG. 1 and the characteristics of the reproduction environment.

[0043] The HRTF coefficient information may include the coefficients of an inverse function of the HRTF used by the 3D rendering unit 120. The HRTF coefficient information may include only brief information on the coefficients of the HRTF used by the 3D rendering unit 120, for example envelope information of the HRTF coefficients. If a bitstream including the coefficients of the inverse function of the HRTF is transmitted to a decoding apparatus, the decoding apparatus does not need to perform an HRTF coefficient conversion operation, so the amount of computation of the decoding apparatus can be reduced.
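To make the trade-off concrete, the Python sketch below assumes that the HRTF coefficients are complex per-bin frequency-domain gains, so that the inverse function is simply the per-bin reciprocal. This representation, the function names, and the `floor` guard are assumptions for illustration; the description does not fix the coefficient format or the inversion method.

```python
def invert_hrtf(coefficients, floor=1e-6):
    """Per-bin inverse of frequency-domain HRTF coefficients."""
    inverse = []
    for h in coefficients:
        if abs(h) < floor:
            # A near-zero bin cannot be inverted without amplifying noise.
            raise ValueError("HRTF bin too close to zero to invert")
        inverse.append(1.0 / h)
    return inverse

def apply_filter(spectrum, coefficients):
    """Bin-wise multiplication of a spectrum by filter coefficients."""
    return [s * h for s, h in zip(spectrum, coefficients)]
```

If the encoder transmits the output of `invert_hrtf` directly, the decoder only needs `apply_filter`; otherwise it must first run the conversion itself, which is the extra computation the paragraph above refers to.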

[0044] The bitstream generated by the bit packing unit 140 may also include information regarding an energy variation in a signal caused by HRTF-based filtering, that is, information regarding the difference between the energy of the signal to be filtered and the energy of the filtered signal, or the ratio of the energy of the signal to be filtered to the energy of the filtered signal.
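The two forms of energy information can be computed as in the Python sketch below. Using the ratio to rescale the filtered signal back to its original energy is a hypothetical use added for illustration; the description only says that the information may be carried in the bitstream.

```python
import math

def energy(signal):
    """Sum of squared samples."""
    return sum(s * s for s in signal)

def energy_change(before, after):
    """Return (difference, ratio) of energy before vs. after filtering."""
    e_before, e_after = energy(before), energy(after)
    ratio = e_before / e_after if e_after > 0.0 else None
    return e_before - e_after, ratio

def compensate(filtered, ratio):
    """Hypothetical use: rescale the filtered signal to its original energy."""
    gain = math.sqrt(ratio)
    return [gain * s for s in filtered]
```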

[0045] The bitstream generated by the bit packing unit 140 may also include information indicating whether it includes HRTF coefficients. If HRTF coefficients are included in the bitstream generated by the bit packing unit 140, the bitstream may also include information indicating whether it includes the coefficients of the HRTF used by the 3D rendering unit 120 or the coefficients of the inverse function of the HRTF.
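The signalling described above (down-mix identification information plus the HRTF-related indications) can be pictured as a handful of flags. The Python sketch below packs them into a single integer; the field names and the bit layout are entirely hypothetical, since the description does not define a syntax here.

```python
from dataclasses import dataclass

@dataclass(frozen=True)
class SideInfoFlags:
    """Hypothetical flag set; names and bit positions are illustrative only."""
    has_non_3d_downmix: bool       # bitstream carries a non-3D down-mix signal
    has_encoder_3d_downmix: bool   # bitstream carries an encoder 3D down-mix
    has_hrtf_coefficients: bool    # HRTF coefficient information is present
    hrtf_is_inverse: bool          # coefficients are those of the inverse HRTF

    def pack(self) -> int:
        """Pack the four flags into the low four bits of an integer."""
        return (self.has_non_3d_downmix << 0
                | self.has_encoder_3d_downmix << 1
                | self.has_hrtf_coefficients << 2
                | self.hrtf_is_inverse << 3)

    @classmethod
    def unpack(cls, value: int) -> "SideInfoFlags":
        """Recover the flags from a packed integer."""
        return cls(bool(value & 1), bool(value & 2),
                   bool(value & 4), bool(value & 8))
```

A decoder could read such flags first and then decide whether it has to invert the received coefficients before removing 3D effects.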

[0046] Referring to FIG. 1, a first decoding unit 200 includes a bit unpacking unit 210, a down-mix decoder 220, a 3D rendering unit 230, and a multi-channel decoder 240.

[0047] The bit unpacking unit 210 receives an input bitstream from the encoding unit 100 and extracts an encoded down-mix signal and spatial information from the input bitstream. The down-mix decoder 220 decodes the encoded down-mix signal. The down-mix decoder 220 may decode the encoded down-mix signal using an audio signal decoding method such as AAC, MP3, or BSAC.

[0048] As described above, the encoded down-mix signal extracted from the input bitstream may be an encoded non-3D down-mix signal or an encoded encoder 3D down-mix signal. Information indicating whether the encoded down-mix signal extracted from the input bitstream is an encoded non-3D down-mix signal or an encoded encoder 3D down-mix signal may be included in the input bitstream.

[0049] If the encoded down-mix signal extracted from the input bitstream is an encoder 3D down-mix signal, it can be readily reproduced after being decoded by the down-mix decoder 220.

[0050] On the other hand, if the encoded down-mix signal extracted from the input bitstream is a non-3D down-mix signal, it may be decoded by the down-mix decoder 220, and the down-mix signal obtained by the decoding may be converted into a decoder 3D down-mix signal by a 3D rendering operation performed by the third renderer 233. The decoder 3D down-mix signal can be readily reproduced.
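The two cases above amount to a simple branch on the down-mix identification information. In the Python sketch below, the function name, the boolean flag, and the `render_3d` callable are hypothetical stand-ins for the decoder's actual interfaces.

```python
def prepare_for_playback(downmix, is_encoder_3d, render_3d):
    """Return a signal ready for 2-channel playback.

    downmix       -- decoded down-mix signal (output of the down-mix decoder)
    is_encoder_3d -- True if the encoder already applied 3D rendering
    render_3d     -- callable performing decoder-side 3D rendering
    """
    if is_encoder_3d:
        # Encoder 3D down-mix signal: reproduce as is.
        return downmix
    # Non-3D down-mix signal: render at the decoder (role of the third renderer).
    return render_3d(downmix)
```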

[0051] The 3D rendering unit 230 includes a first renderer 231, a second renderer 232, and a third renderer 233. The first renderer 231 generates a down-mix signal by performing a 3D rendering operation on the encoder 3D down-mix signal provided by the down-mix decoder 220. For example, the first renderer 231 may generate a non-3D down-mix signal by removing 3D effects from the encoder 3D down-mix signal. The 3D effects of the encoder 3D down-mix signal may not be completely removed by the first renderer 231. In this case, the down-mix signal output by the first renderer 231 may still have 3D effects.

[0052] The first renderer 231 may convert the 3D down-mix signal provided by the down-mix decoder 220 into a down-mix signal from which 3D effects are removed, using an inverse filter of the filter used by the 3D rendering unit 120 of the encoding unit 100. Information regarding the filter used by the 3D rendering unit 120, or regarding the inverse filter of that filter, may be included in the input bitstream.

[0053] The filter used by the 3D rendering unit 120 may be an HRTF filter. In this case, the coefficients of the HRTF used by the encoding unit 100, or the coefficients of the inverse function of the HRTF, may also be included in the input bitstream. If the coefficients of the HRTF used by the encoding unit 100 are included in the input bitstream, the HRTF coefficients may be inversely converted, and the results of the inverse conversion may be used during the 3D rendering operation performed by the first renderer 231. If the coefficients of the inverse function of the HRTF used by the encoding unit 100 are included in the input bitstream, they can be readily used during the 3D rendering operation performed by the first renderer 231 without any inverse conversion operation. In this case, the amount of computation of the first decoding apparatus 200 can be reduced.

[0054] The input bitstream may also include filter information (for example, information indicating whether the coefficients of the HRTF used by the encoding unit 100 are included in the input bitstream) and information indicating whether the filter information has been inversely converted.

[0055] The multi-channel decoder 240 generates a 3D multi-channel signal with three or more channels based on the down-mix signal from which 3D effects are removed and the spatial information extracted from the input bitstream.

[0056] The second renderer 232 may generate a 3D down-mix signal with 3D effects by performing a 3D rendering operation on the down-mix signal from which 3D effects are removed. In other words, the first renderer 231 removes 3D effects from the encoder 3D down-mix signal provided by the down-mix decoder 220. Thereafter, the second renderer 232 may, using a filter of the first decoding apparatus 200, perform a 3D rendering operation on the down-mix signal obtained by the removal performed by the first renderer 231, thereby generating a combined 3D down-mix signal with the 3D effects desired by the first decoding apparatus 200.

[0057] The first decoding apparatus 200 may include a renderer in which two or more of the first, second, and third renderers 231, 232, and 233 that perform the same operations are integrated.

[0058] A bitstream generated by the encoding unit 100 may be input to a second decoding apparatus 300, which has a different structure from the first decoding apparatus 200. The second decoding apparatus 300 may generate a 3D down-mix signal based on a down-mix signal included in the bitstream input thereto.

[0059] More specifically, the second decoding apparatus 300 includes a bit unpacking unit 310, a down-mix decoder 320, and a 3D rendering unit 330. The bit unpacking unit 310 receives an input bitstream from the encoding unit 100 and extracts an encoded down-mix signal and spatial information from the input bitstream. The down-mix decoder 320 decodes the encoded down-mix signal. The 3D rendering unit 330 performs a 3D rendering operation on the decoded down-mix signal so that the decoded down-mix signal can be converted into a 3D down-mix signal.

[0060] FIG. 2 is a block diagram of an encoding apparatus according to an embodiment of the present invention. Referring to FIG. 2, the encoding apparatus includes rendering units 400 and 420 and a multi-channel encoder 410. Detailed descriptions of the encoding processes that are the same as in the embodiment of FIG. 1 are omitted.

[0061] Referring to FIG. 2, the 3D rendering units 400 and 420 may be disposed in front of and behind the multi-channel encoder 410, respectively. Thus, a multi-channel signal may be 3D-rendered by the 3D rendering unit 400, and the 3D-rendered multi-channel signal may then be encoded by the multi-channel encoder 410, thereby generating a pre-processed encoder 3D down-mix signal. Alternatively, the multi-channel signal may be down-mixed by the multi-channel encoder 410, and the down-mixed signal may then be 3D-rendered by the 3D rendering unit 420, thereby generating a post-processed encoder down-mix signal.

[0062] Information indicating whether the multi-channel signal has been 3D-rendered before or after being down-mixed may be included in a bitstream to be transmitted.

[0063] The 3D rendering units 400 and 420 may both be disposed in front of or behind the multi-channel encoder 410.

[0064] FIG. 3 is a block diagram of a decoding apparatus according to an embodiment of the present invention. Referring to FIG. 3, the decoding apparatus includes 3D rendering units 430 and 450 and a multi-channel decoder 440. Detailed descriptions of the decoding processes that are the same as in the embodiment of FIG. 1 are omitted.

[0065] Referring to FIG. 3, the 3D rendering units 430 and 450 may be disposed in front of and behind the multi-channel decoder 440, respectively. The 3D rendering unit 430 may remove 3D effects from an encoder 3D down-mix signal and input the down-mix signal obtained by the removal to the multi-channel decoder 440. The multi-channel decoder 440 may then decode the down-mix signal input thereto, thereby generating a pre-processed 3D multi-channel signal. Alternatively, the multi-channel decoder 440 may restore a multi-channel signal from an encoded 3D down-mix signal, and the 3D rendering unit 450 may remove 3D effects from the restored multi-channel signal, thereby generating a post-processed 3D multi-channel signal.

[0066] If an encoder 3D down-mix signal provided by an encoding apparatus has been generated by performing a 3D rendering operation followed by a down-mix operation, the encoder 3D down-mix signal may be decoded by performing a multi-channel decoding operation followed by a 3D rendering operation. On the other hand, if the encoder 3D down-mix signal has been generated by performing a down-mix operation followed by a 3D rendering operation, the encoder 3D down-mix signal may be decoded by performing a 3D rendering operation followed by a multi-channel decoding operation.
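The rule above is that the decoder undoes the encoder's steps in reverse order. A minimal Python sketch, with hypothetical step names and a hypothetical boolean standing in for the order information carried in the bitstream, could look like this:

```python
def encoding_order(rendered_before_downmix):
    """Order of the encoder-side operations."""
    if rendered_before_downmix:
        return ["3d_render", "downmix"]
    return ["downmix", "3d_render"]

def decoding_order(rendered_before_downmix):
    """Order of the decoder-side operations: the encoder order, mirrored."""
    if rendered_before_downmix:
        return ["multichannel_decode", "3d_render"]
    return ["3d_render", "multichannel_decode"]
```

Here `multichannel_decode` is the counterpart of `downmix`, so each decoder list is the corresponding encoder list reversed.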

[0067] Information indicating whether an encoded 3D down-mix signal was obtained by performing a 3D rendering operation before or after a down-mix operation may be extracted from a bitstream transmitted by an encoding apparatus.

[0068] The 3D rendering units 430 and 450 may both be disposed in front of or behind the multi-channel decoder 440.

[0069] FIG. 4 is a block diagram of an encoding apparatus according to another embodiment of the present invention. Referring to FIG. 4, the encoding apparatus includes a multi-channel encoder 500, a 3D rendering unit 510, a down-mix encoder 520, and a bit packing unit 530. Detailed descriptions of the encoding processes that are the same as in the embodiment of FIG. 1 are omitted.

[0070] Referring to FIG. 4, the multi-channel encoder 500 generates a down-mix signal and spatial information based on an input multi-channel signal. The 3D rendering unit 510 generates a 3D down-mix signal by performing a 3D rendering operation on the down-mix signal.

[0071] Whether to perform a 3D rendering operation on the down-mix signal may be determined by the user's choice or according to the capabilities of the encoding apparatus, the characteristics of the reproduction environment, or the required sound quality.

[0072] The down-mix encoder 520 encodes the down-mix signal generated by the multi-channel encoder 500 or the 3D down-mix signal generated by the 3D rendering unit 510.

[0073] The bit packing unit 530 generates a bitstream based on the spatial information and either the encoded down-mix signal or the encoded encoder 3D down-mix signal. The bitstream generated by the bit packing unit 530 may include down-mix identification information indicating whether the encoded down-mix signal included in the bitstream is a non-3D down-mix signal with no 3D effects or an encoder 3D down-mix signal with 3D effects. More specifically, the down-mix identification information may indicate whether the bitstream generated by the bit packing unit 530 includes a non-3D down-mix signal, an encoder 3D down-mix signal, or both.

[0074] FIG. 5 is a block diagram of a decoding apparatus according to another embodiment of the present invention. Referring to FIG. 5, the decoding apparatus includes a bit unpacking unit 540, a down-mix decoder 550, and a 3D rendering unit 560. A detailed description of the decoding processes that are the same as in the embodiment of FIG. 1 will be omitted.

[0075] Referring to FIG. 5, the bit unpacking unit 540 extracts an encoded down-mix signal, spatial information, and down-mix identification information from an input bitstream. The down-mix identification information indicates whether the encoded down-mix signal is an encoded non-3D down-mix signal with no 3D effects or an encoded 3D down-mix signal with 3D effects.

[0076] If the input bitstream includes both a non-3D down-mix signal and a 3D down-mix signal, only one of the two may be extracted from the input bitstream, by user selection or according to the capabilities of the decoding apparatus, the characteristics of the reproduction environment, or the required sound quality.

[0077] The down-mix decoder 550 decodes the encoded down-mix signal. If the down-mix signal obtained by the decoding performed by the down-mix decoder 550 is an encoder 3D down-mix signal obtained by performing a 3D rendering operation, the down-mix signal can be readily reproduced.

[0078] On the other hand, if the down-mix signal obtained by the decoding performed by the down-mix decoder 550 is a down-mix signal with no 3D effects, the 3D rendering unit 560 may generate a decoder 3D down-mix signal by performing a 3D rendering operation on that down-mix signal.
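The decoder-side choice described for FIG. 5 can be sketched as follows. This is only an illustration under stated assumptions: the names `DownmixKind` and `select_playback_signal` and the callable `render_3d` are hypothetical and do not appear in the specification, which does not define how the down-mix identification information is represented.

```python
from enum import Enum


class DownmixKind(Enum):
    """Hypothetical stand-in for the down-mix identification information."""
    NON_3D = 0
    ENCODER_3D = 1


def select_playback_signal(decoded_downmix, kind, render_3d):
    """Return a signal with 3D effects, rendering only when needed.

    decoded_downmix: output of the down-mix decoder (unit 550).
    kind: value carried by the down-mix identification information.
    render_3d: callable standing in for the 3D rendering unit 560.
    """
    if kind is DownmixKind.ENCODER_3D:
        # Already rendered by the encoder: reproduce as-is.
        return decoded_downmix
    # No 3D effects yet: produce a decoder 3D down-mix signal.
    return render_3d(decoded_downmix)
```

The renderer is passed in as a callable so that the sketch stays independent of any particular HRTF filter.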

[0079] FIG. 6 is a block diagram of a decoding apparatus according to another embodiment of the present invention. Referring to FIG. 6, the decoding apparatus includes a bit unpacking unit 600, a down-mix decoder 610, a first 3D rendering unit 620, a second 3D rendering unit 630, and a filter information storage unit 640. A detailed description of the decoding processes that are the same as in the embodiment of FIG. 1 will be omitted.

[0080] The bit unpacking unit 600 extracts an encoded encoder 3D down-mix signal and spatial information from an input bitstream. The down-mix decoder 610 decodes the encoded encoder 3D down-mix signal.

[0081] The first 3D rendering unit 620 removes 3D effects from the encoder 3D down-mix signal obtained by the decoding performed by the down-mix decoder 610, using an inverse filter of the filter that the encoding apparatus used to perform the 3D rendering operation. The second 3D rendering unit 630 generates a combined 3D down-mix signal with 3D effects by performing a 3D rendering operation, using a filter stored in the decoding apparatus, on the down-mix signal obtained by the removal performed by the first 3D rendering unit 620.

[0082] The second 3D rendering unit 630 may perform the 3D rendering operation using a filter whose characteristics differ from those of the filter of the encoding unit used to perform the 3D rendering operation. For example, the second 3D rendering unit 630 may perform the 3D rendering operation using an HRTF whose coefficients differ from those of the HRTF used by the encoding apparatus.

[0083] The filter information storage unit 640 stores filter information regarding a filter used to perform 3D rendering, for example, HRTF coefficient information. The second 3D rendering unit 630 may generate the combined 3D down-mix signal using the filter information stored in the filter information storage unit 640.
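The two-stage processing of the first and second 3D rendering units 620 and 630 can be sketched per frequency bin as a division by the encoder's filter followed by a multiplication by the decoder's filter. This is a minimal sketch, assuming a single channel and one complex HRTF value per bin; the function name `rerender_bins` and the invertibility check are illustrative assumptions, not part of the specification.

```python
def rerender_bins(encoded_bins, encoder_hrtf, decoder_hrtf, eps=1e-12):
    """Remove the encoder's 3D effect and apply the decoder's own filter.

    All arguments are sequences of complex values, one per frequency bin.
    """
    out = []
    for y, h_enc, h_dec in zip(encoded_bins, encoder_hrtf, decoder_hrtf):
        if abs(h_enc) < eps:
            # The inverse filter does not exist where the encoder filter is zero.
            raise ValueError("encoder filter is not invertible at this bin")
        plain = y / h_enc           # unit 620: inverse of the encoder's filter
        out.append(plain * h_dec)   # unit 630: filter stored in the decoder
    return out
```

In this sketch the decoder filter values would come from the filter information storage unit 640.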

[0084] The filter information storage unit 640 may store a plurality of pieces of filter information respectively corresponding to a plurality of filters. In this case, one of the pieces of filter information may be selected by user selection or according to the capabilities of the decoding apparatus or the required sound quality.

[0085] People of different races may have different ear structures. Thus, HRTF coefficients optimized for different individuals may differ from one another. The decoding apparatus shown in FIG. 6 can generate a 3D down-mix signal optimized for the user. In addition, the decoding apparatus shown in FIG. 6 can generate a 3D down-mix signal with 3D effects corresponding to the HRTF filter desired by the user, regardless of the type of HRTF provided by the supplier of the 3D down-mix signal.

[0086] FIG. 7 is a block diagram of a 3D rendering apparatus according to an embodiment of the present invention. Referring to FIG. 7, the 3D rendering apparatus includes first and second domain conversion units 700 and 720 and a 3D rendering unit 710. In order to perform a 3D rendering operation in a predetermined domain, the first and second domain conversion units 700 and 720 may be disposed before and after the 3D rendering unit 710, respectively.

[0087] Referring to FIG. 7, an input down-mix signal may be converted into a frequency-domain down-mix signal by the first domain conversion unit 700. More specifically, the first domain conversion unit 700 may convert the input down-mix signal into a DFT-domain or FFT-domain down-mix signal by performing a DFT or an FFT.

[0088] The 3D rendering unit 710 generates a multichannel signal by applying spatial information to the frequency-domain down-mix signal provided by the first domain conversion unit 700. Thereafter, the 3D rendering unit 710 generates a 3D down-mix signal by filtering the multichannel signal.

[0089] The 3D down-mix signal generated by the 3D rendering unit 710 is converted into a time-domain 3D down-mix signal by the second domain conversion unit 720. More specifically, the second domain conversion unit 720 may perform an IDFT or an IFFT on the 3D down-mix signal generated by the 3D rendering unit 710.

[0090] During the conversion of a frequency-domain 3D down-mix signal into a time-domain 3D down-mix signal, data loss or data distortion such as aliasing may occur.

[0091] In order to generate a multichannel signal and a 3D down-mix signal in the frequency domain, the spatial information for each parameter band may be mapped to the frequency domain, and a plurality of filter coefficients may be converted to the frequency domain.

[0092] The 3D rendering unit 710 may generate a 3D down-mix signal by multiplying the frequency-domain down-mix signal provided by the first domain conversion unit 700, the spatial information, and the filter coefficients.

[0093] A time-domain signal obtained by multiplying a down-mix signal, spatial information, and a plurality of filter coefficients that are all represented in an M-point frequency domain has M valid signals. In order to represent the down-mix signal, the spatial information, and the filter coefficients in the M-point frequency domain, an M-point DFT or an M-point FFT may be performed.

[0094] A valid signal is a signal that does not necessarily have a value of 0. For example, a total of x valid signals can be generated by obtaining x signals from an audio signal through sampling. If y of the x valid signals are zero-padded, the number of valid signals is reduced to (x - y). Thereafter, when two signals having a and b valid signals, respectively, are convolved, a total of (a + b - 1) valid signals are obtained.

[0095] Multiplying the down-mix signal, the spatial information, and the filter coefficients in the M-point frequency domain can provide the same effect as convolving the down-mix signal, the spatial information, and the filter coefficients in the time domain. A signal having (3*M - 2) valid signals can be generated by converting the down-mix signal, the spatial information, and the filter coefficients in the M-point frequency domain to the time domain and convolving the results of the conversion.

[0096] Therefore, the number of valid signals in a signal obtained by multiplying the down-mix signal, the spatial information, and the filter coefficients in the frequency domain and converting the result of the multiplication to the time domain may differ from the number of valid signals in a signal obtained by convolving the down-mix signal, the spatial information, and the filter coefficients in the time domain. As a result, aliasing may occur during the conversion of a frequency-domain 3D down-mix signal into a time-domain signal.

[0097] In order to prevent aliasing, the sum of the number of valid signals of the down-mix signal in the time domain, the number of valid signals of the spatial information mapped to the frequency domain, and the number of filter coefficients must not be greater than M. The number of valid signals of the spatial information mapped to the frequency domain may be determined by the number of points of the frequency domain. In other words, if spatial information represented for each parameter band is mapped to an N-point frequency domain, the number of valid signals of the spatial information may be N.

[0098] Referring to FIG. 7, the first domain conversion unit 700 includes a first zero-padding unit 701 and a first frequency-domain conversion unit 702. The 3D rendering unit 710 includes a mapping unit 711, a time-domain conversion unit 712, a second zero-padding unit 713, a second frequency-domain conversion unit 714, a multichannel signal generation unit 715, a third zero-padding unit 716, a third frequency-domain conversion unit 717, and a 3D down-mix signal generation unit 718.

[0099] The first zero-padding unit 701 performs a zero-padding operation on a down-mix signal having X samples in the time domain so that the number of samples of the down-mix signal can be increased from X to M. The first frequency-domain conversion unit 702 converts the zero-padded down-mix signal into an M-point frequency-domain signal. The zero-padded down-mix signal has M samples, of which only X samples are valid signals.

[0100] The mapping unit 711 maps the spatial information for each parameter band to an N-point frequency domain. The time-domain conversion unit 712 converts the spatial information obtained by the mapping performed by the mapping unit 711 to the time domain. The spatial information obtained by the conversion performed by the time-domain conversion unit 712 has N samples.

[0101] The second zero-padding unit 713 performs a zero-padding operation on the spatial information having N samples in the time domain so that the number of samples of the spatial information can be increased from N to M. The second frequency-domain conversion unit 714 converts the zero-padded spatial information into an M-point frequency-domain signal. The zero-padded spatial information has M samples, of which only N samples are valid.

[0102] The multichannel signal generation unit 715 generates a multichannel signal by multiplying the down-mix signal provided by the first frequency-domain conversion unit 702 and the spatial information provided by the second frequency-domain conversion unit 714. The multichannel signal generated by the multichannel signal generation unit 715 has M valid signals. On the other hand, a multichannel signal obtained by convolving, in the time domain, the down-mix signal provided by the first frequency-domain conversion unit 702 and the spatial information provided by the second frequency-domain conversion unit 714 has (X + N - 1) valid signals.

[0103] The third zero-padding unit 716 may perform a zero-padding operation on Y filter coefficients represented in the time domain so that the number of samples can be increased to M. The third frequency-domain conversion unit 717 converts the zero-padded filter coefficients to the M-point frequency domain. The zero-padded filter coefficients have M samples, of which only Y samples are valid signals.

[0104] The 3D down-mix signal generation unit 718 generates a 3D down-mix signal by multiplying the multichannel signal generated by the multichannel signal generation unit 715 and the filter coefficients provided by the third frequency-domain conversion unit 717. The 3D down-mix signal generated by the 3D down-mix signal generation unit 718 has M valid signals. On the other hand, a 3D down-mix signal obtained by convolving, in the time domain, the multichannel signal generated by the multichannel signal generation unit 715 and the filter coefficients provided by the third frequency-domain conversion unit 717 has (X + N + Y - 2) valid signals.

[0105] Aliasing can be prevented by setting the M-point frequency domain used by the first, second, and third frequency-domain conversion units 702, 714, and 717 so that it satisfies M >= (X + N + Y - 2). In other words, aliasing can be prevented by enabling the first, second, and third frequency-domain conversion units 702, 714, and 717 to perform an M-point DFT or an M-point FFT that satisfies M >= (X + N + Y - 2).
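The condition M >= (X + N + Y - 2) can be checked numerically. The sketch below is an illustration only: it uses a naive pure-Python DFT rather than an FFT, and the sample values and the names `dft`, `linear_convolve`, `multiply_in_m_point_domain`, and `max_error` are assumptions made for the example. It zero-pads three short sequences (standing in for the down-mix signal, the spatial information, and the filter coefficients) to M samples, multiplies their M-point spectra, and compares the result with the time-domain convolution.

```python
import cmath


def dft(x, inverse=False):
    """Naive O(n^2) DFT; the inverse includes the 1/n scaling."""
    n = len(x)
    sign = 1 if inverse else -1
    out = []
    for k in range(n):
        acc = sum(x[t] * cmath.exp(sign * 2j * cmath.pi * k * t / n)
                  for t in range(n))
        out.append(acc / n if inverse else acc)
    return out


def linear_convolve(a, b):
    """Time-domain convolution; the result has len(a) + len(b) - 1 samples."""
    out = [0.0] * (len(a) + len(b) - 1)
    for i, av in enumerate(a):
        for j, bv in enumerate(b):
            out[i + j] += av * bv
    return out


def multiply_in_m_point_domain(signals, m):
    """Zero-pad each signal to m samples, multiply the m-point spectra,
    and convert the product back to the time domain."""
    product = [1.0 + 0j] * m
    for s in signals:
        if len(s) > m:
            raise ValueError("m must be at least the length of each signal")
        spectrum = dft(list(s) + [0.0] * (m - len(s)))
        product = [p * q for p, q in zip(product, spectrum)]
    return [v.real for v in dft(product, inverse=True)]


downmix = [1.0, 2.0, 3.0, 4.0]   # X = 4 valid samples
spatial = [0.5, -0.5, 0.25]      # N = 3 valid samples
coeffs = [1.0, 0.5, 0.25]        # Y = 3 filter coefficients

# Reference result: X + N + Y - 2 = 8 valid samples.
reference = linear_convolve(linear_convolve(downmix, spatial), coeffs)


def max_error(m):
    """Largest deviation from the time-domain convolution over m samples."""
    result = multiply_in_m_point_domain([downmix, spatial, coeffs], m)
    expected = (reference + [0.0] * m)[:m]
    return max(abs(r - e) for r, e in zip(result, expected))
```

With these values, `max_error(8)` is at the level of rounding error, whereas `max_error(6)` is clearly non-zero because the last two samples of the convolution wrap around onto the first two.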

[0106] The conversion to the frequency domain may be performed using a filter bank other than a DFT filter bank, an FFT filter bank, and a QMF bank. The generation of a 3D down-mix signal may be performed using an HRTF filter.

[0107] The number of valid signals of the spatial information may be adjusted using a method other than those described above, or using whichever of the above methods is most efficient and requires the least computation.

[0108] Aliasing may occur not only during the conversion of a signal, a coefficient, or spatial information from the frequency domain to the time domain or vice versa, but also during the conversion of a signal, a coefficient, or spatial information from a QMF domain to a hybrid domain or vice versa. The above methods of preventing aliasing may also be used to prevent aliasing during the conversion of a signal, a coefficient, or spatial information from a QMF domain to a hybrid domain or vice versa.

[0109] The spatial information used to generate a multichannel signal or a 3D down-mix signal may vary. As a result of the variation of the spatial information, signal discontinuities may occur as noise in the output signal.

[0110] Noise in the output signal can be reduced using a smoothing method by which the spatial information is prevented from varying rapidly.

[0111] For example, when first spatial information applied to a first frame differs from second spatial information applied to a second frame adjacent to the first frame, a discontinuity is highly likely to occur between the first frame and the second frame.

[0112] In this case, the second spatial information may be compensated for using the first spatial information, or the first spatial information may be compensated for using the second spatial information, so that the difference between the first spatial information and the second spatial information can be reduced and the noise caused by the discontinuity between the first and second frames can be reduced. More specifically, at least one of the first spatial information and the second spatial information may be replaced with the average of the first spatial information and the second spatial information, thereby reducing the noise.

[0113] Noise is also likely to be generated due to a discontinuity between a pair of adjacent parameter bands. For example, when third spatial information corresponding to a first parameter band differs from fourth spatial information corresponding to a second parameter band adjacent to the first parameter band, a discontinuity may occur between the first and second parameter bands.

[0114] In this case, the third spatial information may be compensated for using the fourth spatial information, or the fourth spatial information may be compensated for using the third spatial information, so that the difference between the third spatial information and the fourth spatial information can be reduced and the noise caused by the discontinuity between the first and second parameter bands can be reduced. More specifically, at least one of the third spatial information and the fourth spatial information may be replaced with the average of the third spatial information and the fourth spatial information, thereby reducing the noise.
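The averaging described for adjacent frames and for adjacent parameter bands can be sketched as follows. The function name `replace_with_average` and the parameter values are illustrative assumptions; the specification does not state which of the two pieces of spatial information is replaced, so the sketch replaces only the second.

```python
def replace_with_average(first, second):
    """Return the element-wise average of two sets of spatial parameters."""
    return [(a + b) / 2.0 for a, b in zip(first, second)]


# Hypothetical per-band parameter values for two adjacent frames.
first_frame = [6.0, -3.0, 0.0]
second_frame = [-2.0, 5.0, 4.0]

# Replacing the second frame's parameters with the average halves the
# jump between the two frames in every band.
smoothed_second = replace_with_average(first_frame, second_frame)
```

The same function applies unchanged when the two inputs describe a pair of adjacent parameter bands instead of a pair of adjacent frames.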

[0115] Noise caused by a discontinuity between a pair of adjacent frames or between a pair of adjacent parameter bands may be reduced using methods other than those described above.

[0116] More specifically, each frame may be multiplied by a window such as a Hanning window, and an "overlap and add" scheme may be applied to the results of the multiplication so that variations between frames can be reduced. Alternatively, an output signal to which a plurality of pieces of spatial information are applied may be smoothed so that variations between a plurality of frames of the output signal can be prevented.
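The windowing and overlap-and-add scheme can be sketched as follows. This is a minimal illustration under assumptions the specification does not state: a periodic Hann (Hanning) window and a hop of half the frame length, a combination for which the overlapping windows sum to one. The names `hann` and `overlap_add` are hypothetical.

```python
import math


def hann(length):
    """Periodic Hann window: w[n] = 0.5 - 0.5 * cos(2 * pi * n / length)."""
    return [0.5 - 0.5 * math.cos(2.0 * math.pi * n / length)
            for n in range(length)]


def overlap_add(frames, hop):
    """Window each equal-length frame and sum the frames at multiples of hop."""
    length = len(frames[0])
    window = hann(length)
    out = [0.0] * (hop * (len(frames) - 1) + length)
    for index, frame in enumerate(frames):
        start = index * hop
        for n, sample in enumerate(frame):
            out[start + n] += window[n] * sample
    return out
```

For example, with two 8-sample frames of constant value 1.0 and 3.0 and a hop of 4, the overlapped region moves gradually from 1.0 toward 3.0 instead of jumping at the frame boundary.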

[0117] The decorrelation between channels in a DFT domain may be adjusted using spatial information such as ICC, as follows.

[0118] The degree of decorrelation may be adjusted by multiplying a coefficient of a signal input to a one-to-two (OTT) or two-to-three (TTT) box by a predetermined value. The predetermined value may be defined by the expression (A + (1 - A*A)^0.5 * i), where A indicates the ICC value applied to a predetermined band of the OTT or TTT box and i denotes the imaginary unit. The imaginary part may be positive or negative.
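The predetermined value (A + (1 - A*A)^0.5 * i) can be computed as in the sketch below. The function name `decorrelation_factor` and the flag that selects the sign of the imaginary part are illustrative assumptions. For an ICC value A between -1 and 1 the value has unit magnitude, so multiplying a coefficient by it changes only the phase.

```python
def decorrelation_factor(icc, negative_imaginary=False):
    """Return A + sqrt(1 - A*A) * i for an ICC value A in [-1, 1]."""
    if not -1.0 <= icc <= 1.0:
        raise ValueError("ICC value must lie in [-1, 1]")
    imaginary = (1.0 - icc * icc) ** 0.5
    return complex(icc, -imaginary if negative_imaginary else imaginary)
```

An ICC value of 1 gives a factor of 1 (no change), while an ICC value of 0 gives a purely imaginary factor, that is, a 90-degree phase shift.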

[0119] The predetermined value may be accompanied by a weighting factor according to the characteristics of the signal, for example, the energy level of the signal, the energy characteristics of the signal for each frequency, or the type of box to which the ICC value A is applied. As a result of introducing the weighting factor, the degree of decorrelation can be further adjusted, and interframe smoothing or interpolation may be applied.

[0120] As described above with reference to FIG. 7, a 3D down-mix signal may be generated in the frequency domain by using an HRTF or a head-related impulse response (HRIR) converted to the frequency domain.

[0121] Alternatively, a 3D down-mix signal may be generated by convolving an HRIR and a down-mix signal in the time domain. A 3D down-mix signal generated in the frequency domain may be left in the frequency domain without being subjected to an inverse domain transform.

[0122] In order to convolve an HRIR and a down-mix signal in the time domain, a finite impulse response (FIR) filter or an infinite impulse response (IIR) filter may be used.

[0123] As described above, an encoding apparatus or a decoding apparatus according to an embodiment of the present invention may generate a 3D down-mix signal using a first method that involves the use of an HRTF in the frequency domain or an HRIR converted to the frequency domain, a second method that involves convolving an HRIR in the time domain, or a combination of the first and second methods.

[0124] FIGS. 8 through 11 illustrate bitstreams according to embodiments of the present invention.

[0125] Referring to FIG. 8, a bitstream includes a multichannel decoding information field containing information necessary for generating a multichannel signal, a 3D rendering information field containing information necessary for generating a 3D down-mix signal, and a header field containing header information necessary for using the information in the multichannel decoding information field and the information in the 3D rendering information field. The bitstream may include only one or two of the multichannel decoding information field, the 3D rendering information field, and the header field.

[0126] Referring to FIG. 9, a bitstream containing side information necessary for a decoding operation may include a specific configuration header field containing header information of the entire encoded signal, and a plurality of frame data fields containing side information regarding a plurality of frames. More specifically, each frame data field may include a frame header field containing header information of the corresponding frame and a frame parameter data field containing spatial information of the corresponding frame. Alternatively, each frame data field may include only a frame parameter data field.

[0127] Each frame parameter data field may include a plurality of modules, each module including a flag and parameter data. A module is a data set that includes parameter data such as spatial information, as well as other data necessary for improving the sound quality of a signal, such as down-mix gain and smoothing data.

[0128] If module data regarding information specified by the frame header field is received without any additional flag, if the information specified by the frame header field is further classified, or if an additional flag and data are received together with information not specified by the frame header, the module data may not include any flag.

[0129] Side information regarding a 3D down-mix signal, for example, HRTF coefficient information, may be included in at least one of the specific configuration header field, the frame header field, and the frame parameter data field.

[0130] Referring to FIG. 10, a bitstream may include a plurality of multichannel decoding information fields containing information necessary for generating a multichannel signal, and a plurality of 3D rendering information fields containing information necessary for generating a 3D down-mix signal.

[0131] Upon receiving the bitstream, a decoding apparatus may use either the multichannel decoding information fields or the 3D rendering information fields to perform a decoding operation and skip whichever multichannel decoding information fields and 3D rendering information fields are not used in the decoding operation. In this case, which of the multichannel decoding information fields and the 3D rendering information fields are to be used to perform the decoding operation may be determined according to the type of signal to be reproduced.

[0132] In other words, in order to generate a multichannel signal, the decoding apparatus may skip the 3D rendering information fields and read the information contained in the multichannel decoding information fields. On the other hand, in order to generate a 3D down-mix signal, the decoding apparatus may skip the multichannel decoding information fields and read the information contained in the 3D rendering information fields.

[0133] Methods of skipping some of the plurality of fields in a bitstream are as follows.

[0134] First, field length information regarding the bit size of a field may be included in the bitstream. In this case, the field may be skipped by skipping a number of bits corresponding to the bit size of the field. The field length information may be placed at the beginning of the field.

[0135] Second, a sync word may be placed at the end or the beginning of a field. In this case, the field may be skipped by locating the field based on the position of the sync word.

[0136] Third, if the length of a field is determined in advance and fixed, the field may be skipped by skipping an amount of data corresponding to the length of the field. Fixed field length information regarding the length of the field may be included in the bitstream or stored in the decoding apparatus.

[0137] Fourth, one of a plurality of fields may be skipped using a combination of two or more of the above field-skipping methods.
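The first skipping method, based on field length information placed at the beginning of each field, can be sketched as follows. The layout used here is purely hypothetical and is not the bitstream format of the specification: each field starts with a one-byte type and a two-byte big-endian length, and the length is counted in bytes rather than bits to keep the example short. The names `MULTICHANNEL`, `RENDERING_3D`, `pack_field`, and `read_fields` are likewise assumptions.

```python
import struct

# Hypothetical field types.
MULTICHANNEL = 0   # multichannel decoding information field
RENDERING_3D = 1   # 3D rendering information field

HEADER = ">BH"     # 1-byte type, 2-byte big-endian payload length
HEADER_SIZE = struct.calcsize(HEADER)


def pack_field(kind, payload):
    """Prefix a payload with its type and its length in bytes."""
    return struct.pack(HEADER, kind, len(payload)) + payload


def read_fields(stream, wanted_kind):
    """Return the payloads of the wanted fields, skipping all other fields."""
    offset = 0
    payloads = []
    while offset < len(stream):
        kind, length = struct.unpack_from(HEADER, stream, offset)
        offset += HEADER_SIZE
        if kind == wanted_kind:
            payloads.append(stream[offset:offset + length])
        # Skipping a field means advancing past its payload without parsing it.
        offset += length
    return payloads
```

A decoder that wants a multichannel signal would call `read_fields(stream, MULTICHANNEL)` and never parse the 3D rendering information fields; a decoder that wants a 3D down-mix signal would do the opposite.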

[0138] Field skip information such as field length information, a sync word, or fixed field length information is information necessary for skipping a field. It may be included in one of the specific configuration header field, the frame header field, and the frame parameter data field shown in FIG. 9, or in a field other than those shown in FIG. 9.

[0139] For example, in order to generate a multichannel signal, the decoding apparatus may skip the 3D rendering information fields with reference to the field length information, sync word, or fixed field length information placed at the beginning of each 3D rendering information field, and read the information included in the multichannel decoding information fields.

[0140] On the other hand, in order to generate a 3D down-mix signal, the decoding apparatus may skip the multichannel decoding information fields with reference to the field length information, sync word, or fixed field length information placed at the beginning of each multichannel decoding information field, and read the information included in the 3D rendering information fields.

[0141] A bitstream may include information indicating whether the data included in the bitstream is necessary for generating a multichannel signal or for generating a 3D down-mix signal.

[0142] However, even if a bitstream does not include any spatial information such as CLD but includes only data necessary for generating a 3D down-mix signal (e.g., HRTF filter coefficients), a multichannel signal can be reproduced through decoding using the data necessary for generating a 3D down-mix signal, without the spatial information.

[0143] For example, a stereo parameter, which is spatial information regarding two channels, is obtained from a down-mix signal. The stereo parameter is then converted into spatial information regarding the plurality of channels to be reproduced, and a multi-channel signal is generated by applying the spatial information obtained by the conversion to the down-mix signal.

[0144] On the other hand, even if a bitstream includes only the data necessary for generating a multi-channel signal, a down-mix signal can be reproduced without any additional decoding operation, or a 3D down-mix signal can be reproduced by performing 3D processing on the down-mix signal with an additional HRTF filter.

[0145] If a bitstream includes both the data necessary for generating a multi-channel signal and the data necessary for generating a 3D down-mix signal, a user may be allowed to decide whether to reproduce a multi-channel signal or a 3D down-mix signal.

[0146] Methods of skipping data will hereinafter be described in detail with reference to the respective syntaxes.

[0147] Syntax 1 indicates a method of decoding an audio signal in units of frames.

[0148] [Syntax 1]

[0149]

SpatialFrame()
{
    FramingInfo();
    bsIndependencyFlag;
    OttData();
    (Figure CN101385077BD00151: the rest of Syntax 1 appears only as an image)

[0150] In Syntax 1, OttData() and TttData() are modules that represent the parameters (such as spatial information including a CLD, ICC, and CPC) necessary for restoring a multi-channel signal from a down-mix signal, and SmgData(), TempShapeData(), ArbitraryDownmixData(), and ResidualData() are modules that represent the information necessary for improving sound quality by correcting signal distortions that may occur during an encoding operation.

[0151] For example, if only parameters such as a CLD, ICC, or CPC and the information included in the module ArbitraryDownmixData() are used during a decoding operation, the modules SmgData() and TempShapeData(), which are placed between the modules TttData() and ArbitraryDownmixData(), are unnecessary. Thus, it is efficient to skip the modules SmgData() and TempShapeData().

[0152] A method of skipping modules according to an embodiment of the present invention will hereinafter be described in detail with reference to Syntax 2 below.

[0153] [Syntax 2]

Figure CN101385077BD00152
Figure CN101385077BD00161

[0155] Referring to Syntax 2, a module SkipData() may be placed in front of the modules to be skipped, and the bit size of the modules to be skipped is specified in the module SkipData() as bsSkipBits.

[0156] In other words, assuming that the modules SmgData() and TempShapeData() are to be skipped and that their combined bit size is 150, the modules SmgData() and TempShapeData() can be skipped by setting bsSkipBits to 150.
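
The skip described in [0155] and [0156] can be sketched as follows. This is only an illustration under stated assumptions, not the syntax shown in the figures: the BitReader helper, the 16-bit width assumed for bsSkipBits, and the one-byte payload that follows the skipped modules are all hypothetical.

```python
class BitReader:
    """Minimal MSB-first bit reader over a bytes object (hypothetical helper)."""

    def __init__(self, data: bytes):
        self.data = data
        self.pos = 0  # current position, in bits

    def read(self, nbits: int) -> int:
        """Read nbits as an unsigned integer and advance the position."""
        if self.pos + nbits > len(self.data) * 8:
            raise EOFError("not enough bits left")
        value = 0
        for _ in range(nbits):
            byte = self.data[self.pos // 8]
            bit = (byte >> (7 - self.pos % 8)) & 1
            value = (value << 1) | bit
            self.pos += 1
        return value

    def skip(self, nbits: int) -> None:
        """Advance the position without decoding anything."""
        if self.pos + nbits > len(self.data) * 8:
            raise EOFError("cannot skip past the end")
        self.pos += nbits


SKIP_BITS_WIDTH = 16  # assumed width of bsSkipBits; the source does not state it


def skip_modules(reader: BitReader) -> int:
    """Read bsSkipBits, jump over that many bits, and return the count skipped."""
    bs_skip_bits = reader.read(SKIP_BITS_WIDTH)
    reader.skip(bs_skip_bits)
    return bs_skip_bits


def build_example() -> bytes:
    """Build: bsSkipBits = 150, then 150 filler bits, then the byte 0xA5."""
    bits = format(150, "016b") + "1" * 150 + format(0xA5, "08b")
    bits += "0" * (-len(bits) % 8)  # pad to a whole number of bytes
    return int(bits, 2).to_bytes(len(bits) // 8, "big")


reader = BitReader(build_example())
skipped = skip_modules(reader)
next_byte = reader.read(8)  # first data after the skipped modules
```

The decoder never interprets the 150 filler bits; it only needs the size to find where the next module starts.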

[0157] A method of skipping modules according to another embodiment of the present invention will hereinafter be described in detail with reference to Syntax 3.

[0158] [Syntax 3]

[0159]

Figure CN101385077BD00162

[0160] Referring to Syntax 3, unnecessary modules may be skipped by using bsSkipSyncflag, which is a flag indicating whether a sync word is used, and bsSkipSyncword, which is a sync word that can be placed at the end of the modules to be skipped.

[0161] More specifically, if the flag bsSkipSyncflag is set such that a sync word can be used, the one or more modules between the flag bsSkipSyncflag and the sync word bsSkipSyncword, i.e., the modules SmgData() and TempShapeData(), may be skipped.
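
A sync-word-based skip of the kind described in [0160] and [0161] might look like the sketch below. The three-byte sync word value, the byte alignment, and the one-byte flag are assumptions made for the example; a real format would also have to guarantee that the sync word pattern cannot occur inside the skipped modules.

```python
SYNCWORD = b"\xff\xf1\x5a"  # hypothetical bsSkipSyncword value


def skip_to_syncword(data: bytes, pos: int, use_syncword: bool) -> int:
    """Return the offset just past the sync word, or pos if skipping is off.

    pos is the byte offset right after bsSkipSyncflag.
    """
    if not use_syncword:
        return pos  # nothing is marked as skippable
    end = data.find(SYNCWORD, pos)
    if end < 0:
        raise ValueError("sync word not found")
    return end + len(SYNCWORD)


# Layout: flag byte, skippable modules (standing in for SmgData() and
# TempShapeData()), sync word, then the data the decoder actually wants.
stream = b"\x01" + b"\x10\x20\x30\x40" + SYNCWORD + b"\xaa\xbb"
flag = stream[0] == 1
resume = skip_to_syncword(stream, 1, flag)
payload = stream[resume:]
```

Unlike the bsSkipBits approach, the decoder does not need to know the size of the skipped modules in advance, at the cost of scanning for the sync word.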

[0162] Referring to FIG. 11, a bitstream may include a multi-channel header field containing the header information necessary for reproducing a multi-channel signal, a 3D rendering header field containing the header information necessary for reproducing a 3D down-mix signal, and a plurality of multi-channel decoding information fields containing the data necessary for reproducing a multi-channel signal.

[0163] In order to reproduce a multi-channel signal, a decoding apparatus may skip the 3D rendering header field and read data from the multi-channel header field and the multi-channel decoding information fields.

[0164] The method of skipping the 3D rendering header field is the same as the field skipping methods described above with reference to FIG. 10, and thus a detailed description thereof is omitted.

[0165] In order to reproduce a 3D down-mix signal, a decoding apparatus may read data from the multi-channel decoding information fields and the 3D rendering header field. For example, a decoding apparatus may generate a 3D down-mix signal using the down-mix signal included in the multi-channel decoding information fields and the HRTF coefficient information included in the 3D down-mix signal.

[0166] FIG. 12 is a block diagram of an encoding/decoding apparatus for processing an arbitrary down-mix signal according to an embodiment of the present invention. Referring to FIG. 12, an arbitrary down-mix signal is a down-mix signal other than the down-mix signal generated by a multi-channel encoder 801 included in an encoding apparatus 800. Detailed descriptions of processes that are the same as in the embodiment of FIG. 1 are omitted.

[0167] Referring to FIG. 12, the encoding apparatus 800 includes the multi-channel encoder 801, a spatial information synthesis unit 802, and a comparison unit 803.

[0168] The multi-channel encoder 801 down-mixes an input multi-channel signal into a stereo or mono down-mix signal, and generates the basic spatial information necessary for restoring a multi-channel signal from the down-mix signal.

[0169] The comparison unit 803 compares the down-mix signal with an arbitrary down-mix signal, and generates compensation information based on the result of the comparison. The compensation information is the information necessary for compensating the arbitrary down-mix signal so that it can be converted into a signal approximating the down-mix signal. A decoding apparatus may compensate the arbitrary down-mix signal using the compensation information, and restore a multi-channel signal using the compensated arbitrary down-mix signal. The restored multi-channel signal is more similar to the original input multi-channel signal than a multi-channel signal restored from the arbitrary down-mix signal generated by the multi-channel encoder 801.

[0170] The compensation information may be the difference between the down-mix signal and the arbitrary down-mix signal. A decoding apparatus may compensate the arbitrary down-mix signal by adding the difference between the down-mix signal and the arbitrary down-mix signal to the arbitrary down-mix signal.

[0171] The difference between the down-mix signal and the arbitrary down-mix signal may be a down-mix gain, which indicates the difference between the energy levels of the down-mix signal and the arbitrary down-mix signal.

[0172] The down-mix gain may be determined for each frequency band, for each time/time slot, and/or for each channel. For example, one part of the down-mix gain may be determined for each frequency band, and another part of the down-mix gain may be determined for each time slot.

[0173] The down-mix gain may be determined for each parameter band or for each frequency band optimized for the arbitrary down-mix signal. Parameter bands are frequency intervals to which parameter-type spatial information is applied.
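
The per-band down-mix gain of [0171] to [0173] can be illustrated with the sketch below. Expressing the energy-level difference in dB and applying it as an amplitude scale factor are assumptions made for the example; the source only states that the gain indicates the difference between the energy levels of the two signals.

```python
import math


def band_energy(samples):
    """Sum of squared magnitudes of the samples in one band."""
    return sum(abs(s) ** 2 for s in samples)


def downmix_gain_db(dmx_bands, arb_bands, eps=1e-12):
    """Per-band energy-level difference (in dB) between the down-mix signal
    and the arbitrary down-mix signal."""
    return [
        10.0 * math.log10((band_energy(d) + eps) / (band_energy(a) + eps))
        for d, a in zip(dmx_bands, arb_bands)
    ]


def apply_gain(arb_bands, gains_db):
    """Scale each band of the arbitrary down-mix signal by its gain."""
    out = []
    for band, g in zip(arb_bands, gains_db):
        scale = 10.0 ** (g / 20.0)  # amplitude factor for an energy ratio in dB
        out.append([s * scale for s in band])
    return out


dmx = [[1.0, -1.0], [0.5, 0.5]]  # encoder down-mix signal, two bands
arb = [[0.5, -0.5], [1.0, 1.0]]  # arbitrary down-mix signal, same layout
gains = downmix_gain_db(dmx, arb)
compensated = apply_gain(arb, gains)
```

After compensation each band of the arbitrary down-mix signal carries the same energy as the corresponding band of the encoder's down-mix signal, although the waveforms may still differ.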

[0174] The difference between the energy levels of the down-mix signal and the arbitrary down-mix signal may be quantized. The resolution of the quantization levels used to quantize this energy-level difference may be the same as or different from the resolution of the quantization levels used to quantize a CLD between the down-mix signal and the arbitrary down-mix signal. In addition, the quantization of the energy-level difference may involve the use of all or some of the quantization levels used to quantize the CLD between the down-mix signal and the arbitrary down-mix signal.

[0175] Since the resolution of the difference between the energy levels of the down-mix signal and the arbitrary down-mix signal is generally lower than the resolution of the CLD between the down-mix signal and the arbitrary down-mix signal, the resolution of the quantization levels used to quantize the energy-level difference may have a minute value compared to the resolution of the quantization levels used to quantize the CLD.

[0176] The compensation information for compensating the arbitrary down-mix signal may be extension information including residual information, which specifies the components of the input multi-channel signal that cannot be restored using the arbitrary down-mix signal or the down-mix gain. A decoding apparatus may use the extension information to restore the components of the input multi-channel signal that cannot be restored using the arbitrary down-mix signal or the down-mix gain, and thereby restore a signal that is almost indistinguishable from the original input multi-channel signal.

[0177] Methods of generating the extension information are as follows.

[0178] The multi-channel encoder 801 may generate, as first extension information, information regarding the components of the input multi-channel signal that the down-mix signal lacks. A decoding apparatus may restore a signal that is almost indistinguishable from the original input multi-channel signal by applying the first extension information to the generation of a multi-channel signal using the down-mix signal and the basic spatial information.

[0179] Alternatively, the multi-channel encoder 801 may restore a multi-channel signal using the down-mix signal and the basic spatial information, and generate the difference between the restored multi-channel signal and the original input multi-channel signal as the first extension information.
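
The residual approach of [0179] can be sketched as follows. The signals are toy per-channel sample lists, and the restored signal stands in for whatever the encoder would obtain from the down-mix signal and the basic spatial information; the function names are hypothetical, and a real residual would be coded and quantized rather than transmitted exactly.

```python
def make_residual(original, restored):
    """Encoder side: per-channel, per-sample difference between the original
    input multi-channel signal and the signal restored from the down-mix."""
    return [
        [o - r for o, r in zip(orig_ch, rest_ch)]
        for orig_ch, rest_ch in zip(original, restored)
    ]


def add_residual(restored, residual):
    """Decoder side: add the residual back to the restored signal."""
    return [
        [r + e for r, e in zip(rest_ch, res_ch)]
        for rest_ch, res_ch in zip(restored, residual)
    ]


original = [[1.0, 2.0, 3.0], [0.5, 0.25, 0.0]]  # two channels, three samples
restored = [[0.5, 2.0, 2.5], [0.5, 0.0, 0.25]]  # imperfect parametric restoration
first_extension = make_residual(original, restored)
rebuilt = add_residual(restored, first_extension)
```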

[0180] The comparison unit 803 may generate, as second extension information, information regarding the components of the down-mix signal that the arbitrary down-mix signal lacks, i.e., the components of the down-mix signal that cannot be compensated using the down-mix gain. A decoding apparatus may restore a signal that is almost indistinguishable from the down-mix signal using the arbitrary down-mix signal and the second extension information.

[0181] Besides the methods described above, the extension information may also be generated using various residual coding methods.

[0182] Both the down-mix gain and the extension information may be used as compensation information. More specifically, both the down-mix gain and the extension information may be obtained for the entire frequency band of the down-mix signal and used together as compensation information. Alternatively, the down-mix gain may be used as compensation information for one part of the frequency band of the down-mix signal, and the extension information may be used as compensation information for another part of the frequency band. For example, the extension information may be used as compensation information for a low-frequency band of the down-mix signal, and the down-mix gain may be used as compensation information for a high-frequency band of the down-mix signal.
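
One way a decoder might apply such band-split compensation information is sketched below: bands below a crossover index receive residual-style extension information, and the remaining bands are scaled by a linear down-mix gain. The crossover index, the dictionary layout, and the function name are assumptions made for the example.

```python
def compensate_bands(arb_bands, residual_bands, linear_gains, crossover):
    """Compensate an arbitrary down-mix signal band by band.

    Bands with index < crossover get extension information added to them;
    bands with index >= crossover are scaled by a linear down-mix gain.
    """
    out = []
    for i, band in enumerate(arb_bands):
        if i < crossover:
            out.append([s + r for s, r in zip(band, residual_bands[i])])
        else:
            out.append([s * linear_gains[i] for s in band])
    return out


arb = [[1.0, 2.0], [3.0, 4.0], [2.0, -2.0]]      # three bands, two samples each
residuals = {0: [0.5, -0.5], 1: [0.0, 1.0]}      # extension info, low bands only
gains = {2: 0.5}                                  # down-mix gain, high band only
compensated = compensate_bands(arb, residuals, gains, crossover=2)
```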

[0183] Extension information regarding portions of the down-mix signal other than the low-frequency band, such as peaks or notches that significantly affect sound quality, may also be used as compensation information.

[0184] The spatial information synthesis unit 802 synthesizes the basic spatial information (e.g., a CLD, CPC, ICC, and CTD) and the compensation information, thereby generating spatial information. In other words, the spatial information transmitted to a decoding apparatus may include the basic spatial information, the down-mix gain, and the first and second extension information.

[0185] The spatial information may be included in a bitstream along with the arbitrary down-mix signal, and the bitstream may be transmitted to a decoding apparatus.

[0186] The extension information and the arbitrary down-mix signal may be encoded using an audio encoding method such as an AAC method, an MP3 method, or a BSAC method. The extension information and the arbitrary down-mix signal may be encoded using the same audio encoding method or different audio encoding methods.

[0187] If the extension information and the arbitrary down-mix signal are encoded using the same audio encoding method, a decoding apparatus may decode both the extension information and the arbitrary down-mix signal using a single audio decoding method. In this case, since the arbitrary down-mix signal can always be decoded, the extension information can also always be decoded. However, since the arbitrary down-mix signal is generally input to a decoding apparatus as a pulse code modulation (PCM) signal, the type of audio codec used to encode the arbitrary down-mix signal may not be readily identified, and therefore the type of audio codec used to encode the extension information may not be readily identified either.

[0188] Therefore, audio codec information regarding the types of audio codecs used to encode the arbitrary down-mix signal and the extension information may be inserted into the bitstream.

[0189] More specifically, the audio codec information may be inserted into a specific configuration header field of the bitstream. In this case, a decoding apparatus may extract the audio codec information from the specific configuration header field of the bitstream, and decode the arbitrary down-mix signal and the extension information using the extracted audio codec information.

[0190] On the other hand, if the arbitrary down-mix signal and the extension information are encoded using different encoding methods, the extension information may not be decodable. In this case, since the end of the extension information cannot be identified, no further decoding operation can be performed.

[0191] To address this problem, audio codec information regarding the types of audio codecs respectively used to encode the arbitrary down-mix signal and the extension information may be inserted into a specific configuration header field of the bitstream. A decoding apparatus may then read the audio codec information from the specific configuration header field of the bitstream and use it to decode the extension information. If the decoding apparatus does not include any decoding unit that can decode the extension information, decoding of the extension information may not proceed further, and the information immediately following the extension information may be read.

[0192] The audio codec information regarding the type of audio codec used to encode the extension information may be represented by a syntax element included in a specific configuration header field of the bitstream. For example, the audio codec information may be represented by a 4-bit syntax element bsResidualCodecType, as indicated in Table 1 below.

[0193] Table 1

Figure CN101385077BD00191

[0195] The extension information may include not only residual information but also channel expansion information. The channel expansion information is the information necessary for expanding a multi-channel signal obtained by decoding with the spatial information into a multi-channel signal with more channels. For example, the channel expansion information may be the information necessary for expanding a 5.1-channel signal or a 7.1-channel signal into a 9.1-channel signal.

[0196] The extension information may be included in a bitstream, and the bitstream may be transmitted to a decoding apparatus. The decoding apparatus may then compensate the down-mix signal or expand a multi-channel signal using the extension information. However, the decoding apparatus may skip the extension information instead of extracting it from the bitstream. For example, when generating a multi-channel signal using a 3D down-mix signal included in the bitstream, or generating a 3D down-mix signal using a down-mix signal included in the bitstream, the decoding apparatus may skip the extension information.

[0197] The method of skipping the extension information included in a bitstream may be the same as one of the field skipping methods described above with reference to FIG. 10.

[0198] For example, the extension information may be skipped using at least one of the following: bit size information, which is attached to the beginning of a bitstream including the extension information and indicates the bit size of the extension information; a sync word, which is attached to the beginning or the end of the field including the extension information; and fixed bit size information, which indicates a fixed bit size of the extension information. The bit size information, the sync word, and the fixed bit size information may all be included in the bitstream. The fixed bit size information may also be stored in a decoding apparatus.

[0199] Referring to FIG. 12, a decoding unit 810 includes a down-mix compensation unit 811, a 3D rendering unit 815, and a multi-channel decoder 816.

[0200] The down-mix compensation unit 811 compensates the arbitrary down-mix signal using the compensation information included in the spatial information, for example, the down-mix gain or the extension information.

[0201] The 3D rendering unit 815 generates a decoder 3D down-mix signal by performing a 3D rendering operation on the compensated down-mix signal. The multi-channel decoder 816 generates a 3D multi-channel signal using the compensated down-mix signal and the basic spatial information included in the spatial information.

[0202] The down-mix compensation unit 811 may compensate the arbitrary down-mix signal in the following manner.

[0203] If the compensation information is the down-mix gain, the down-mix compensation unit 811 compensates the energy level of the arbitrary down-mix signal using the down-mix gain, so that the arbitrary down-mix signal can be converted into a signal similar to the down-mix signal.

[0204] If the compensation information is the second extension information, the down-mix compensation unit 811 may use the second extension information to compensate for the components that the arbitrary down-mix signal lacks.

[0205] The multi-channel decoder 816 may generate a multi-channel signal by sequentially applying a pre-matrix M1, a mix-matrix M2, and a post-matrix M3 to the down-mix signal. In this case, the second extension information may be used to compensate the down-mix signal during the application of the mix-matrix M2 to the down-mix signal. In other words, the second extension information may be used to compensate a down-mix signal to which the pre-matrix M1 has already been applied.
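
The matrix chain of [0205] can be sketched as follows. The matrix sizes and entries are arbitrary placeholders chosen for the example and are not taken from the source; the only point illustrated is the ordering, in which the second extension information is added to the output of the pre-matrix M1 before the mix-matrix M2 is applied.

```python
def matvec(matrix, vector):
    """Multiply a matrix (list of rows) by a column vector (list)."""
    return [sum(a * b for a, b in zip(row, vector)) for row in matrix]


# Placeholder matrices: 2 down-mix channels -> 3 intermediate -> 5 output channels.
M1 = [[1.0, 0.0], [0.0, 1.0], [0.5, 0.5]]
M2 = [[1.0, 0.0, -0.5], [0.0, 1.0, -0.5], [0.0, 0.0, 1.0]]
M3 = [[1.0, 0.0, 0.0], [0.0, 1.0, 0.0], [0.0, 0.0, 1.0],
      [0.5, 0.0, 0.0], [0.0, 0.5, 0.0]]


def decode_sample(downmix, second_extension):
    """Apply M1, add the extension information, then apply M2 and M3."""
    v = matvec(M1, downmix)
    v = [x + e for x, e in zip(v, second_extension)]  # compensation after M1
    w = matvec(M2, v)
    return matvec(M3, w)


output = decode_sample([2.0, 4.0], [0.0, 0.0, 1.0])
```

Because the extension information is added per intermediate channel, it can be targeted at a single channel (here the third one) while leaving the others untouched.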

[0206] As described above, each of a plurality of channels may be selectively compensated by applying the extension information to the generation of a multi-channel signal. For example, if the extension information is applied to the center channel of the mix-matrix M2, the left- and right-channel components of the down-mix signal may be compensated by the extension information. If the extension information is applied to the left channel of the mix-matrix M2, the left-channel component of the down-mix signal may be compensated by the extension information.

[0207] Both the down-mix gain and the extension information may be used as compensation information. For example, a low-frequency band of the arbitrary down-mix signal may be compensated using the extension information, and a high-frequency band of the arbitrary down-mix signal may be compensated using the down-mix gain. In addition, portions of the arbitrary down-mix signal other than the low-frequency band, such as peaks or notches that may significantly affect sound quality, may also be compensated using the extension information. Information regarding the portions to be compensated by the extension information may be included in a bitstream. Information indicating whether the down-mix signal included in a bitstream is an arbitrary down-mix signal, and information indicating whether the bitstream includes compensation information, may also be included in the bitstream.

[0208] In order to prevent clipping of the down-mix signal generated by the encoding unit 800, the down-mix signal may be divided by a predetermined gain. The predetermined gain may have a static value or a dynamic value.

[0209] The down-mix compensation unit 811 may restore the original down-mix signal by using the predetermined gain to compensate the down-mix signal that was attenuated to prevent clipping.
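
The clipping prevention of [0208] and [0209] amounts to a division at the encoder and the inverse multiplication at the decoder, as in the sketch below. The static gain of 2.0 and the function names are assumptions made for the example.

```python
def attenuate(signal, gain):
    """Encoder side: divide the down-mix signal by a predetermined gain."""
    return [s / gain for s in signal]


def restore(signal, gain):
    """Decoder side: multiply by the same gain to undo the attenuation."""
    return [s * gain for s in signal]


PREDETERMINED_GAIN = 2.0  # assumed static value

downmix = [0.5, 1.5, -1.75, 0.25]          # peaks exceed the [-1.0, 1.0] range
safe = attenuate(downmix, PREDETERMINED_GAIN)
recovered = restore(safe, PREDETERMINED_GAIN)
```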

[0210] An arbitrary down-mix signal compensated by the down-mix compensation unit 811 can be readily reproduced. Alternatively, an arbitrary down-mix signal yet to be compensated may be input to the 3D rendering unit 815 and converted into a decoder 3D down-mix signal by the 3D rendering unit 815.

[0211] Referring to FIG. 12, the down-mix compensation unit 811 includes a first domain converter 812, a compensation processor 813, and a second domain converter 814.

[0212] The first domain converter 812 converts the domain of the arbitrary down-mix signal into a predetermined domain. The compensation processor 813 compensates the arbitrary down-mix signal in the predetermined domain using the compensation information, for example, the down-mix gain or the extension information.

[0213] The compensation of the arbitrary down-mix signal may be performed in a QMF/hybrid domain. For this, the first domain converter 812 may perform QMF/hybrid analysis on the arbitrary down-mix signal. The first domain converter 812 may also convert the domain of the arbitrary down-mix signal into a domain other than the QMF/hybrid domain, for example, a frequency domain such as a DFT or FFT domain. The compensation of the arbitrary down-mix signal may also be performed in a domain other than the QMF/hybrid domain, for example, a frequency domain or a time domain.

[0214] The second domain converter 814 converts the domain of the compensated arbitrary down-mix signal into the same domain as the original arbitrary down-mix signal. More specifically, the second domain converter 814 does so by inversely performing the domain conversion operation performed by the first domain converter 812.

[0215] For example, the second domain converter 814 may convert the compensated arbitrary down-mix signal into a time-domain signal by performing QMF/hybrid synthesis on it. Likewise, the second domain converter 814 may perform an IDFT or IFFT on the compensated arbitrary down-mix signal.
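
The convert, compensate, and convert-back flow of the first domain converter 812, the compensation processor 813, and the second domain converter 814 can be sketched with a plain DFT. The naive O(n^2) transform and the per-bin gains are assumptions made for the example; a QMF/hybrid filter bank would take the place of the DFT in the QMF/hybrid case.

```python
import cmath


def dft(x):
    """Naive forward DFT (stands in for the first domain converter)."""
    n = len(x)
    return [
        sum(x[t] * cmath.exp(-2j * cmath.pi * k * t / n) for t in range(n))
        for k in range(n)
    ]


def idft(spectrum):
    """Naive inverse DFT (stands in for the second domain converter)."""
    n = len(spectrum)
    return [
        sum(spectrum[k] * cmath.exp(2j * cmath.pi * k * t / n) for k in range(n)) / n
        for t in range(n)
    ]


def compensate_in_dft_domain(signal, bin_gains):
    """Convert to the DFT domain, scale each bin, and convert back.

    bin_gains must be symmetric (gain[k] == gain[n - k]) for the result
    to remain a real signal; the imaginary round-off is discarded.
    """
    spectrum = dft(signal)
    spectrum = [g * c for g, c in zip(bin_gains, spectrum)]
    return [v.real for v in idft(spectrum)]


arbitrary_downmix = [0.0, 1.0, 0.5, -0.5, -1.0, 0.25, 0.75, -0.25]
compensated = compensate_in_dft_domain(arbitrary_downmix, [2.0] * 8)
```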

[0216] Like the 3D rendering unit 710 illustrated in FIG. 7, the 3D rendering unit 815 may perform a 3D rendering operation on the compensated arbitrary down-mix signal in a frequency domain, a QMF/hybrid domain, or a time domain. For this, the 3D rendering unit 815 may include a domain converter (not shown). The domain converter converts the domain of the compensated arbitrary down-mix signal into the domain in which the 3D rendering operation is to be performed, or converts the domain of the signal obtained by the 3D rendering operation.

[0217] The domain in which the compensation processor 813 compensates the arbitrary down-mix signal may be the same as or different from the domain in which the 3D rendering unit 815 performs a 3D rendering operation on the compensated arbitrary down-mix signal.

[0218] FIG. 13 is a block diagram of a down-mix compensation/3D rendering unit 820 according to an embodiment of the present invention. Referring to FIG. 13, the down-mix compensation/3D rendering unit 820 includes a first domain converter 821, a second domain converter 822, a compensation/3D rendering processor 823, and a third domain converter 824.

[0219] The down-mix compensation/3D rendering unit 820 may perform both a compensation operation and a 3D rendering operation on the arbitrary down-mix signal in a single domain, thereby reducing the amount of computation of a decoding apparatus.

[0220] More specifically, the first domain converter 821 converts the domain of the arbitrary down-mix signal into a first domain in which the compensation operation and the 3D rendering operation are to be performed. The second domain converter 822 converts the spatial information, which includes the basic spatial information necessary for generating a multi-channel signal and the compensation information necessary for compensating the arbitrary down-mix signal, so that the spatial information becomes applicable in the first domain. The compensation information may include at least one of the down-mix gain and the extension information.

[0221] For example, the second domain converter 822 may map compensation information corresponding to a parameter band in a QMF/hybrid domain to a frequency band, so that the compensation information becomes readily applicable in a frequency domain.
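
The mapping in [0221] can be pictured as expanding one value per parameter band into one value per frequency bin, as in the sketch below. The band borders, the bin count, and the function name are hypothetical; the source does not specify how parameter bands line up with bins.

```python
def map_bands_to_bins(band_values, band_starts, n_bins):
    """Expand one value per parameter band into one value per frequency bin.

    band_starts[i] is the index of the first bin in band i; band i ends
    where band i + 1 starts, and the last band runs up to n_bins.
    """
    if len(band_values) != len(band_starts):
        raise ValueError("need one start index per band")
    per_bin = []
    for i, value in enumerate(band_values):
        stop = band_starts[i + 1] if i + 1 < len(band_starts) else n_bins
        per_bin.extend([value] * (stop - band_starts[i]))
    return per_bin


gains_per_band = [1.0, 0.5, 2.0]  # e.g. down-mix gains for three parameter bands
starts = [0, 2, 5]                # hypothetical band borders, in bins
gains_per_bin = map_bands_to_bins(gains_per_band, starts, 8)
```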

[0222] The first domain may be a frequency domain such as a DFT or FFT domain, a QMF/hybrid domain, or a time domain. Alternatively, the first domain may be a domain other than those set forth herein.

[0223] A time delay may occur during the conversion of the compensation information. To address this, the second domain converter 822 may perform a delay compensation operation so that the time delay between the domain of the compensation information and the first domain can be compensated.

[0224] The compensation/3D rendering processor 823 performs a compensation operation on the arbitrary down-mix signal in the first domain using the converted spatial information, and then performs a 3D rendering operation on the signal obtained by the compensation operation. The compensation/3D rendering processor 823 may perform the compensation operation and the 3D rendering operation in an order different from that set forth herein.

[0225] The compensation/3D rendering processor 823 may perform the compensation operation and the 3D rendering operation on the arbitrary down-mix signal at the same time. For example, the compensation/3D rendering processor 823 may generate a compensated 3D down-mix signal by performing a 3D rendering operation on the arbitrary down-mix signal in the first domain using a new filter coefficient, which is a combination of the compensation information and an existing filter coefficient typically used in a 3D rendering operation.
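The idea of a combined filter coefficient can be sketched as follows, assuming that the first domain is a frequency domain, that the compensation information reduces to one real gain per bin, and that the rendering filter is one complex coefficient per bin. All names and values are hypothetical. Under those assumptions, multiplying the gains into the rendering coefficients once gives the same result as running the two operations one after the other.

```python
def combine_coefficients(compensation_gains, rendering_coeffs):
    """Fold per-bin compensation gains into per-bin rendering coefficients."""
    if len(compensation_gains) != len(rendering_coeffs):
        raise ValueError("gains and coefficients must cover the same bins")
    return [g * h for g, h in zip(compensation_gains, rendering_coeffs)]


def apply_per_bin(coeffs, spectrum):
    """Multiply each frequency bin of a spectrum by its coefficient."""
    return [c * x for c, x in zip(coeffs, spectrum)]


# Hypothetical values for a 3-bin spectrum of one channel.
gains = [1.0, 0.8, 0.5]
rendering = [0.9 + 0.1j, 0.7 - 0.2j, 0.3 + 0.4j]
spectrum = [1.0 + 0.0j, 0.5 + 0.5j, -0.25 + 1.0j]

# Compensation followed by rendering (two passes over the signal) ...
two_pass = apply_per_bin(rendering, apply_per_bin(gains, spectrum))
# ... versus a single pass with the combined coefficients.
one_pass = apply_per_bin(combine_coefficients(gains, rendering), spectrum)
```

The saving comes from combining the coefficients once per parameter update instead of filtering the signal twice per frame.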

[0226] The third domain converter 824 converts the domain of the 3D down-mix signal generated by the compensation/3D rendering processor 823 into a frequency domain.

[0227] FIG. 14 is a block diagram of a decoding apparatus 900 for processing a compatible down-mix signal according to an embodiment of the present invention. Referring to FIG. 14, the decoding apparatus 900 includes a first multi-channel decoder 910, a down-mix compatibility processing unit 920, a second multi-channel decoder 930, and a 3D rendering unit 940. Detailed descriptions of decoding processes that are the same as in the embodiment of FIG. 1 will be omitted.

[0228] A compatible down-mix signal is a down-mix signal that can be decoded by two or more multi-channel decoders. In other words, a compatible down-mix signal is a down-mix signal that is initially optimized for a predetermined multi-channel decoder and can then be converted, through a compatibility processing operation, into a signal optimized for a multi-channel decoder other than the predetermined one.

[0229] Referring to FIG. 14, assume that an input compatible down-mix signal is optimized for the first multi-channel decoder 910. In order for the second multi-channel decoder 930 to decode the input compatible down-mix signal, the down-mix compatibility processing unit 920 may perform a compatibility processing operation on the input compatible down-mix signal so that it is converted into a signal optimized for the second multi-channel decoder 930. The first multi-channel decoder 910 generates a first multi-channel signal by decoding the input compatible down-mix signal. The first multi-channel decoder 910 may generate a multi-channel signal by decoding using only the input compatible down-mix signal, without requiring spatial information.

[0230] The second multi-channel decoder 930 generates a second multi-channel signal using the down-mix signal obtained by the compatibility processing operation performed by the down-mix compatibility processing unit 920. The 3D rendering unit 940 may generate a decoder 3D down-mix signal by performing a 3D rendering operation on the down-mix signal obtained by that compatibility processing operation.

[0231] A compatible down-mix signal optimized for a predetermined multi-channel decoder may be converted into a down-mix signal optimized for a different multi-channel decoder using compatibility information such as an inverse matrix. For example, when there are first and second multi-channel encoders using different encoding methods and first and second multi-channel decoders using different encoding/decoding methods, an encoding apparatus may apply a matrix to a down-mix signal generated by the first multi-channel encoder, thereby generating a compatible down-mix signal optimized for the second multi-channel decoder. A decoding apparatus may then apply an inverse matrix to the compatible down-mix signal generated by the encoding apparatus, thereby generating a compatible down-mix signal optimized for the first multi-channel decoder.
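The matrix and inverse-matrix relationship can be sketched for a stereo down-mix, assuming the compatibility matrix is a real, invertible 2x2 matrix applied to each left/right sample pair. The matrix entries and the sample values are hypothetical; the specification does not give concrete coefficients.

```python
def invert_2x2(m):
    """Return the inverse of a 2x2 matrix given as [[a, b], [c, d]]."""
    (a, b), (c, d) = m
    det = a * d - b * c
    if abs(det) < 1e-12:
        raise ValueError("matrix is singular; no inverse matrix exists")
    return [[d / det, -b / det], [-c / det, a / det]]


def apply_2x2(m, left, right):
    """Apply a 2x2 matrix to one (left, right) sample pair."""
    return (m[0][0] * left + m[0][1] * right,
            m[1][0] * left + m[1][1] * right)


# Hypothetical compatibility matrix applied on the encoder side.
compat_matrix = [[1.0, 0.2], [0.1, 1.0]]
original = (0.5, -0.25)
compatible = apply_2x2(compat_matrix, *original)

# The decoder side applies the inverse matrix to undo the conversion.
restored = apply_2x2(invert_2x2(compat_matrix), *compatible)
```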

[0232] Referring to FIG. 14, the down-mix compatibility processing unit 920 may perform a compatibility processing operation on the input compatible down-mix signal using an inverse matrix, thereby generating a down-mix signal optimized for the second multi-channel decoder 930.

[0233] Information regarding the inverse matrix used by the down-mix compatibility processing unit 920 may be stored in the decoding apparatus 900 in advance, or may be included in the bitstream transmitted by the encoding apparatus. In addition, information indicating whether the down-mix signal included in the input bitstream is an arbitrary down-mix signal or a compatible down-mix signal may be included in the input bitstream.

[0234] Referring to FIG. 14, the down-mix compatibility processing unit 920 includes a first domain converter 921, a compatibility processor 922, and a second domain converter 923.

[0235] The first domain converter 921 converts the domain of the input compatible down-mix signal into a predetermined domain, and the compatibility processor 922 performs a compatibility processing operation using compatibility information such as an inverse matrix, so that the input compatible down-mix signal in the predetermined domain is converted into a signal optimized for the second multi-channel decoder 930.

[0236] The compatibility processor 922 may perform the compatibility processing operation in a QMF/hybrid domain. To this end, the first domain converter 921 may perform QMF/hybrid analysis on the input compatible down-mix signal. The first domain converter 921 may also convert the domain of the input compatible down-mix signal into a domain other than a QMF/hybrid domain, for example a frequency domain such as a DFT or FFT domain, and the compatibility processor 922 may perform the compatibility processing operation in a domain other than a QMF/hybrid domain, such as a frequency domain or a time domain.

[0237] The second domain converter 923 converts the domain of the compatible down-mix signal obtained by the compatibility processing operation. More specifically, the second domain converter 923 may convert the domain of that signal back into the domain of the original input compatible down-mix signal by performing, in reverse, the domain conversion operation performed by the first domain converter 921.

[0238] For example, the second domain converter 923 may convert the compatible down-mix signal obtained by the compatibility processing operation into a time-domain signal by performing QMF/hybrid synthesis on it. Alternatively, the second domain converter 923 may perform an IDFT or IFFT on the compatible down-mix signal obtained by the compatibility processing operation.
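The convert, process, convert-back pattern can be sketched with the DFT/IDFT option. The sketch uses a direct O(N^2) DFT for brevity rather than an FFT or a QMF/hybrid filter bank, and the processing step is a hypothetical flat gain standing in for the compatibility processing operation.

```python
import cmath


def dft(x):
    """Forward discrete Fourier transform (direct O(N^2) form)."""
    n = len(x)
    return [sum(x[t] * cmath.exp(-2j * cmath.pi * k * t / n) for t in range(n))
            for k in range(n)]


def idft(spectrum):
    """Inverse discrete Fourier transform, undoing dft()."""
    n = len(spectrum)
    return [sum(spectrum[k] * cmath.exp(2j * cmath.pi * k * t / n) for k in range(n)) / n
            for t in range(n)]


# First converter: time domain -> frequency domain.
frame = [0.0, 1.0, 0.5, -0.5, -1.0, 0.25, 0.0, 0.75]
spectrum = dft(frame)

# Placeholder for the processing step (hypothetical flat gain of 0.5).
processed = [0.5 * value for value in spectrum]

# Second converter: the reverse conversion, back to the original domain.
restored = [value.real for value in idft(processed)]
```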

[0239] The 3D rendering unit 940 may perform a 3D rendering operation on the compatible down-mix signal obtained by the compatibility processing operation in a frequency domain, a QMF/hybrid domain, or a time domain. To this end, the 3D rendering unit 940 may include a domain converter (not shown). The domain converter converts the domain of the input compatible down-mix signal into a domain in which the 3D rendering operation is to be performed, or converts the domain of the signal obtained by the 3D rendering operation.

[0240] The domain in which the compatibility processor 922 performs the compatibility processing operation may be the same as or different from the domain in which the 3D rendering unit 940 performs the 3D rendering operation.

[0241] FIG. 15 is a block diagram of a down-mix compatibility processing/3D rendering unit 950 according to an embodiment of the present invention. Referring to FIG. 15, the down-mix compatibility processing/3D rendering unit 950 includes a first domain converter 951, a second domain converter 952, a compatibility/3D rendering processor 953, and a third domain converter 954.

[0242] The down-mix compatibility processing/3D rendering unit 950 performs a compatibility processing operation and a 3D rendering operation in a single domain, thereby reducing the amount of computation of the decoding apparatus.

[0243] The first domain converter 951 converts the input compatible down-mix signal into a first domain in which the compatibility processing operation and the 3D rendering operation are to be performed. The second domain converter 952 converts spatial information and compatibility information, for example an inverse matrix, so that the spatial information and the compatibility information become applicable in the first domain.

[0244] For example, the second domain converter 952 may map an inverse matrix corresponding to a parameter band in a QMF/hybrid domain to a frequency domain, so that the inverse matrix becomes readily applicable in the frequency domain.

[0245] The first domain may be a frequency domain such as a DFT or FFT domain, a QMF/hybrid domain, or a time domain. Alternatively, the first domain may be a domain other than those set forth herein.

[0246] A time delay may occur during the conversion of the spatial information and the compatibility information.

[0247] To address this, the second domain converter 952 may perform a time delay compensation operation so that the time delay between the domain of the spatial information and the compensation information and the first domain can be compensated for.

[0248] The compatibility/3D rendering processor 953 performs a compatibility processing operation on the input compatible down-mix signal in the first domain using the converted compatibility information, and then performs a 3D rendering operation on the compatible down-mix signal obtained by the compatibility processing operation. The compatibility/3D rendering processor 953 may perform the compatibility processing operation and the 3D rendering operation in an order different from that set forth herein.

[0249] The compatibility/3D rendering processor 953 may perform the compatibility processing operation and the 3D rendering operation on the input compatible down-mix signal at the same time. For example, the compatibility/3D rendering processor 953 may generate a 3D down-mix signal by performing a 3D rendering operation on the input compatible down-mix signal in the first domain using a new filter coefficient, which is a combination of the compatibility information and an existing filter coefficient typically used in a 3D rendering operation.
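When the compatibility information is an inverse matrix, the combination of compatibility information and rendering coefficients can be pictured as a matrix product. The sketch below assumes a stereo signal in a frequency domain, a real 2x2 inverse matrix, and a complex 2x2 rendering matrix for one bin; all values are hypothetical. Because the compatibility step runs first, the combined matrix is the rendering matrix multiplied by the inverse matrix, in that order.

```python
def matmul_2x2(a, b):
    """Matrix product a @ b for 2x2 matrices stored as nested lists."""
    return [[a[i][0] * b[0][j] + a[i][1] * b[1][j] for j in range(2)]
            for i in range(2)]


def apply_2x2(m, vec):
    """Apply a 2x2 matrix to a [left, right] pair for one frequency bin."""
    return [m[0][0] * vec[0] + m[0][1] * vec[1],
            m[1][0] * vec[0] + m[1][1] * vec[1]]


# Hypothetical per-bin data.
compat_inverse = [[1.02, -0.2], [-0.1, 1.02]]
rendering = [[0.9 + 0.1j, 0.2 - 0.1j], [0.2 - 0.1j, 0.9 + 0.1j]]
stereo_bin = [0.5 + 0.25j, -0.3 + 0.1j]

# Compatibility processing followed by 3D rendering (two matrix applications).
sequential = apply_2x2(rendering, apply_2x2(compat_inverse, stereo_bin))

# Single application of the precombined matrix.
combined = matmul_2x2(rendering, compat_inverse)
single = apply_2x2(combined, stereo_bin)
```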

[0250] The third domain converter 954 converts the domain of the 3D down-mix signal generated by the compatibility/3D rendering processor 953 into a frequency domain.

[0251] FIG. 16 is a block diagram of a decoding apparatus for canceling crosstalk according to an embodiment of the present invention. Referring to FIG. 16, the decoding apparatus includes a bit unpacking unit 960, a down-mix decoder 970, a 3D rendering unit 980, and a crosstalk cancellation unit 990. Detailed descriptions of decoding processes that are the same as in the embodiment of FIG. 1 will be omitted.

[0252] A 3D down-mix signal output by the 3D rendering unit 980 may be reproduced by headphones. However, when the 3D down-mix signal is reproduced by speakers located away from the user, inter-channel crosstalk is likely to occur.

[0253] Therefore, the decoding apparatus may include the crosstalk cancellation unit 990, which performs a crosstalk cancellation operation on the 3D down-mix signal.
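The specification does not state how the crosstalk cancellation operation is carried out. A common textbook approach, shown here purely as an assumption, models the speaker-to-ear paths for one frequency bin as a 2x2 matrix and pre-filters the signal with the inverse of that matrix, so that each ear receives only its intended channel. The path coefficients below are hypothetical.

```python
def invert_2x2(m):
    """Return the inverse of a 2x2 (possibly complex) matrix."""
    (a, b), (c, d) = m
    det = a * d - b * c
    if abs(det) < 1e-9:
        raise ValueError("acoustic matrix is near-singular at this bin")
    return [[d / det, -b / det], [-c / det, a / det]]


def apply_2x2(m, vec):
    """Apply a 2x2 matrix to a [left, right] pair for one frequency bin."""
    return [m[0][0] * vec[0] + m[0][1] * vec[1],
            m[1][0] * vec[0] + m[1][1] * vec[1]]


# Hypothetical acoustic paths: direct paths on the diagonal,
# crosstalk paths (speaker to the opposite ear) off the diagonal.
acoustic = [[1.0 + 0.0j, 0.4 - 0.2j], [0.4 - 0.2j, 1.0 + 0.0j]]
canceller = invert_2x2(acoustic)

# One bin of the 3D down-mix signal intended for the left and right ears.
binaural_bin = [0.6 + 0.1j, -0.2 + 0.3j]
speaker_feeds = apply_2x2(canceller, binaural_bin)

# What reaches the ears after the acoustic paths act on the speaker feeds.
at_ears = apply_2x2(acoustic, speaker_feeds)
```

In practice the inverse is usually regularized, since the acoustic matrix can be close to singular at some frequencies; the sketch raises an error in that case instead.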

[0254] The decoding apparatus may perform a sound field processing operation.

[0255] Sound field information used in the sound field processing operation, that is, information identifying the space in which the 3D down-mix signal is to be reproduced, may be included in an input bitstream transmitted by an encoding apparatus, or may be selected by the decoding apparatus.

[0256] The input bitstream may include reverberation time information. A filter used in the sound field processing operation may be controlled according to the reverberation time information.
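As an illustration of how reverberation time information could control a filter, the sketch below uses a feedback comb filter, a standard building block of artificial reverberators. The choice of a comb filter is an assumption; the specification does not name a filter structure. The feedback gain is set so that the impulse response decays by 60 dB over the given reverberation time.

```python
def comb_feedback_gain(delay_samples, sample_rate, reverb_time_s):
    """Feedback gain giving a 60 dB decay over reverb_time_s seconds."""
    return 10.0 ** (-3.0 * delay_samples / (sample_rate * reverb_time_s))


def feedback_comb(x, delay_samples, gain):
    """Feedback comb filter: y[n] = x[n] + gain * y[n - delay_samples]."""
    y = []
    for n, sample in enumerate(x):
        feedback = y[n - delay_samples] if n >= delay_samples else 0.0
        y.append(sample + gain * feedback)
    return y


# Hypothetical setting: 10 ms loop delay at 48 kHz, 1 s reverberation time.
gain = comb_feedback_gain(delay_samples=480, sample_rate=48000, reverb_time_s=1.0)

# Impulse response of a short comb filter, to show the repeating decay.
impulse_response = feedback_comb([1.0, 0.0, 0.0, 0.0, 0.0], delay_samples=2, gain=0.5)
```

A longer reverberation time yields a gain closer to 1 and therefore a slower decay.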

[0257] The sound field processing operation may be performed differently for an early part and a late reverberation part. For example, the early part may be processed using an FIR filter, and the late reverberation part may be processed using an IIR filter.

[0258] More specifically, the sound field processing operation may be performed on the early part by performing a convolution operation in a time domain using an FIR filter, or by performing a multiplication operation in a frequency domain and converting the result of the multiplication operation to a time domain. The sound field processing operation may be performed on the late reverberation part in a time domain.
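The two ways of processing the early part can be compared in a short sketch, assuming the early-part filter is a short FIR filter. Time-domain convolution and frequency-domain multiplication give the same output, provided both sequences are zero-padded to the full output length before transforming. The signal and filter taps are hypothetical, and a direct O(N^2) DFT stands in for an FFT.

```python
import cmath


def convolve(x, h):
    """Direct time-domain convolution of a signal with FIR taps."""
    y = [0.0] * (len(x) + len(h) - 1)
    for i, xi in enumerate(x):
        for j, hj in enumerate(h):
            y[i + j] += xi * hj
    return y


def dft(x, inverse=False):
    """Direct DFT; with inverse=True, the inverse DFT including the 1/N scale."""
    n = len(x)
    sign = 1 if inverse else -1
    out = [sum(x[t] * cmath.exp(sign * 2j * cmath.pi * k * t / n) for t in range(n))
           for k in range(n)]
    return [v / n for v in out] if inverse else out


def convolve_via_dft(x, h):
    """Same result as convolve(), computed by per-bin multiplication."""
    n = len(x) + len(h) - 1
    # Zero-pad so the circular convolution equals the linear one.
    spec_x = dft(list(x) + [0.0] * (n - len(x)))
    spec_h = dft(list(h) + [0.0] * (n - len(h)))
    product = [a * b for a, b in zip(spec_x, spec_h)]
    return [v.real for v in dft(product, inverse=True)]


signal = [1.0, 0.5, -0.25, 0.0]
early_taps = [0.6, 0.3, 0.1]  # hypothetical early-reflection FIR taps

time_domain = convolve(signal, early_taps)
freq_domain = convolve_via_dft(signal, early_taps)
```

For long filters the frequency-domain route is usually cheaper when an FFT is used, which is one reason a decoder might prefer it.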

[0259] The present invention can be realized as computer-readable code written on a computer-readable recording medium. The computer-readable recording medium may be any type of recording device in which data is stored in a computer-readable manner. Examples of the computer-readable recording medium include a ROM, a RAM, a CD-ROM, a magnetic tape, a floppy disk, an optical data storage device, and a carrier wave (e.g., data transmission through the Internet). The computer-readable recording medium can be distributed over a plurality of computer systems connected to a network so that computer-readable code is written to it and executed from it in a decentralized manner. Functional programs, code, and code segments needed for realizing the present invention can be easily construed by one of ordinary skill in the art.

[0260] As described above, according to the present invention, it is possible to efficiently encode multi-channel signals with 3D effects and to adaptively restore and reproduce audio signals with optimum sound quality according to the characteristics of a reproduction environment.

[0261] Industrial Applicability

[0262] Other implementations are within the scope of the following claims. For example, grouping, data coding, and entropy coding according to the present invention can be applied to various application fields and various products. A storage medium storing data to which an aspect of the present invention is applied is within the scope of the present invention.

Claims (14)

1. A decoding method for decoding an audio signal, the decoding method comprising: extracting a first three-dimensional (3D) down-mix signal from an input bitstream; generating a 3D-effect-removed down-mix signal by performing a first 3D rendering operation that removes 3D effects from the first 3D down-mix signal; and generating a second 3D down-mix signal with 3D effects by performing a second 3D rendering operation on the 3D-effect-removed down-mix signal.
2. The method of claim 1, wherein the generation of the second 3D down-mix signal is performed using filter information stored in a decoding apparatus.
3. The method of claim 2, wherein the filter information includes information regarding coefficients of a head-related transfer function (HRTF).
4. The method of claim 1, wherein the generation of the second 3D down-mix signal is performed using a filter whose characteristics differ from those of a filter used to generate the first 3D down-mix signal.
5. The method of claim 1, further comprising determining, from among a plurality of filters, a filter to be used for generating the second 3D down-mix signal.
6. The method of claim 5, wherein the filter used for generating the second 3D down-mix signal is determined based on at least one of: a selection made by a user, the capabilities of the decoding apparatus, the characteristics of a reproduction environment, and a required sound quality.
7. The method of claim 1, wherein the generation of the 3D-effect-removed down-mix signal is performed using an inverse filter of a filter used to generate the first 3D down-mix signal.
8. An apparatus for decoding an audio signal, comprising: a bit unpacking unit which extracts a first 3D down-mix signal from an input bitstream; a first 3D rendering unit which generates a 3D-effect-removed down-mix signal by performing a first 3D rendering operation that removes 3D effects from the first 3D down-mix signal; and a second 3D rendering unit which generates a second 3D down-mix signal with 3D effects by performing a second 3D rendering operation on the 3D-effect-removed down-mix signal.
9. The apparatus of claim 8, further comprising a storage unit which stores filter information regarding a filter used to generate the second 3D down-mix signal, wherein the second 3D rendering unit generates the second 3D down-mix signal using the filter information.
10. The apparatus of claim 9, wherein the filter information includes information regarding coefficients of a head-related transfer function (HRTF).
11. The apparatus of claim 8, wherein the second 3D rendering unit generates the second 3D down-mix signal using a filter whose characteristics differ from those of a filter used to generate the first 3D down-mix signal.
12. The apparatus of claim 8, wherein the second 3D rendering unit determines, from among a plurality of filters, a filter to be used for generating the second 3D down-mix signal.
13. The apparatus of claim 12, wherein the filter used for generating the second 3D down-mix signal is determined based on at least one of: a selection made by a user, the capabilities of the decoding apparatus, the characteristics of a reproduction environment, and a required sound quality.
14. The apparatus of claim 8, wherein the first 3D rendering unit generates the 3D-effect-removed down-mix signal using an inverse filter of a filter used to generate the first 3D down-mix signal.
CN 200780004527 2006-02-07 2007-02-07 Apparatus and method for encoding/decoding signal CN101385077B (en)

Priority Applications (17)

Application Number Priority Date Filing Date Title
US76574706P true 2006-02-07 2006-02-07
US60/765,747 2006-02-07
US77147106P true 2006-02-09 2006-02-09
US60/771,471 2006-02-09
US77333706P true 2006-02-15 2006-02-15
US60/773,337 2006-02-15
US77577506P true 2006-02-23 2006-02-23
US60/775,775 2006-02-23
US78175006P true 2006-03-14 2006-03-14
US60/781,750 2006-03-14
US78251906P true 2006-03-16 2006-03-16
US60/782,519 2006-03-16
US79232906P true 2006-04-17 2006-04-17
US60/792,329 2006-04-17
US79365306P true 2006-04-21 2006-04-21
US60/793,653 2006-04-21
PCT/KR2007/000672 WO2007091845A1 (en) 2006-02-07 2007-02-07 Apparatus and method for encoding/decoding signal

Publications (2)

Publication Number Publication Date
CN101385077A CN101385077A (en) 2009-03-11
CN101385077B true CN101385077B (en) 2012-04-11

Family

ID=40422032

Family Applications (7)

Application Number Title Priority Date Filing Date
CN 200780004515 CN101385076B (en) 2006-02-07 2007-02-07 Apparatus and method for encoding/decoding signal
CN 200780004545 CN101379554B (en) 2006-02-07 2007-02-07 Apparatus and method for encoding/decoding signal
CN 200780004527 CN101385077B (en) 2006-02-07 2007-02-07 Apparatus and method for encoding/decoding signal
CN 200780004535 CN101379553B (en) 2006-02-07 2007-02-07 Apparatus and method for encoding/decoding signal
CN200780004505.3A CN101385075B (en) 2006-02-07 2007-02-07 Apparatus and method for encoding/decoding signal
CN 200780004555 CN101379555B (en) 2006-02-07 2007-02-07 Apparatus and method for encoding/decoding signal
CN 200780004508 CN101379552B (en) 2006-02-07 2007-02-07 Apparatus and method for encoding/decoding signal

Country Status (1)

Country Link
CN (7) CN101385076B (en)

Families Citing this family (9)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
MX2011006248A (en) * 2009-04-08 2011-07-20 Fraunhofer Ges Forschung Apparatus, method and computer program for upmixing a downmix audio signal using a phase value smoothing.
JP2011217139A (en) * 2010-03-31 2011-10-27 Sony Corp Signal processing device and method, and program
CN102884789B (en) * 2010-05-11 2017-04-12 瑞典爱立信有限公司 Video signal compression coding
EP2875510A4 (en) 2012-07-19 2016-04-13 Nokia Technologies Oy Stereo audio signal encoder
EP2757559A1 (en) 2013-01-22 2014-07-23 Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. Apparatus and method for spatial audio object coding employing hidden objects for signal mixture manipulation
KR101703333B1 (en) * 2013-03-29 2017-02-06 삼성전자주식회사 Audio providing apparatus and method thereof
US9883312B2 (en) 2013-05-29 2018-01-30 Qualcomm Incorporated Transformed higher order ambisonics audio data
US9922656B2 (en) * 2014-01-30 2018-03-20 Qualcomm Incorporated Transitioning of ambient higher-order ambisonic coefficients
US10074373B2 (en) * 2015-12-21 2018-09-11 Qualcomm Incorporated Channel adjustment for inter-frame temporal shift variations

Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US6118875A (en) 1994-02-25 2000-09-12 Moeller; Henrik Binaural synthesis, head-related transfer functions, and uses thereof
US6574339B1 (en) 1998-10-20 2003-06-03 Samsung Electronics Co., Ltd. Three-dimensional sound reproducing apparatus for multiple listeners and method thereof

Family Cites Families (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO1999049574A1 (en) 1998-03-25 1999-09-30 Lake Technology Limited Audio signal processing method and apparatus
DE19847689B4 (en) * 1998-10-15 2013-07-11 Samsung Electronics Co., Ltd. Apparatus and method for three-dimensional sound reproduction
EP1211857A1 (en) 2000-12-04 2002-06-05 STMicroelectronics N.V. Process and device of successive value estimations of numerical symbols, in particular for the equalization of a data communication channel of information in mobile telephony
EP1315148A1 (en) * 2001-11-17 2003-05-28 Deutsche Thomson-Brandt Gmbh Determination of the presence of ancillary data in an audio bitstream
DE60326782D1 (en) 2002-04-22 2009-04-30 Koninkl Philips Electronics Nv Decoding device with decorrelation unit
US7583805B2 (en) 2004-02-12 2009-09-01 Agere Systems Inc. Late reverberation-based synthesis of auditory scenes
KR100773539B1 (en) * 2004-07-14 2007-11-05 삼성전자주식회사 Multi channel audio data encoding/decoding method and apparatus

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
HERRE J ET AL.THE REFERENCE MODEL ARCHITECTURE FOR MPEG SPATIAL AUDIO CODING.《AUDIO ENGINEERING SOCIETY CONVENTION》.2005,

Also Published As

Publication number Publication date
CN101379553A (en) 2009-03-04
CN101379554A (en) 2009-03-04
CN101379552A (en) 2009-03-04
CN101379554B (en) 2012-09-19
CN101385075B (en) 2015-04-22
CN101379552B (en) 2013-06-19
CN101379555B (en) 2013-03-13
CN101385076A (en) 2009-03-11
CN101385075A (en) 2009-03-11
CN101385077A (en) 2009-03-11
CN101379553B (en) 2012-02-29
CN101385076B (en) 2012-11-28
CN101379555A (en) 2009-03-04

Legal Events

Date Code Title Description
C06 Publication
C10 Request of examination as to substance
C14 Granted