CN101292285B - Method for encoding and decoding multi-channel audio signal and apparatus thereof - Google Patents

Method for encoding and decoding multi-channel audio signal and apparatus thereof

Info

Publication number
CN101292285B
Authority
CN
China
Prior art keywords
audio signal
mix
channel
spatial information
compensating parameter
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN2006800385900A
Other languages
Chinese (zh)
Other versions
CN101292285A (en
Inventor
郑亮源
房熙锡
吴贤午
金东秀
林宰显
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
LG Electronics Inc
Original Assignee
LG Electronics Inc
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by LG Electronics Inc filed Critical LG Electronics Inc
Priority claimed from PCT/KR2006/004285 external-priority patent/WO2007046660A1/en
Publication of CN101292285A publication Critical patent/CN101292285A/en
Application granted granted Critical
Publication of CN101292285B publication Critical patent/CN101292285B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Landscapes

  • Stereophonic System (AREA)

Abstract

Methods and apparatuses for encoding and decoding a multi-channel audio signal are provided. In the decoding method, a down-mix signal is generated based on an input signal, and spatial information is generated based on the down-mix signal through estimation. Then, a multi-channel audio signal is generated based on the down-mix signal and the spatial information. Therefore, it is possible to compensate for a down-mix signal or generate additional spatial information by using additional information.

Description

Method for encoding and decoding a multi-channel audio signal and apparatus thereof
Technical field
The present invention relates to an encoding method and apparatus and a decoding method and apparatus, and more particularly to an encoding method and apparatus and a decoding method and apparatus in which a multi-channel audio signal is encoded or decoded using additional information that can compensate a down-mix audio signal or can be used to generate additional spatial information.
Background art
In a typical method of encoding a multi-channel audio signal, the multi-channel audio signal is down-mixed into a mono or stereo signal, and the mono or stereo signal is encoded together with spatial information, rather than encoding each channel of the multi-channel audio signal. The spatial information is used to restore the original multi-channel audio signal.
FIG. 1 is a block diagram of a typical system for encoding/decoding a multi-channel audio signal. Referring to FIG. 1, an audio signal encoder includes a down-mixing module, which generates a down-mix audio signal by down-mixing a multi-channel audio signal into a stereo or mono signal, and a spatial parameter estimation module, which generates spatial information. The system may receive an artistic down-mix signal produced externally, rather than generating the down-mix audio signal itself. An audio signal decoder interprets the spatial information generated by the spatial parameter estimation module and restores the original multi-channel audio signal based on the result of the interpretation. However, when the audio signal encoder generates the down-mix audio signal, or when an artistic down-mix signal is produced, signal level attenuation is likely to occur as the different channel signals are added together. For example, when two channels having levels L1 and L2 are added, the two channels may not fully overlap but may partially cancel each other, so that the level DL12 of the channel obtained by the addition is lower than the sum of L1 and L2.
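The level attenuation described above can be reproduced numerically. The following is a minimal sketch, not taken from the patent: it assumes that "level" means the RMS value of a block of samples and uses two hypothetical unit-amplitude sine tones that are 120 degrees out of phase. The helper names `rms` and `tone` are illustrative only.

```python
import math

def rms(samples):
    """Root-mean-square level of a list of samples (assumed level measure)."""
    return math.sqrt(sum(v * v for v in samples) / len(samples))

def tone(phase, n=480, cycles=10):
    """Unit-amplitude sine covering a whole number of cycles, with a start phase in radians."""
    return [math.sin(2 * math.pi * cycles * i / n + phase) for i in range(n)]

ch1 = tone(0.0)
ch2 = tone(2 * math.pi / 3)               # 120 degrees out of phase with ch1
mixed = [a + b for a, b in zip(ch1, ch2)]  # plain addition, as in a simple down-mix

L1, L2, DL12 = rms(ch1), rms(ch2), rms(mixed)
print(f"L1={L1:.3f}  L2={L2:.3f}  L1+L2={L1 + L2:.3f}  DL12={DL12:.3f}")
```

With these inputs the mixed level DL12 stays at about the level of a single channel, well below L1 + L2, which is the kind of gap the compensation parameter is meant to describe.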
Attenuation of the level of the down-mix audio signal can cause signal distortion during decoding. For example, the relationship between channel levels can be determined from channel level difference (CLD) information, which is a type of spatial information that indicates the difference between channel levels. However, when the level of the down-mix audio signal obtained by adding channels is attenuated, the level of the down-mix audio signal obtained through decoding is lower than the level of the original down-mix audio signal.
As a result of this phenomenon, the multi-channel audio signal obtained by decoding may be boosted or suppressed in a predetermined frequency range, degrading sound quality. In addition, because the degree of level attenuation caused by partial cancellation between signals varies with frequency, the degree of distortion of a signal that has passed through the audio encoder and the audio decoder also varies with frequency. This problem cannot be fully solved by changing the energy level of the down-mix audio signal within a predetermined frequency range. Furthermore, in some cases not all of the necessary spatial information can be transmitted, which degrades the sound quality of the multi-channel audio signal obtained by decoding.
Disclosure of the invention
Technical problem
The present invention provides an encoding method and apparatus in which a multi-channel audio signal is encoded using additional information that can compensate a down-mix audio signal and can be used to generate additional spatial information.
The present invention also provides a decoding method and apparatus in which a multi-channel audio signal is decoded using additional information that can compensate a down-mix audio signal and can be used to generate additional spatial information.
Technical solution
According to an aspect of the present invention, a decoding method is provided. The decoding method includes extracting a down-mix audio signal and additional information from an input signal; generating spatial information based on the additional information and the down-mix audio signal; and generating a multi-channel audio signal based on the down-mix audio signal and the spatial information.
According to another aspect of the present invention, a decoding apparatus is provided. The decoding apparatus includes a demultiplexer, which extracts an encoded down-mix audio signal and additional information from an input signal; a core decoder, which generates a down-mix audio signal by decoding the encoded down-mix audio signal; a framing unit, which arranges data of the down-mix audio signal so as to synchronize the down-mix audio signal; a spatial information estimation unit, which generates spatial information through estimation based on the additional information and the down-mix audio signal arranged by the framing unit; and a multi-channel synthesis unit, which generates a multi-channel audio signal based on the down-mix audio signal and the spatial information.
According to another aspect of the present invention, a decoding method is provided. The decoding method includes generating a down-mix audio signal based on an input signal; generating spatial information through estimation based on the down-mix audio signal; and generating a multi-channel audio signal based on the down-mix audio signal and the spatial information.
According to another aspect of the present invention, a decoding apparatus is provided. The decoding apparatus includes a core decoder, which generates a down-mix audio signal by decoding an encoded down-mix audio signal; a framing unit, which arranges data of the down-mix audio signal so as to synchronize the down-mix audio signal; a spatial information estimation unit, which generates spatial information through estimation based on the down-mix audio signal arranged by the framing unit; and a multi-channel synthesis unit, which generates a multi-channel audio signal based on the down-mix audio signal and the spatial information.
According to another aspect of the present invention, a decoding method is provided. The decoding method includes extracting a down-mix audio signal and additional information from an input signal; generating a multi-channel audio signal based on the down-mix audio signal and spatial information extracted from the additional information; and compensating the multi-channel audio signal based on a compensation parameter extracted from the additional information.
According to another aspect of the present invention, an encoding method is provided. The encoding method includes calculating spatial information based on a multi-channel audio signal and a down-mix audio signal; and generating a bitstream by encoding the down-mix audio signal and information selected from the spatial information.
According to another aspect of the present invention, a computer-readable recording medium is provided, having recorded thereon a program for executing a decoding method. The decoding method includes extracting a down-mix audio signal and additional information from an input signal; generating spatial information based on the additional information and the down-mix audio signal; and generating a multi-channel audio signal based on the down-mix audio signal and the spatial information.
According to another aspect of the present invention, a computer-readable recording medium is provided, having recorded thereon a program for executing a decoding method. The decoding method includes generating a down-mix audio signal based on an input signal; generating spatial information through estimation based on the down-mix audio signal; and generating a multi-channel audio signal based on the down-mix audio signal and the spatial information.
According to another aspect of the present invention, a computer-readable recording medium is provided, having recorded thereon a program for executing an encoding method. The encoding method includes calculating spatial information based on a multi-channel audio signal and a down-mix audio signal; and generating a bitstream by encoding the down-mix audio signal and information selected from the spatial information.
Advantageous effects
In the decoding method, a down-mix audio signal is generated based on an input signal, and spatial information is generated through estimation based on the down-mix audio signal. A multi-channel audio signal is then generated based on the down-mix audio signal and the spatial information. Therefore, additional information can be used to compensate the down-mix audio signal or to generate additional spatial information.
Brief description of the drawings
The above and other features and advantages of the present invention will become more apparent from the following detailed description of exemplary embodiments with reference to the accompanying drawings, in which:
FIG. 1 is a block diagram of a typical system for encoding/decoding a multi-channel audio signal;
FIG. 2 is a block diagram of an encoding apparatus according to an embodiment of the present invention;
FIG. 3 is a block diagram of a decoding apparatus according to an embodiment of the present invention;
FIG. 4 is a flowchart illustrating the operation of the decoding apparatus shown in FIG. 3 according to an embodiment of the present invention;
FIG. 5 is a block diagram of a decoding apparatus according to another embodiment of the present invention; and
FIG. 6 is a block diagram of a decoding apparatus according to another embodiment of the present invention.
Best mode for carrying out the invention
The present invention will now be described more fully with reference to the accompanying drawings, in which exemplary embodiments of the invention are shown.
An encoding method and apparatus and a decoding method and apparatus according to an embodiment of the present invention can be used to process a multi-channel audio signal. However, the present invention is not limited thereto; it can also be applied to signals other than multi-channel audio signals.
FIG. 2 is a block diagram of an encoding apparatus according to an embodiment of the present invention. Referring to FIG. 2, the encoding apparatus includes a down-mixing unit 110, a compensation parameter calculation unit 120, a spatial information calculation unit 130, and a bitstream generation unit 170. The bitstream generation unit 170 includes a core encoder 140, a parameter encoder 150, and a multiplexer 160.
The down-mixing unit 110 generates a down-mix audio signal by down-mixing an input multi-channel audio signal into a mono or stereo signal. The compensation parameter calculation unit 120 compares the level or envelope of the down-mix audio signal generated by the down-mixing unit 110, or of an input artistic down-mix signal, with the level or envelope of the multi-channel audio signal used to generate the down-mix audio signal or the input artistic down-mix signal, and calculates, based on the result of the comparison, the compensation parameter needed to compensate the down-mix audio signal. The spatial information calculation unit 130 calculates spatial information of the multi-channel audio signal.
The core encoder 140 of the bitstream generation unit 170 encodes the down-mix audio signal. The parameter encoder 150 of the bitstream generation unit 170 generates additional information by encoding the compensation parameter and the spatial information. The multiplexer 160 then generates a bitstream by combining the encoded down-mix audio signal and the additional information. Specifically, the down-mixing unit 110 generates the down-mix audio signal by down-mixing the input multi-channel audio signal. For example, when a multi-channel audio signal having five channels (channels 1 to 5) is down-mixed into a stereo signal, down-mix channel 1 can be obtained by combining channels 1, 3, and 4 of the multi-channel audio signal, and down-mix channel 2 can be obtained by combining channels 2, 3, and 5.
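The five-to-two down-mix in this example can be sketched as follows. This is an illustrative sketch only: the function name `downmix_5_to_2`, the sample-wise weighted sum, and the default gain values are assumptions, since the text only states which channels are combined into each down-mix channel.

```python
def downmix_5_to_2(ch, g3=0.5, g4=0.5, g5=0.5):
    """Down-mix five channels to two.

    ch maps the channel number (1..5) to a list of samples of equal length.
    Down-mix channel 1 combines channels 1, 3 and 4; down-mix channel 2
    combines channels 2, 3 and 5. The gains g3, g4, g5 are hypothetical
    defaults chosen for illustration.
    """
    dm1 = [a + g3 * c + g4 * d for a, c, d in zip(ch[1], ch[3], ch[4])]
    dm2 = [b + g3 * c + g5 * e for b, c, e in zip(ch[2], ch[3], ch[5])]
    return dm1, dm2

channels = {
    1: [1.0, 2.0],
    2: [0.0, 1.0],
    3: [2.0, 2.0],
    4: [4.0, 0.0],
    5: [0.0, 4.0],
}
dm1, dm2 = downmix_5_to_2(channels)
print(dm1, dm2)
```

Channel 3 contributes to both down-mix channels, which is why a later compensation step has to treat it differently from the channels that feed only one down-mix channel.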
Once the down-mix audio signal has been generated, the compensation parameter calculation unit 120 calculates the compensation parameter needed to compensate the down-mix audio signal. The compensation parameter can be calculated in various ways. For example, assume that the multi-channel audio signal includes five channels belonging to a predetermined frequency band (channels 1, 2, 3, 4, and 5), that L1, L2, L3, L4, and L5 denote the levels of channels 1 to 5, respectively, that down-mix channel 1 comprises channels 1, 3, and 4, and that down-mix channel 2 comprises channels 2, 3, and 5. In this case, the level DL134 of down-mix channel 1 and the level DL235 of down-mix channel 2 can be expressed by equation (1):
Mathematical expression 1
$$DL_{134} \le L_1 + g_3 L_3 + g_4 L_4$$
$$DL_{235} \le L_2 + g_3 L_3 + g_5 L_5$$
where g3, g4, and g5 denote the gains applied during the down-mixing operation. When a multi-channel audio signal is generated by decoding based on the down-mix audio signal, the levels L1', L2', L3', L4', and L5' of the five channels of the generated multi-channel audio signal ideally equal the original levels L1, L2, L3, L4, and L5 of the five channels of the original multi-channel audio signal, respectively. To achieve this, the compensation parameter CF134 of down-mix channel 1 and the compensation parameter CF235 of down-mix channel 2 can be calculated using equation (2):
Mathematical expression 2
$$CF_{134} = \frac{L_1 + g_3 L_3 + g_4 L_4}{DL_{134}}$$
$$CF_{235} = \frac{L_2 + g_3 L_3 + g_5 L_5}{DL_{235}}$$
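Equation (2) translates directly into code. The sketch below assumes the channel levels and down-mix levels have already been measured for one frequency band; the function name `compensation_parameters` and the numeric inputs are hypothetical.

```python
def compensation_parameters(levels, down_levels, g3, g4, g5):
    """Compute (CF134, CF235) as in equation (2).

    levels      -- (L1, L2, L3, L4, L5), levels of the original channels
    down_levels -- (DL134, DL235), measured levels of the two down-mix channels
    g3, g4, g5  -- gains applied during down-mixing
    """
    l1, l2, l3, l4, l5 = levels
    dl134, dl235 = down_levels
    if dl134 <= 0 or dl235 <= 0:
        raise ValueError("down-mix levels must be positive")
    cf134 = (l1 + g3 * l3 + g4 * l4) / dl134
    cf235 = (l2 + g3 * l3 + g5 * l5) / dl235
    return cf134, cf235

# Hypothetical levels: the ideal (unattenuated) sum is 3.0 for both down-mix
# channels, but the measured down-mix levels are lower.
cf134, cf235 = compensation_parameters(
    levels=(1.0, 1.0, 2.0, 2.0, 2.0),
    down_levels=(2.0, 1.5),
    g3=0.5, g4=0.5, g5=0.5,
)
print(cf134, cf235)
```

Because equation (1) bounds each down-mix level by the weighted sum in the numerator, each compensation parameter is at least 1, and it equals 1 exactly when no attenuation occurred.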
In the present embodiment, a compensation parameter is calculated for each down-mix channel in order to reduce the amount of data to be transmitted. However, a compensation parameter may instead be calculated for each channel of the multi-channel audio signal. In other words, the compensation parameter can be calculated as the ratio of the energy of the down-mix audio signal to the energy of each channel of the multi-channel audio signal, or as the ratio of the envelope of the down-mix audio signal to the envelope of each channel of the multi-channel audio signal.
The spatial information calculation unit 130 calculates spatial information. Examples of spatial information include channel level difference (CLD) information, inter-channel cross-correlation (ICC) information, and channel prediction coefficient (CPC) information.
The core encoder 140 encodes the down-mix audio signal. The parameter encoder 150 generates additional information by encoding the spatial information and the compensation parameter. The compensation parameter can be encoded using the same methods that are used to encode CLD information. For example, the compensation parameter can be encoded using a time-differential or frequency-differential coding method, a grouped pulse code modulation (PCM) coding method, a pilot-based coding method, or a Huffman coding method. The multiplexer 160 generates a bitstream by combining the encoded down-mix audio signal and the additional information. In this manner, a bitstream can be generated that includes, as additional information, a compensation parameter for compensating for attenuation of the level of the down-mix audio signal.
When no level compensation is needed, a flag associated with the compensation parameter can be set to 0, thereby reducing the bit rate of the additional information. If the values of the compensation parameters CF134 and CF235 do not differ greatly, only one of them, which can represent both, is transmitted instead of transmitting both. Furthermore, if the compensation parameter value does not change over time but remains constant, a predetermined flag can be used to indicate that the previous compensation parameter value is to be used.
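The three bit-saving rules above can be pictured as a small decision function. The sketch below is purely illustrative: the field names, the tolerance `tol`, and the order in which the rules are checked are assumptions, and the text does not define an actual bitstream syntax.

```python
def choose_signalling(cf134, cf235, previous=None, tol=0.05):
    """Decide what to transmit for one frame (hypothetical field names).

    previous -- the (CF134, CF235) pair sent for the preceding frame, if any
    tol      -- assumed threshold for treating two values as equal
    """
    # Rule 1: no level compensation needed, so only a zero flag is sent.
    if abs(cf134 - 1.0) <= tol and abs(cf235 - 1.0) <= tol:
        return {"flag": 0}
    # Rule 2: values unchanged over time, so signal reuse of the previous values.
    current = (cf134, cf235)
    if previous is not None and all(abs(a - b) <= tol for a, b in zip(current, previous)):
        return {"flag": 1, "reuse_previous": True}
    # Rule 3: the two parameters are close, so one of them represents both.
    if abs(cf134 - cf235) <= tol:
        return {"flag": 1, "reuse_previous": False, "values": [cf134]}
    # Otherwise both parameters are transmitted.
    return {"flag": 1, "reuse_previous": False, "values": [cf134, cf235]}

print(choose_signalling(1.0, 1.01))
print(choose_signalling(1.5, 1.52))
print(choose_signalling(1.5, 2.0, previous=(1.5, 2.0)))
print(choose_signalling(1.5, 2.0))
```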
In the present embodiment, the compensation parameter is set based on a comparison between the level of the input multi-channel audio signal and the level of the down-mix audio signal. However, the compensation parameter may be set or estimated using methods other than the one described above. In other words, because the compensation parameter models the attenuation of the down-mix audio signal level relative to the level of the input multi-channel audio signal used to generate it, the compensation parameter can be defined as a level ratio, as waveform data, or as a gain compensation value having linear or nonlinear characteristics. By using such a mathematically modeled value as the compensation parameter value, the compensation parameter can be transmitted efficiently, and the down-mix audio signal compensated, using only a small number of bits.
FIG. 3 is a block diagram of a decoding apparatus according to an embodiment of the present invention. Referring to FIG. 3, the decoding apparatus includes a demultiplexer 310, a core decoder 320, a parameter decoder 330, and a multi-channel synthesis unit 340.
The demultiplexer 310 demultiplexes additional information and an encoded down-mix audio signal from an input bitstream. The core decoder 320 generates a down-mix audio signal by decoding the encoded down-mix audio signal. The parameter decoder 330 generates spatial information and a compensation parameter based on the additional information obtained by the demultiplexer 310. The multi-channel synthesis unit 340 generates a multi-channel audio signal based on the down-mix audio signal obtained by the core decoder 320 and the spatial information and compensation parameter obtained by the parameter decoder 330.
FIG. 4 is a flowchart illustrating the operation of the decoding apparatus shown in FIG. 3 according to an embodiment of the present invention. Referring to FIGS. 3 and 4, in step S400, a bitstream of a multi-channel audio signal is received. In step S405, the demultiplexer 310 demultiplexes an encoded down-mix audio signal and additional information from the received bitstream. In step S410, the core decoder 320 generates a down-mix audio signal by decoding the encoded down-mix audio signal. In step S420, the parameter decoder 330 generates a compensation parameter and spatial information by decoding the additional information. In step S430, the multi-channel synthesis unit 340 generates a multi-channel audio signal based on the spatial information and the down-mix audio signal. In step S440, the multi-channel synthesis unit 340 compensates the multi-channel audio signal using the compensation parameter. Specifically, the multi-channel synthesis unit 340 can compensate the output of each of the channels obtained by decoding based on the down-mix audio signal and the spatial information, as shown in equation (3):
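Mathematical expression 3
$$L_1'' = L_1' \cdot CF_{134}$$
$$L_2'' = L_2' \cdot CF_{235}$$
$$L_3'' = L_3' \cdot \frac{CF_{134} + CF_{235}}{2}$$
$$L_4'' = L_4' \cdot CF_{134}$$
$$L_5'' = L_5' \cdot CF_{235}$$
where L1', L2', L3', L4', and L5' denote the energy levels of the decoded channels, L1'' to L5'' denote the corresponding compensated levels, and CF134 and CF235 denote the compensation parameters.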
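A decoder-side sketch of equation (3) follows. The function name `compensate_channels` and the sample values are hypothetical; the scaling itself follows the equation, with channel 3 (which feeds both down-mix channels) scaled by the mean of the two compensation parameters.

```python
def compensate_channels(decoded_levels, cf134, cf235):
    """Apply equation (3) to the decoded channel levels (L1', ..., L5')."""
    l1, l2, l3, l4, l5 = decoded_levels
    return (
        l1 * cf134,                  # channel 1 belongs to down-mix channel 1
        l2 * cf235,                  # channel 2 belongs to down-mix channel 2
        l3 * (cf134 + cf235) / 2,    # channel 3 is shared by both down-mix channels
        l4 * cf134,                  # channel 4 belongs to down-mix channel 1
        l5 * cf235,                  # channel 5 belongs to down-mix channel 2
    )

compensated = compensate_channels((1.0, 2.0, 4.0, 1.0, 2.0), cf134=1.5, cf235=2.0)
print(compensated)
```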
In this manner, signal distortion at a predetermined frequency can be prevented during decoding by using the compensation parameter received together with the spatial information, so that the multi-channel audio signal obtained by decoding is appropriately compensated. In the present embodiment, the output of each channel is compensated using the compensation parameter. However, the present invention is not limited thereto. For example, when the envelope of each channel is transmitted as the compensation parameter, the spatial information need not be transmitted, because it can be generated from the envelopes of the channels. Even when no spatial information is received, the decoding apparatus can extract pseudo spatial information from an input down-mix audio signal having two or more down-mix channels, and decode the input down-mix audio signal based on the pseudo spatial information.
FIG. 5 is a block diagram of a decoding apparatus according to another embodiment of the present invention. Referring to FIG. 5, the decoding apparatus generates a multi-channel audio signal based only on the down-mix audio signal, without using spatial information as additional information.
Referring to FIG. 5, the decoding apparatus includes a core decoder 510, a framing unit 520, a spatial information estimation unit 530, and a multi-channel synthesis unit 540.
The core decoder 510 generates a down-mix audio signal by decoding an input bitstream and sends the down-mix audio signal to the framing unit 520. The down-mix audio signal may be a matrix-type down-mix signal obtained using, for example, Pro Logic or Logic 7, but the present invention is not limited thereto.
The framing unit 520 arranges the data of the down-mix audio signal obtained by the core decoder 510 so that the down-mix audio signal can be synchronized in units of spatial audio coding (SAC) frames. During this framing operation, if a quadrature mirror filter (QMF)/hybrid-domain signal is generated from the down-mix audio signal obtained by the core decoder 510 using an analysis filter bank, the framing unit 520 can send the hybrid-domain signal to the multi-channel synthesis unit 540, because the hybrid-domain signal can readily be used in the decoding operation.
The spatial information estimation unit 530 generates spatial information, such as CLD, ICC, and CPC information, based on the down-mix audio signal obtained by the framing unit 520. Specifically, the spatial information estimation unit 530 generates spatial information for each SAC frame. In this case, it can collect down-mix audio signal data until the combined length of the collected data equals the frame length, and then process the collected data. Alternatively, the spatial information estimation unit 530 can generate spatial information for each PCM sample. The spatial information generated by the spatial information estimation unit 530 is not transmitted and therefore does not need to be compressed, for example by quantization. Accordingly, it can carry as much information as possible.
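The text does not specify how the spatial information is estimated from the down-mix audio signal. As a loose illustration of frame-wise processing only, the sketch below computes, for each complete frame of a two-channel down-mix, a level difference in dB and a normalized cross-correlation between the two down-mix channels. Using these two quantities as stand-ins for CLD-like and ICC-like values is an assumption, as are the function name `frame_parameters` and the guard constant `eps`.

```python
import math

def frame_parameters(left, right, frame_len, eps=1e-12):
    """Return one (level_difference_dB, correlation) pair per complete frame.

    Samples are processed only once a full frame of frame_len samples is
    available; a trailing partial frame is left unprocessed.
    """
    params = []
    for start in range(0, min(len(left), len(right)) - frame_len + 1, frame_len):
        l = left[start:start + frame_len]
        r = right[start:start + frame_len]
        e_l = sum(v * v for v in l)            # energy of the left frame
        e_r = sum(v * v for v in r)            # energy of the right frame
        cross = sum(a * b for a, b in zip(l, r))
        level_diff_db = 10 * math.log10((e_l + eps) / (e_r + eps))
        correlation = cross / math.sqrt((e_l + eps) * (e_r + eps))
        params.append((level_diff_db, correlation))
    return params

left = [2.0] * 9     # nine samples: two complete frames of four, one sample left over
right = [1.0] * 9
estimates = frame_parameters(left, right, frame_len=4)
print(estimates)
```

Because the estimated values never leave the decoder, they can be kept at full floating-point precision rather than being quantized, which matches the remark above that no compression is needed.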
The multi-channel synthesis unit 540 generates a multi-channel audio signal based on the down-mix audio signal obtained by the framing unit 520 and the spatial information generated by the spatial information estimation unit 530.
According to the present embodiment, the bit rate can be reduced compared with conventional methods in which spatial information is transmitted as additional information. In addition, a multi-channel signal can be generated using the same method that is generally used for matrix-type down-mix content.
FIG. 6 is a block diagram of a decoding apparatus according to another embodiment of the present invention. Referring to FIG. 6, when a bitstream including not only a down-mix audio signal but also spatial information is received, the decoding apparatus generates additional spatial information based on the spatial information included in the received bitstream, and decodes the down-mix audio signal using the additional spatial information.
Referring to FIG. 6, the decoding apparatus includes a demultiplexer 610, a core decoder 620, a framing unit 630, a spatial information estimation unit 640, a multi-channel synthesis unit 650, and a combining unit 660.
The demultiplexer 610 demultiplexes spatial information and an encoded down-mix audio signal from an input bitstream. The core decoder 620 generates a down-mix audio signal by decoding the encoded down-mix audio signal. The framing unit 630 arranges the data of the down-mix audio signal obtained by the core decoder 620 so that the down-mix audio signal can be synchronized in units of spatial audio coding (SAC) frames. The spatial information estimation unit 640 generates additional spatial information through estimation based on the spatial information obtained by the demultiplexer 610. The combining unit 660 combines the spatial information obtained by the demultiplexer 610 with the additional spatial information generated by the spatial information estimation unit 640, and sends the combined spatial information to the multi-channel synthesis unit 650. The multi-channel synthesis unit 650 then generates a multi-channel audio signal based on the down-mix audio signal generated by the core decoder 620 and the spatial information sent by the combining unit 660.
According to the present embodiment, not only the spatial information included in the input bitstream but also additional spatial information obtained from the down-mix audio signal through estimation can be used. Depending on the type of spatial information included in the input bitstream, a variety of applications are possible, as described in detail below.
When spatial information including only a few time slots and data bands is received, that is, when the bit rate of the spatial information is so low that the number of data bands or the transmission frequency of the spatial information is very low, the spatial information estimation unit 640 generates the missing information based on the received spatial information and the down-mix PCM signal, thereby improving the quality of the multi-channel audio signal. For example, if spatial information including only 5 data bands is received, the spatial information estimation unit 640 can convert it into spatial information including 28 data bands with reference to the down-mix audio signal received together with the spatial information. If spatial information including only 2 time slots is received, the spatial information estimation unit 640 can generate a total of 8 time slots through interpolation, with reference to the down-mix audio signal received together with the spatial information.
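The time-slot expansion can be pictured with plain linear interpolation. The sketch below is a simplification and not the method of the patent: it fills 8 time slots from values known at 2 slots and deliberately ignores the down-mix audio signal, which the text says is also consulted. The function name `interpolate_slots` and the slot positions are hypothetical.

```python
def interpolate_slots(known_values, known_slots, num_slots):
    """Linearly interpolate a parameter over num_slots time slots.

    known_values -- parameter values received in the bitstream
    known_slots  -- ascending slot indices at which those values apply
    Slots before the first or after the last known slot hold the nearest value.
    """
    out = []
    for t in range(num_slots):
        if t <= known_slots[0]:
            out.append(known_values[0])
        elif t >= known_slots[-1]:
            out.append(known_values[-1])
        else:
            for k in range(len(known_slots) - 1):
                s0, s1 = known_slots[k], known_slots[k + 1]
                if s0 <= t <= s1:
                    w = (t - s0) / (s1 - s0)   # interpolation weight in [0, 1]
                    out.append((1 - w) * known_values[k] + w * known_values[k + 1])
                    break
    return out

# Two received time slots (indices 3 and 7) expanded to eight slots.
slots = interpolate_slots([2.0, 6.0], [3, 7], num_slots=8)
print(slots)
```

The same routine could be applied along the frequency axis to expand 5 data bands to 28, although a real decoder would presumably map bands according to its own band tables rather than by uniform interpolation.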
When only part of the spatial information comprising CLD, ICC, and CPC information is received, for example when only ICC information is received, the spatial information estimation unit 640 can generate CLD and CPC information through estimation, thereby improving the quality of the multi-channel audio signal. Likewise, when only CLD information is received, the spatial information estimation unit 640 can generate ICC information through estimation.
An encoding apparatus down-mixes an input multi-channel signal into a down-mix audio signal using one-to-two (OTT) or two-to-three (TTT) boxes. When spatial information corresponding to only some of the OTT or TTT boxes is received, the spatial information estimation unit 640 can generate, through estimation, spatial information corresponding to the other OTT or TTT boxes, and a multi-channel audio signal can be generated based on the received spatial information and the generated spatial information. In this case, the estimation of spatial information can be performed after SAC decoding has been carried out using the received spatial information. For example, if a down-mix audio signal having 2 channels (left (L) and right (R)) and spatial information corresponding to a TTT box are received, the spatial information estimation unit 640 can generate L, center (C), and R channel signals based on the L and R channel signals of the received down-mix audio signal.
Thereafter, the spatial information estimation unit 640 can generate spatial information corresponding to the OTT boxes. The multi-channel synthesis unit 650 then generates a multi-channel audio signal based on the received spatial information and the spatial information generated by the spatial information estimation unit 640. This method can also be applied when the number of output channels is larger. For example, when a bitstream in the 5-2-5 format is input to a decoding apparatus that can provide up to 7 channels, the decoding apparatus generates a 5-channel signal (in the synthesis domain) through SAC decoding, generates through estimation the spatial information needed to expand the 5-channel signal to 7 channels, and performs additional decoding, thereby generating a signal with more channels than the bitstream alone can provide.
The present invention can be realized as computer-readable code written on a computer-readable recording medium. The computer-readable recording medium may be any type of recording device in which data is stored in a computer-readable manner. Examples of the computer-readable recording medium include ROM, RAM, CD-ROM, magnetic tape, floppy disks, optical data storage, and carrier waves (for example, data transmission over the Internet). The computer-readable recording medium can be distributed over a plurality of computer systems connected to a network, so that the computer-readable code is written to it and executed from it in a decentralized manner. Functional programs, code, and code segments needed to realize the present invention can be readily construed by one of ordinary skill in the art.
While the present invention has been particularly shown and described with reference to the above exemplary embodiments, it will be understood by those of ordinary skill in the art that various changes in form and detail may be made without departing from the spirit and scope of the present invention as defined by the following claims.
Industrial Applicability
According to the present invention, a multi-channel audio signal obtained through decoding can be compensated by using, as additional information, a compensating parameter calculated by comparing the level of the input multi-channel audio signal with the level of the down-mix audio signal. In addition, according to the present invention, additional spatial information can be generated based on the input spatial information and the input down-mix audio signal. Therefore, it is possible to prevent distortion, at a predetermined frequency, of the multi-channel audio signal obtained through decoding and to improve the quality of the multi-channel audio signal.
According to the present invention, the down-mix audio signal can be compensated by using a compensating parameter, thereby preventing deterioration of sound quality during the encoding and/or decoding of a multi-channel audio signal.
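The compensating parameter described above can be sketched in Python as follows. This is only an illustration under stated assumptions: the envelope is taken to be the RMS value of non-overlapping frames, the parameter is the per-frame ratio of the down-mix envelope to the envelope of an original channel, and the decoder is assumed to rescale each frame of a decoded channel so that its envelope matches the down-mix envelope divided by the parameter. The function names, the `frame_len` and `eps` arguments, and the way the flag is passed are all hypothetical and are not taken from this document.

```python
import math

def rms_envelope(signal, frame_len):
    """RMS value of each non-overlapping frame (assumed envelope definition)."""
    env = []
    for start in range(0, len(signal), frame_len):
        frame = signal[start:start + frame_len]
        env.append(math.sqrt(sum(x * x for x in frame) / len(frame)))
    return env

def compensation_params(downmix, channel, frame_len, eps=1e-12):
    """Encoder side: ratio of the down-mix envelope to the original channel envelope."""
    e_dm = rms_envelope(downmix, frame_len)
    e_ch = rms_envelope(channel, frame_len)
    return [d / max(c, eps) for d, c in zip(e_dm, e_ch)]

def compensate(decoded, downmix, params, frame_len, apply_flag, eps=1e-12):
    """Decoder side: rescale each frame toward the envelope implied by the parameter.

    If apply_flag is False, the decoded channel is returned unchanged.
    """
    if not apply_flag:
        return list(decoded)
    e_dm = rms_envelope(downmix, frame_len)
    e_dec = rms_envelope(decoded, frame_len)
    out = []
    for i, (d, p, e) in enumerate(zip(e_dm, params, e_dec)):
        target = d / max(p, eps)      # envelope the original channel had
        gain = target / max(e, eps)   # correction for this frame
        frame = decoded[i * frame_len:(i + 1) * frame_len]
        out.extend(gain * x for x in frame)
    return out
```

In this sketch the decoder never sees the original channel; it recovers the target envelope from the received down-mix and the transmitted ratio, which is why the ratio alone is sufficient as additional information.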

Claims (10)

1. A decoding method for a multi-channel audio signal, comprising:
extracting a down-mix audio signal and additional information from an input signal;
extracting spatial information and a compensating parameter from the additional information;
generating a multi-channel audio signal based on the down-mix audio signal and the spatial information; and
compensating the generated multi-channel audio signal based on the compensating parameter and a flag indicating whether the compensating parameter is applied to the generated multi-channel audio signal,
wherein the compensating parameter is calculated by using an envelope of the down-mix audio signal and an envelope of the multi-channel audio signal used to generate the down-mix audio signal, and
the compensating parameter comprises a ratio of the envelope of the down-mix audio signal to the envelope of the multi-channel audio signal used to generate the down-mix audio signal.
2. The method of claim 1, wherein the compensating parameter is applied to each channel of the multi-channel audio signal.
3. The method of claim 1, wherein the spatial information comprises data corresponding to a one-to-two (OTT) box or a two-to-three (TTT) box.
4. The method of claim 1, wherein the spatial information comprises channel level difference (CLD), inter-channel cross correlation (ICC), and channel prediction coefficient (CPC) information.
5. A decoding device for a multi-channel audio signal, comprising:
a demultiplexer which extracts a down-mix audio signal and additional information from an input signal;
a parameter decoder which extracts spatial information and a compensating parameter from the additional information; and
a multi-channel synthesis unit which generates a multi-channel audio signal based on the down-mix audio signal and the spatial information, and compensates the generated multi-channel audio signal based on the compensating parameter and a flag indicating whether the compensating parameter is applied to the generated multi-channel audio signal,
wherein the compensating parameter is calculated by using an envelope of the down-mix audio signal and an envelope of the multi-channel audio signal used to generate the down-mix audio signal, and
the compensating parameter comprises a ratio of the envelope of the down-mix audio signal to the envelope of the multi-channel audio signal used to generate the down-mix audio signal.
6. The device of claim 5, wherein the multi-channel synthesis unit compensates the generated multi-channel audio signal using the compensating parameter.
7. The device of claim 5, wherein the multi-channel synthesis unit applies the compensating parameter to each channel of the multi-channel audio signal.
8. The device of claim 5, wherein the spatial information comprises data corresponding to a one-to-two (OTT) box or a two-to-three (TTT) box.
9. An encoding method, comprising:
calculating spatial information based on a multi-channel audio signal and a down-mix audio signal;
calculating a compensating parameter based on an envelope of the down-mix audio signal and an envelope of the multi-channel audio signal used to generate the down-mix audio signal, the compensating parameter being used to compensate a multi-channel audio signal generated based on the down-mix audio signal and the spatial information; and
generating a bitstream by encoding the spatial information, the compensating parameter, a flag indicating whether the compensating parameter is to be applied, and the down-mix audio signal,
wherein the compensating parameter comprises a ratio of the envelope of the down-mix audio signal to the envelope of the multi-channel audio signal used to generate the down-mix audio signal.
10. An encoding device, comprising:
a spatial information calculation unit which calculates spatial information based on a multi-channel audio signal and a down-mix audio signal;
a compensating parameter calculation unit which calculates a compensating parameter based on an envelope of the multi-channel audio signal used to generate the down-mix audio signal and an envelope of the down-mix audio signal, the compensating parameter being used to compensate a multi-channel audio signal generated based on the down-mix audio signal and the spatial information,
wherein the compensating parameter comprises a ratio of the envelope of the down-mix audio signal to the envelope of the multi-channel audio signal used to generate the down-mix audio signal; and
a bitstream generation unit which generates a bitstream by encoding the spatial information, the compensating parameter, a flag indicating whether the compensating parameter is to be applied, and the down-mix audio signal, and by combining the results of the encoding.
CN2006800385900A 2005-10-20 2006-10-20 Method for encoding and decoding multi-channel audio signal and apparatus thereof Active CN101292285B (en)

Applications Claiming Priority (9)

Application Number Priority Date Filing Date Title
US72830905P 2005-10-20 2005-10-20
US60/728,309 2005-10-20
US73429205P 2005-11-08 2005-11-08
US60/734,292 2005-11-08
US76573006P 2006-02-07 2006-02-07
US60/765,730 2006-02-07
KR10-2006-0102146 2006-10-20
PCT/KR2006/004285 WO2007046660A1 (en) 2005-10-20 2006-10-20 Method for encoding and decoding multi-channel audio signal and apparatus thereof
KR1020060102146A KR20070043651A (en) 2005-10-20 2006-10-20 Method for encoding and decoding multi-channel audio signal and apparatus thereof

Publications (2)

Publication Number Publication Date
CN101292285A CN101292285A (en) 2008-10-22
CN101292285B true CN101292285B (en) 2012-10-10

Family

ID=40035657

Family Applications (2)

Application Number Title Priority Date Filing Date
CN2006800385900A Active CN101292285B (en) 2005-10-20 2006-10-20 Method for encoding and decoding multi-channel audio signal and apparatus thereof
CN2006800385883A Active CN101292284B (en) 2005-10-20 2006-10-20 Method for encoding and decoding multi-channel audio signal and apparatus thereof

Family Applications After (1)

Application Number Title Priority Date Filing Date
CN2006800385883A Active CN101292284B (en) 2005-10-20 2006-10-20 Method for encoding and decoding multi-channel audio signal and apparatus thereof

Country Status (1)

Country Link
CN (2) CN101292285B (en)

Families Citing this family (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101521013B (en) * 2009-04-08 2011-08-17 武汉大学 Spatial audio parameter bidirectional interframe predictive coding and decoding devices
JP5298245B2 (en) * 2009-12-16 2013-09-25 ドルビー インターナショナル アーベー SBR bitstream parameter downmix
US9852735B2 (en) 2013-05-24 2017-12-26 Dolby International Ab Efficient coding of audio scenes comprising audio objects
CN104200827B (en) * 2014-09-05 2017-04-19 赵平 Method and device for obtaining internet audio file
CN114898761A (en) * 2017-08-10 2022-08-12 华为技术有限公司 Stereo signal coding and decoding method and device

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
Seungkwon Beack et al. An Efficient Representation Method for ICLD with Robustness to Spectral Distortion. ETRI Journal, 2005, Vol. 27, No. 3, full text. *

Also Published As

Publication number Publication date
CN101292284B (en) 2012-10-10
CN101292284A (en) 2008-10-22
CN101292285A (en) 2008-10-22

Similar Documents

Publication Publication Date Title
EP1952392B1 (en) Method, apparatus and computer-readable recording medium for decoding a multi-channel audio signal
JP4601669B2 (en) Apparatus and method for generating a multi-channel signal or parameter data set
CN101379555B (en) Apparatus and method for encoding/decoding signal
US20080052089A1 (en) Acoustic Signal Encoding Device and Acoustic Signal Decoding Device
US20070168183A1 (en) Audio distribution system, an audio encoder, an audio decoder and methods of operation therefore
CN104681030A (en) Apparatus and method for encoding/decoding signal
GB2390788A (en) Audio decoding method and apparatus which recovers high frequency component with small computation.
JP4568363B2 (en) Audio signal decoding method and apparatus
CN101292285B (en) Method for encoding and decoding multi-channel audio signal and apparatus thereof
TWI390993B (en) Method for encoding and decoding multi-channel audio signal and apparatus thereof

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
C14 Grant of patent or utility model
GR01 Patent grant