EP2880653B1 - Decoder and method for multi-instance spatial-audio-object-coding employing a parametric concept for multichannel downmix/upmix cases - Google Patents
Decoder and method for multi-instance spatial-audio-object-coding employing a parametric concept for multichannel downmix/upmix cases Download PDFInfo
- Publication number
- EP2880653B1 EP2880653B1 EP13745103.5A EP13745103A EP2880653B1 EP 2880653 B1 EP2880653 B1 EP 2880653B1 EP 13745103 A EP13745103 A EP 13745103A EP 2880653 B1 EP2880653 B1 EP 2880653B1
- Authority
- EP
- European Patent Office
- Prior art keywords
- channels
- downmix
- channel
- processing units
- depending
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Active
Links
- 238000000034 method Methods 0.000 title claims description 44
- 238000012545 processing Methods 0.000 claims description 144
- 238000004590 computer program Methods 0.000 claims description 13
- 238000009877 rendering Methods 0.000 claims description 13
- 230000005236 sound signal Effects 0.000 description 11
- 230000003595 spectral effect Effects 0.000 description 8
- 239000000203 mixture Substances 0.000 description 7
- 230000005540 biological transmission Effects 0.000 description 6
- 238000000926 separation method Methods 0.000 description 6
- 238000013459 approach Methods 0.000 description 5
- 238000005516 engineering process Methods 0.000 description 4
- 238000012986 modification Methods 0.000 description 4
- 230000004048 modification Effects 0.000 description 4
- 239000011159 matrix material Substances 0.000 description 3
- YYAVXASAKUOZJJ-UHFFFAOYSA-N 4-(4-butylcyclohexyl)benzonitrile Chemical compound C1CC(CCCC)CCC1C1=CC=C(C#N)C=C1 YYAVXASAKUOZJJ-UHFFFAOYSA-N 0.000 description 2
- BBHJTCADCKZYSO-UHFFFAOYSA-N 4-(4-ethylcyclohexyl)benzonitrile Chemical compound C1CC(CC)CCC1C1=CC=C(C#N)C=C1 BBHJTCADCKZYSO-UHFFFAOYSA-N 0.000 description 2
- 101100180304 Arabidopsis thaliana ISS1 gene Proteins 0.000 description 2
- 208000003381 Pontocerebellar hypoplasia type 1 Diseases 0.000 description 2
- 208000037507 Pontocerebellar hypoplasia type 11 Diseases 0.000 description 2
- 208000003369 Pontocerebellar hypoplasia type 2 Diseases 0.000 description 2
- 101100519257 Saccharomyces cerevisiae (strain ATCC 204508 / S288c) PDR17 gene Proteins 0.000 description 2
- 101100042407 Saccharomyces cerevisiae (strain ATCC 204508 / S288c) SFB2 gene Proteins 0.000 description 2
- 238000010586 diagram Methods 0.000 description 2
- 238000004519 manufacturing process Methods 0.000 description 2
- 208000003383 pontocerebellar hypoplasia type 3 Diseases 0.000 description 2
- 201000003034 pontocerebellar hypoplasia type 4 Diseases 0.000 description 2
- 230000002123 temporal effect Effects 0.000 description 2
- 208000033164 type 11 pontocerebellar hypoplasia Diseases 0.000 description 2
- -1 ISS2 Proteins 0.000 description 1
- 241001025261 Neoraja caerulea Species 0.000 description 1
- 101100356268 Schizosaccharomyces pombe (strain 972 / ATCC 24843) red1 gene Proteins 0.000 description 1
- 238000004891 communication Methods 0.000 description 1
- 238000000354 decomposition reaction Methods 0.000 description 1
- 239000000284 extract Substances 0.000 description 1
- 230000003993 interaction Effects 0.000 description 1
- 230000003068 static effect Effects 0.000 description 1
- 238000012546 transfer Methods 0.000 description 1
- 230000000007 visual effect Effects 0.000 description 1
- 230000001755 vocal effect Effects 0.000 description 1
Images
Classifications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L19/008—Multichannel audio signal coding or decoding using interchannel correlation to reduce redundancy, e.g. joint-stereo, intensity-coding or matrixing
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04S—STEREOPHONIC SYSTEMS
- H04S2400/00—Details of stereophonic systems covered by H04S but not provided for in its groups
- H04S2400/03—Aspects of down-mixing multi-channel audio to configurations with lower numbers of playback channels, e.g. 7.1 -> 5.1
Definitions
- the present invention relates to a decoder and a method for multi-instance spatial-audio-object-coding (M-SAOC) employing a parametric concept for multichannel downmix/upmix cases.
- M-SAOC multi-instance spatial-audio-object-coding
- multi-channel audio content brings along significant improvements for the user. For example, a three-dimensional hearing impression can be obtained, which brings along an improved user satisfaction in entertainment applications.
- multi-channel audio content is also useful in professional environments, for example, in telephone conferencing applications, because the talker intelligibility can be improved by using a multi-channel audio playback.
- Another possible application is to offer to a listener of a musical piece to individually adjust playback level and/or spatial position of different parts (also termed as "audio objects") or tracks, such as a vocal part or different instruments.
- the user may perform such an adjustment for reasons of personal taste, for easier transcribing one or more part(s) from the musical piece, educational purposes, karaoke, rehearsal, etc.
- MPEG Moving Picture Experts Group
- MPS MPEG Surround
- SAOC MPEG Spatial Audio Object Coding
- JSC MPEG Spatial Audio Object Coding
- the temporal dimension is represented by the time-block number and the spectral dimension is captured by the spectral coefficient ("bin") number.
- the temporal dimension is represented by the time-slot number and the spectral dimension is captured by the sub-band number. If the spectral resolution of the QMF is improved by subsequent application of a second filter stage, the entire filter bank is termed hybrid QMF and the fine resolution sub-bands are termed hybrid sub-bands.
- Multi-channel 5.1 audio formats are already standard in DVD and Blue-Ray productions. New audio formats like MPEG-H 3D Audio with even more audio transport channels appear at the horizon, which will provide the end-users a highly immersive audio experience.
- Parametric audio object coding schemes are currently restricted to a maximum of two downmix channels. They can only be applied to some extend on multi-channel mixtures, for example on only two selected downmix channels. The flexibility these coding schemes offer to the user to adjust the audio scene to his/her own preferences is thus severely limited, e.g., with respect to changing audio level of the sports commentator and the atmosphere in sports broadcast.
- the object of the present invention is to provide improved concepts for audio object coding.
- the object of the present invention is solved by a decoder according to claim 1, by a method according to claim 10 and by a computer program according to claim 11.
- a decoder for generating an audio output signal comprising one or more audio output channels from a downmix signal comprising three or more downmix channels, wherein the downmix signal encodes three or more audio object signals is provided.
- the decoder comprises an input channel router for receiving the three or more downmix channels and for receiving side information, and at least two channel processing units for generating at least two processed channels to obtain the one or more audio output channels.
- the input channel router is configured to feed each of at least two of the three or more downmix channels into at least one of the at least two channel processing units, so that each of the at least two channel processing units receives one or more of the three or more downmix channels, and so that each of the at least two channel processing units receives less than the total number of the three or more downmix channels.
- Each channel processing unit of the at least two channel processing units is configured to generate one or more of the at least two processed channels depending on the side information and depending on said one or more of the at least two of the three or more downmix channels received by said channel processing unit from the input channel router.
- a downmix can be produced which is optimized for the parametric separation at the decoder side regarding perceived quality.
- Embodiments extend the parametric part of the SAOC scheme to an arbitrary number of downmix/upmix channels.
- the inventive method further allows fully flexible mixing of the audio objects.
- the input channel router may be configured to feed each of the at least two of the three or more downmix channels into exactly one of the at least two channel processing units.
- the input channel router may be configured to feed each of the three or more downmix channels into at least one of the at least two channel processing units, so that each of the three or more downmix channels is received by one or more of the at least two channel processed units.
- each of the at least two channel processing units may be configured to generate said one or more of the at least two processed channels independent from at least one of three or more downmix channels.
- each of the at least two channel processing units may either be a mono processing unit or a stereo processing unit, wherein said mono processing unit may be configured to receive exactly one of the three or more downmix channels and is configured to generate exactly one or exactly two of the at least two processed channels depending on said exactly one of the three or more downmix channels and depending on the side information, and wherein said stereo processing unit may be configured to receive exactly two of the three or more downmix channels and is configured to generate exactly one or exactly two of the at least two processed channels depending on said exactly two of the three or more downmix channels and depending on the side information.
- At least one of the at least two channel processing units may be configured to receive exactly one of the three or more downmix channels and being configured to generate exactly two of the at least two processed channels depending on said exactly one of the three or more downmix channels and depending on the side information.
- At least one of the at least two channel processing units may be configured to receive exactly two of the three or more downmix channels and being configured to generate exactly one of the at least two processed channels depending on said exactly two of the three or more downmix channels and depending on the side information.
- the input channel router may be configured to receive four or more downmix channels, and at least one of the at least two channel processing units may be configured to receive at least three of the four or more downmix channels and may be configured to generate at least three of the processed channels depending on said at least three of the four or more downmix channels and depending on the side information.
- At least one of the at least two channel processing units may be configured to receive exactly three of the four or more downmix channels and may be configured to generate exactly three of the processed channels depending on said exactly three of the four or more downmix channels and depending on the side information.
- the input channel router may be configured to receive six or more downmix channels, and wherein at least one of the at least two channel processing units may be configured to receive exactly five of the six or more downmix channels and is configured to generate exactly five of the processed channels depending on said exactly five of the six or more downmix channels and depending on the side information.
- the input channel router is configured to not feed at least one of the three or more downmix channels into any of the at least two channel processing units, so that said at least one of the three or more downmix channels is not received by any of the at least two channel processed units.
- the decoder further comprises an output channel router for combining the at least two processed channels to obtain an estimation of the audio object signals.
- the decoder further comprises a renderer, wherein the renderer is configured to receive rendering information, and wherein the renderer is configured to generate the one or more audio output channels depending on the estimation of the audio object signals and depending on the rendering information.
- the at least two channel processing units are configured to generate the at least two processed channels in parallel.
- a first channel processing unit of the at least two channel processing units may be configured to feed a first processed channel of the at least two processed channels into a second channel processing unit of the at least two channel processing units.
- Said second processing unit may be configured to generate a second processed channel of the at least two processed channels depending on the first processed channel.
- Fig. 2 shows a general arrangement of an SAOC encoder 10 and an SAOC decoder 12.
- the SAOC encoder 10 receives as an input N objects, i.e., audio signals s 1 to s N .
- the encoder 10 comprises a downmixer 16 which receives the audio signals s 1 to s N and downmixes same to a downmix signal 18.
- the downmix may be provided externally ("artistic downmix") and the system estimates additional side information to make the provided downmix match the calculated downmix.
- the downmix signal is shown to be a P-channel signal.
- side-information estimator 17 provides the SAOC decoder 12 with side information including SAOC-parameters.
- SAOC parameters comprise object level differences (OLD), inter-object correlations (IOC) (inter-object cross correlation parameters), downmix gain values (DMG) and downmix channel level differences (DCLD).
- the SAOC decoder 12 comprises an up-mixer which receives the downmix signal 18 as well as the side information 20 in order to recover and render the audio signals ⁇ 1 and ⁇ N onto any user-selected set of channels ⁇ 1 to ⁇ M , with the rendering being prescribed by rendering information 26 input into SAOC decoder 12.
- the audio signals s 1 to s N may be input into the encoder 10 in any coding domain, such as, in time or spectral domain.
- encoder 10 may use a filter bank, such as a hybrid QMF bank, in order to transfer the signals into a spectral domain, in which the audio signals are represented in several sub-bands associated with different spectral portions, at a specific filter bank resolution. If the audio signals s 1 to s N are already in the representation expected by encoder 10, same does not have to perform the spectral decomposition.
- Fig. 1 illustrates a decoder for generating an audio output signal comprising one or more audio output channels from a downmix signal comprising three or more downmix channels according to an embodiment.
- the downmix signal encodes three or more audio object signals.
- the decoder comprises an input channel router 110 for receiving the three or more downmix channels DMX1, DMX2, DMX3 and for receiving side information SI, and at least two channel processing units 121, 122 for generating at least two processed channels to obtain the one or more audio output channels.
- the input channel router 110 is configured to feed each of at least two of the three or more downmix channels DMX1, DMX2 DMX3 into at least one of the at least two channel processing units 121, 122, so that each of the at least two channel processing units 121, 122 receives one or more of the three or more downmix channels, and so that each of the at least two channel processing units 121, 122 receives less than the total number of the three or more downmix channels DMX1, DMX2, DMX3.
- each of the three downmix channels DMX1, DMX2, DMX3 are fed into exactly one channel processing unit.
- not all of the three or more downmix channels received by the input channel router 110 are fed into a processing unit.
- each of at least two downmix channels of the three or more downmix channels will be fed into at least one of the channel processing units.
- Each channel processing unit of the at least two channel processing units 121, 122 is configured to generate one or more of the at least two processed channels depending on the side information SI and depending on said one or more of the at least two of the three or more downmix channels (DMX1, DMX2, DMX3) received by said channel processing unit 121, 122, from the input channel router 110.
- DMX1, DMX2, DMX3 downmix channels
- channel processing unit 121 receives two downmix channels (DMX1 DMX2) for generating two processed channels (PCH1, PCH2).
- processing unit 121 may be considered as a stereo-to-stereo processing unit.
- channel processing unit 122 receives downmix channel DMX3 for generating two processed channels (PCH3, PCH4).
- the processed channels PCH1, PCH2, PCH3, PCH4 are the audio output channels generated by the decoder.
- the audio output channels are generated depending on the processed channels e.g. by employing rendering information.
- the side information may for example comprise downmix information which indicates how audio objects have been downmixed to obtain the three or more downmix channels.
- the side information may also comprise information on a covariance matrix of size N x N, which may indicate for N audio objects or N audio object signals, which are encoded, the OLD and IOC parameters of these N audio objects.
- a channel processing unit of the at least two processing units 121, 122 may, for example, be a mono-to-mono processing unit which implements a mono to mono "x-1-1" processing mode. Or, a channel processing unit of the at least two processing units 121, 122 may, for example, be configured to implement a mono to stereo "x-1-2" processing mode. Or, a channel processing unit of the at least two processing units 121, 122 may, for example, be configured to implement a stereo to mono "x-2-1" processing mode. Or, a channel processing unit of the at least two processing units 121, 122 may, for example, be a stereo-to-stereo processing unit which implements a stereo to stereo "x-2-2" processing mode.
- the mono to mono "x-1-1" processing mode, the mono to stereo “x-1-2” processing mode, the stereo to mono “x-2-1” processing mode and the stereo to stereo “x-2-2” processing mode are described in the SAOC Standard (see [SAOC]), as decoding modes of the SAOC standard.
- each of the at least two channel processing units 121, 122 may either be a mono processing unit or a stereo processing unit, wherein said mono processing unit is configured to receive exactly one of the three or more downmix channels and is configured to generate exactly one or exactly two of the at least two processed channels depending on said exactly one of the three or more downmix channels and depending on the side information, and wherein said stereo processing unit is configured to receive exactly two of the three or more downmix channels and is configured to generate exactly one or exactly two of the at least two processed channels depending on said exactly two of the three or more downmix channels and depending on the side information.
- At least one of the at least two channel processing units 121, 122 may be configured to receive exactly one of the three or more downmix channels and being configured to generate exactly two of the at least two processed channels depending on said exactly one of the three or more downmix channels and depending on the side information.
- At least one of the at least two channel processing units 121, 122 may be configured to receive exactly two of the three or more downmix channels and is configured to generate exactly one of the at least two processed channels depending on said exactly two of the three or more downmix channels and depending on the side information.
- a channel processing unit of the at least two processing units 121, 122 may, for example, implement a mono downmix ("x-1-5") processing mode for generating five processed channels from a mono downmix channel.
- a channel processing unit of the at least two processing units 121, 122 may, for example, implement a stereo downmix ("x-2-5") processing mode for generating five processed channels from a two downmix channels.
- the mono downmix (“x-1-5”) processing mode and the stereo downmix (“x-2-5”) processing mode are described in the SAOC Standard (see [SAOC]), as transcoding modes of the SAOC standard.
- one, some or all of the channel processing units 121, 122 may be configured differently.
- the input channel router 110 may be configured to receive four or more downmix channels, and at least one of the at least two channel processing units 121, 122 may be configured to receive at least three of the four or more downmix channels and may be configured to generate at least three of the processed channels depending on said at least three of the four or more downmix channels and depending on the side information.
- At least one of the at least two channel processing units 121, 122 may be configured to receive exactly three of the four or more downmix channels and may be configured to generate exactly three of the processed channels depending on said exactly three of the four or more downmix channels and depending on the side information.
- the input channel router 110 may be configured to receive six or more downmix channels, and wherein at least one of the at least two channel processing units 121, 122 may be configured to receive exactly five of the six or more downmix channels and is configured to generate exactly five of the processed channels depending on said exactly five of the six or more downmix channels and depending on the side information.
- the input channel router may be configured to feed each of the at least two of the three or more downmix channels into exactly one of the at least two channel processing units 121, 122.
- the downmix channels DMX1, DMX2, DMX3 is fed into two or more of the channel processing units 121, 122, as, e.g. in the example of Fig. 1 .
- one or more of the downmix channels may be fed into more than one channel processing unit.
- the input channel router 110 may be configured to feed each of the three or more downmix channels into at least one of the at least two channel processing units 121, 122, so that each of the three or more downmix channels is received by one or more of the at least two channel processed units 121, 122.
- the input channel router 110 is configured to not feed at least one of the three or more downmix channels into any of the at least two channel processing units 121, 122, so that said at least one of the three or more downmix channels is not received by any of the at least two channel processed units.
- each of the at least two channel processing units 121, 122 may be configured to generate said one or more of the at least two processed channels independent from at least one of the three or more downmix channels.
- none of the channel processing unit receives all of the downmix channels DMX1, DMX2, DMX3, as illustrated by Fig. 1 .
- the multichannel downmix processing functionality can be realized by the (cascaded or/and parallel) application of multiple SAOC decoders/transcoder instances (or their parts).
- Fig. 3 depicts a schematic illustration showing the principle of combining multiple SAOC mono and stereo decoders/transcoder instances in parallel to parametrically decode a multi-channel signal mixture according to an embodiment.
- the channel processing units 121, 122, 123, 124, 125, 126 of Fig. 3 are configured to generate the at least two processed channels in parallel.
- the channel processing units 121, 122, 123, 124, 125, 126 may be configured to generate the at least two processed channels in parallel so that each of the at least two channel processing units starts generating one of the at least two processed channels, before any other channel processing unit of the at least two channel processing units finishes generating another one of the at least two processed channels.
- the input channel router 110 of Fig. 3 routes the input channels to the several decoders/transcoders. It should be noted that the decoders/transcoders can be driven with any arbitrary number of input channels and are not restricted to mono or stereo signals only, as depicted in Fig. 3 for visual clarity.
- the decoder further comprises an output channel router 130 for combining the at least two processed channels to obtain the one or more audio output channels.
- the (processed) signals processed from the decoders/transcoders units are fed into the output channel router 130.
- the output channel router 130 combines the several input streams and yields a final estimation of the audio object signals to the renderer 140.
- the decoder further comprises a renderer 140.
- the renderer 140 is configured to receive rendering information, wherein the renderer is configured to generate the one or more audio output channels depending on the at least two processed channels and depending on the rendering information.
- Fig. 4 depicts a schematic diagram illustrating the principle of a cascaded SAOC mono and stereo decoders/transcoder structure to process a multi-channel signal mixture according to an embodiment.
- a first channel processing unit 121 of the at least two channel processing units may be configured to feed a first processed channel PCH11 of the at least two processed channels into a second channel processing unit 126 of the at least two channel processing units.
- Said second processing unit 126 may be configured to generate a second processed channel PCH22 of the at least two processed channels depending on the first processed channel PCH11.
- decoders/transcoders can be static and given a priori, but also be adapted dynamically.
- This approach represents a fully SAOC backward compatible extension method of handling multichannel downmix systems.
- the presented inventive embodiments can be applied on an arbitrary number of downmix / upmix channels. It can be combined with any current and also future audio formats.
- the flexibility of the inventive method allows bypassing of unaltered channels to reduce computational complexity, reduce bitstream payload / reduced data amount.
- Some examples relate to an audio encoder, method or computer program for encoding. Moreover, some embodiments relate to an audio decoder, method or computer program for decoding as described above. Furthermore, some examples relate to an encoded signal.
- aspects have been described in the context of an apparatus, it is clear that these aspects also represent a description of the corresponding method, where a block or device corresponds to a method step or a feature of a method step. Analogously, aspects described in the context of a method step also represent a description of a corresponding block or item or feature of a corresponding apparatus.
- the decomposed signal can be stored on a digital storage medium or can be transmitted on a transmission medium such as a wireless transmission medium or a wired transmission medium such as the Internet.
- embodiments of the invention can be implemented in hardware or in software.
- the implementation can be performed using a digital storage medium, for example a floppy disk, a DVD, a CD, a ROM, a PROM, an EPROM, an EEPROM or a FLASH memory, having electronically readable control signals stored thereon, which cooperate (or are capable of cooperating) with a programmable computer system such that the respective method is performed.
- a digital storage medium for example a floppy disk, a DVD, a CD, a ROM, a PROM, an EPROM, an EEPROM or a FLASH memory, having electronically readable control signals stored thereon, which cooperate (or are capable of cooperating) with a programmable computer system such that the respective method is performed.
- Some embodiments according to the invention comprise a non-transitory data carrier having electronically readable control signals, which are capable of cooperating with a programmable computer system, such that one of the methods described herein is performed.
- embodiments of the present invention can be implemented as a computer program product with a program code, the program code being operative for performing one of the methods when the computer program product runs on a computer.
- the program code may for example be stored on a machine readable carrier.
- inventions comprise the computer program for performing one of the methods described herein, stored on a machine readable carrier.
- an embodiment of the inventive method is, therefore, a computer program having a program code for performing one of the methods described herein, when the computer program runs on a computer.
- a further embodiment of the inventive methods is, therefore, a data carrier (or a digital storage medium, or a computer-readable medium) comprising, recorded thereon, the computer program for performing one of the methods described herein.
- a further embodiment of the inventive method is, therefore, a data stream or a sequence of signals representing the computer program for performing one of the methods described herein.
- the data stream or the sequence of signals may for example be configured to be transferred via a data communication connection, for example via the Internet.
- a further embodiment comprises a processing means, for example a computer, or a programmable logic device, configured to or adapted to perform one of the methods described herein.
- a processing means for example a computer, or a programmable logic device, configured to or adapted to perform one of the methods described herein.
- a further embodiment comprises a computer having installed thereon the computer program for performing one of the methods described herein.
- a programmable logic device for example a field programmable gate array
- a field programmable gate array may cooperate with a microprocessor in order to perform one of the methods described herein.
- the methods are preferably performed by any hardware apparatus.
Landscapes
- Engineering & Computer Science (AREA)
- Physics & Mathematics (AREA)
- Mathematical Physics (AREA)
- Computational Linguistics (AREA)
- Signal Processing (AREA)
- Health & Medical Sciences (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Human Computer Interaction (AREA)
- Acoustics & Sound (AREA)
- Multimedia (AREA)
- Stereophonic System (AREA)
- Compression, Expansion, Code Conversion, And Decoders (AREA)
Applications Claiming Priority (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US201261679412P | 2012-08-03 | 2012-08-03 | |
PCT/EP2013/066374 WO2014020181A1 (en) | 2012-08-03 | 2013-08-05 | Decoder and method for multi-instance spatial-audio-object-coding employing a parametric concept for multichannel downmix/upmix cases |
Publications (2)
Publication Number | Publication Date |
---|---|
EP2880653A1 EP2880653A1 (en) | 2015-06-10 |
EP2880653B1 true EP2880653B1 (en) | 2017-11-01 |
Family
ID=48916076
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
EP13745103.5A Active EP2880653B1 (en) | 2012-08-03 | 2013-08-05 | Decoder and method for multi-instance spatial-audio-object-coding employing a parametric concept for multichannel downmix/upmix cases |
Country Status (12)
Country | Link |
---|---|
US (1) | US10176812B2 (ko) |
EP (1) | EP2880653B1 (ko) |
JP (1) | JP6141978B2 (ko) |
KR (1) | KR101660004B1 (ko) |
CN (1) | CN104756186B (ko) |
AU (1) | AU2013298462B2 (ko) |
BR (1) | BR112015002367B1 (ko) |
CA (1) | CA2880891C (ko) |
ES (1) | ES2654792T3 (ko) |
MX (1) | MX351687B (ko) |
RU (1) | RU2604337C2 (ko) |
WO (1) | WO2014020181A1 (ko) |
Families Citing this family (7)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
BR112015002367B1 (pt) * | 2012-08-03 | 2021-12-14 | Fraunhofer-Gesellschaft Zur Foerderung Der Angewandten Forschung Ev | Decodificador e método para codificação de objeto de áudio espacial multi-instância empregando um conceito paramétrico para caixas multicanal de downmix/upmix |
KR102343453B1 (ko) * | 2014-03-28 | 2021-12-27 | 삼성전자주식회사 | 음향 신호의 렌더링 방법, 장치 및 컴퓨터 판독 가능한 기록 매체 |
CN111556426B (zh) | 2015-02-06 | 2022-03-25 | 杜比实验室特许公司 | 用于自适应音频的混合型基于优先度的渲染系统和方法 |
US9854375B2 (en) * | 2015-12-01 | 2017-12-26 | Qualcomm Incorporated | Selection of coded next generation audio data for transport |
EP4123644B1 (en) | 2018-04-11 | 2024-08-21 | Dolby International AB | 6dof audio decoding and/or rendering |
CN110808054B (zh) * | 2019-11-04 | 2022-05-06 | 思必驰科技股份有限公司 | 多路音频的压缩与解压缩方法及系统 |
GB202002900D0 (en) * | 2020-02-28 | 2020-04-15 | Nokia Technologies Oy | Audio repersentation and associated rendering |
Citations (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20080205657A1 (en) * | 2006-12-07 | 2008-08-28 | Lg Electronics, Inc. | Method and an Apparatus for Decoding an Audio Signal |
US20110002469A1 (en) * | 2008-03-03 | 2011-01-06 | Nokia Corporation | Apparatus for Capturing and Rendering a Plurality of Audio Channels |
US20110196685A1 (en) * | 2006-09-29 | 2011-08-11 | Lg Electronics Inc. | Methods and apparatuses for encoding and decoding object-based audio signals |
Family Cites Families (22)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
DK1173925T3 (da) * | 1999-04-07 | 2004-03-29 | Dolby Lab Licensing Corp | Matriksforbedringer til tabsfri kodning og dekodning |
DE102004043521A1 (de) * | 2004-09-08 | 2006-03-23 | Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. | Vorrichtung und Verfahren zum Erzeugen eines Multikanalsignals oder eines Parameterdatensatzes |
KR100888474B1 (ko) * | 2005-11-21 | 2009-03-12 | 삼성전자주식회사 | 멀티채널 오디오 신호의 부호화/복호화 장치 및 방법 |
CN101371298A (zh) * | 2006-01-19 | 2009-02-18 | Lg电子株式会社 | 用于解码信号的方法和装置 |
SG175632A1 (en) * | 2006-10-16 | 2011-11-28 | Dolby Sweden Ab | Enhanced coding and parameter representation of multichannel downmixed object coding |
RU2417549C2 (ru) * | 2006-12-07 | 2011-04-27 | ЭлДжи ЭЛЕКТРОНИКС ИНК. | Способ и устройство для обработки аудиосигнала |
CN101542597B (zh) * | 2007-02-14 | 2013-02-27 | Lg电子株式会社 | 用于编码和解码基于对象的音频信号的方法和装置 |
JP5133401B2 (ja) * | 2007-04-26 | 2013-01-30 | ドルビー・インターナショナル・アクチボラゲット | 出力信号の合成装置及び合成方法 |
WO2009066959A1 (en) * | 2007-11-21 | 2009-05-28 | Lg Electronics Inc. | A method and an apparatus for processing a signal |
US8060042B2 (en) * | 2008-05-23 | 2011-11-15 | Lg Electronics Inc. | Method and an apparatus for processing an audio signal |
WO2010090019A1 (ja) * | 2009-02-04 | 2010-08-12 | パナソニック株式会社 | 結合装置、遠隔通信システム及び結合方法 |
US8112168B2 (en) | 2009-07-29 | 2012-02-07 | Texas Instruments Incorporated | Process and method for a decoupled multi-parameter run-to-run controller |
KR101615262B1 (ko) * | 2009-08-12 | 2016-04-26 | 삼성전자주식회사 | 시멘틱 정보를 이용한 멀티 채널 오디오 인코딩 및 디코딩 방법 및 장치 |
KR101613975B1 (ko) * | 2009-08-18 | 2016-05-02 | 삼성전자주식회사 | 멀티 채널 오디오 신호의 부호화 방법 및 장치, 그 복호화 방법 및 장치 |
WO2012040897A1 (en) * | 2010-09-28 | 2012-04-05 | Huawei Technologies Co., Ltd. | Device and method for postprocessing decoded multi-channel audio signal or decoded stereo signal |
KR101227932B1 (ko) * | 2011-01-14 | 2013-01-30 | 전자부품연구원 | 다채널 멀티트랙 오디오 시스템 및 오디오 처리 방법 |
EP2477188A1 (en) * | 2011-01-18 | 2012-07-18 | Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. | Encoding and decoding of slot positions of events in an audio signal frame |
JP2015509212A (ja) * | 2012-01-19 | 2015-03-26 | コーニンクレッカ フィリップス エヌ ヴェ | 空間オーディオ・レンダリング及び符号化 |
CN104541524B (zh) * | 2012-07-31 | 2017-03-08 | 英迪股份有限公司 | 一种用于处理音频信号的方法和设备 |
BR112015002367B1 (pt) * | 2012-08-03 | 2021-12-14 | Fraunhofer-Gesellschaft Zur Foerderung Der Angewandten Forschung Ev | Decodificador e método para codificação de objeto de áudio espacial multi-instância empregando um conceito paramétrico para caixas multicanal de downmix/upmix |
AR090703A1 (es) * | 2012-08-10 | 2014-12-03 | Fraunhofer Ges Forschung | Codificador, decodificador, sistema y metodo que emplean un concepto residual para codificar objetos de audio parametricos |
EP2830046A1 (en) * | 2013-07-22 | 2015-01-28 | Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. | Apparatus and method for decoding an encoded audio signal to obtain modified output signals |
-
2013
- 2013-08-05 BR BR112015002367-3A patent/BR112015002367B1/pt active IP Right Grant
- 2013-08-05 WO PCT/EP2013/066374 patent/WO2014020181A1/en active Application Filing
- 2013-08-05 CN CN201380051500.1A patent/CN104756186B/zh active Active
- 2013-08-05 JP JP2015524811A patent/JP6141978B2/ja active Active
- 2013-08-05 MX MX2015001514A patent/MX351687B/es active IP Right Grant
- 2013-08-05 CA CA2880891A patent/CA2880891C/en active Active
- 2013-08-05 EP EP13745103.5A patent/EP2880653B1/en active Active
- 2013-08-05 KR KR1020157005399A patent/KR101660004B1/ko active IP Right Grant
- 2013-08-05 RU RU2015107245/08A patent/RU2604337C2/ru active
- 2013-08-05 AU AU2013298462A patent/AU2013298462B2/en active Active
- 2013-08-05 ES ES13745103.5T patent/ES2654792T3/es active Active
-
2015
- 2015-01-30 US US14/610,396 patent/US10176812B2/en active Active
Patent Citations (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20110196685A1 (en) * | 2006-09-29 | 2011-08-11 | Lg Electronics Inc. | Methods and apparatuses for encoding and decoding object-based audio signals |
US20080205657A1 (en) * | 2006-12-07 | 2008-08-28 | Lg Electronics, Inc. | Method and an Apparatus for Decoding an Audio Signal |
US20110002469A1 (en) * | 2008-03-03 | 2011-01-06 | Nokia Corporation | Apparatus for Capturing and Rendering a Plurality of Audio Channels |
Non-Patent Citations (1)
Title |
---|
ANONYMOUS: "ISO/IEC FDIS 23003-2: 2010, Spatial Audio Object Coding", 91. MPEG MEETING;18-1-2010 - 22-1-2010; KYOTO; (MOTION PICTURE EXPERTGROUP OR ISO/IEC JTC1/SC29/WG11),, no. N11207, 10 May 2010 (2010-05-10), XP030017704, ISSN: 0000-0030 * |
Also Published As
Publication number | Publication date |
---|---|
JP6141978B2 (ja) | 2017-06-07 |
CA2880891C (en) | 2017-10-17 |
WO2014020181A1 (en) | 2014-02-06 |
EP2880653A1 (en) | 2015-06-10 |
KR20150040997A (ko) | 2015-04-15 |
JP2015527611A (ja) | 2015-09-17 |
BR112015002367A2 (pt) | 2018-09-11 |
AU2013298462A1 (en) | 2015-02-19 |
BR112015002367B1 (pt) | 2021-12-14 |
CA2880891A1 (en) | 2014-02-06 |
MX2015001514A (es) | 2015-07-06 |
RU2015107245A (ru) | 2016-09-27 |
CN104756186A (zh) | 2015-07-01 |
ES2654792T3 (es) | 2018-02-15 |
RU2604337C2 (ru) | 2016-12-10 |
US20150149187A1 (en) | 2015-05-28 |
AU2013298462B2 (en) | 2016-10-20 |
CN104756186B (zh) | 2018-01-02 |
MX351687B (es) | 2017-10-25 |
KR101660004B1 (ko) | 2016-09-27 |
US10176812B2 (en) | 2019-01-08 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
AU2016234987B2 (en) | Decoder and method for a generalized spatial-audio-object-coding parametric concept for multichannel downmix/upmix cases | |
US10176812B2 (en) | Decoder and method for multi-instance spatial-audio-object-coding employing a parametric concept for multichannel downmix/upmix cases | |
US10497375B2 (en) | Apparatus and methods for adapting audio information in spatial audio object coding |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PUAI | Public reference made under article 153(3) epc to a published international application that has entered the european phase |
Free format text: ORIGINAL CODE: 0009012 |
|
17P | Request for examination filed |
Effective date: 20150202 |
|
AK | Designated contracting states |
Kind code of ref document: A1 Designated state(s): AL AT BE BG CH CY CZ DE DK EE ES FI FR GB GR HR HU IE IS IT LI LT LU LV MC MK MT NL NO PL PT RO RS SE SI SK SM TR |
|
AX | Request for extension of the european patent |
Extension state: BA ME |
|
DAX | Request for extension of the european patent (deleted) | ||
17Q | First examination report despatched |
Effective date: 20160324 |
|
GRAP | Despatch of communication of intention to grant a patent |
Free format text: ORIGINAL CODE: EPIDOSNIGR1 |
|
INTG | Intention to grant announced |
Effective date: 20170515 |
|
GRAS | Grant fee paid |
Free format text: ORIGINAL CODE: EPIDOSNIGR3 |
|
GRAA | (expected) grant |
Free format text: ORIGINAL CODE: 0009210 |
|
AK | Designated contracting states |
Kind code of ref document: B1 Designated state(s): AL AT BE BG CH CY CZ DE DK EE ES FI FR GB GR HR HU IE IS IT LI LT LU LV MC MK MT NL NO PL PT RO RS SE SI SK SM TR |
|
REG | Reference to a national code |
Ref country code: GB Ref legal event code: FG4D |
|
REG | Reference to a national code |
Ref country code: CH Ref legal event code: EP Ref country code: AT Ref legal event code: REF Ref document number: 942764 Country of ref document: AT Kind code of ref document: T Effective date: 20171115 |
|
REG | Reference to a national code |
Ref country code: IE Ref legal event code: FG4D |
|
REG | Reference to a national code |
Ref country code: DE Ref legal event code: R096 Ref document number: 602013028755 Country of ref document: DE |
|
REG | Reference to a national code |
Ref country code: ES Ref legal event code: FG2A Ref document number: 2654792 Country of ref document: ES Kind code of ref document: T3 Effective date: 20180215 |
|
REG | Reference to a national code |
Ref country code: NL Ref legal event code: MP Effective date: 20171101 |
|
REG | Reference to a national code |
Ref country code: LT Ref legal event code: MG4D |
|
REG | Reference to a national code |
Ref country code: AT Ref legal event code: MK05 Ref document number: 942764 Country of ref document: AT Kind code of ref document: T Effective date: 20171101 |
|
PG25 | Lapsed in a contracting state [announced via postgrant information from national office to epo] |
Ref country code: NO Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT Effective date: 20180201 Ref country code: NL Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT Effective date: 20171101 Ref country code: LT Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT Effective date: 20171101 Ref country code: FI Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT Effective date: 20171101 Ref country code: SE Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT Effective date: 20171101 |
|
PG25 | Lapsed in a contracting state [announced via postgrant information from national office to epo] |
Ref country code: GR Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT Effective date: 20180202 Ref country code: LV Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT Effective date: 20171101 Ref country code: IS Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT Effective date: 20180301 Ref country code: AT Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT Effective date: 20171101 Ref country code: BG Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT Effective date: 20180201 Ref country code: RS Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT Effective date: 20171101 Ref country code: HR Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT Effective date: 20171101 |
|
PG25 | Lapsed in a contracting state [announced via postgrant information from national office to epo] |
Ref country code: CY Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT Effective date: 20171101 Ref country code: EE Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT Effective date: 20171101 Ref country code: CZ Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT Effective date: 20171101 Ref country code: DK Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT Effective date: 20171101 Ref country code: SK Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT Effective date: 20171101 |
|
REG | Reference to a national code |
Ref country code: DE Ref legal event code: R097 Ref document number: 602013028755 Country of ref document: DE |
|
REG | Reference to a national code |
Ref country code: FR Ref legal event code: PLFP Year of fee payment: 6 |
|
PG25 | Lapsed in a contracting state [announced via postgrant information from national office to epo] |
Ref country code: SM Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT Effective date: 20171101 Ref country code: PL Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT Effective date: 20171101 Ref country code: RO Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT Effective date: 20171101 |
|
PLBE | No opposition filed within time limit |
Free format text: ORIGINAL CODE: 0009261 |
|
STAA | Information on the status of an ep patent application or granted ep patent |
Free format text: STATUS: NO OPPOSITION FILED WITHIN TIME LIMIT |
|
26N | No opposition filed |
Effective date: 20180802 |
|
PG25 | Lapsed in a contracting state [announced via postgrant information from national office to epo] |
Ref country code: SI Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT Effective date: 20171101 |
|
PG25 | Lapsed in a contracting state [announced via postgrant information from national office to epo] |
Ref country code: MC Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT Effective date: 20171101 |
|
REG | Reference to a national code |
Ref country code: CH Ref legal event code: PL |
|
PG25 | Lapsed in a contracting state [announced via postgrant information from national office to epo] |
Ref country code: LU Free format text: LAPSE BECAUSE OF NON-PAYMENT OF DUE FEES Effective date: 20180805 Ref country code: LI Free format text: LAPSE BECAUSE OF NON-PAYMENT OF DUE FEES Effective date: 20180831 Ref country code: CH Free format text: LAPSE BECAUSE OF NON-PAYMENT OF DUE FEES Effective date: 20180831 |
|
REG | Reference to a national code |
Ref country code: BE Ref legal event code: MM Effective date: 20180831 |
|
REG | Reference to a national code |
Ref country code: IE Ref legal event code: MM4A |
|
PG25 | Lapsed in a contracting state [announced via postgrant information from national office to epo] |
Ref country code: IE Free format text: LAPSE BECAUSE OF NON-PAYMENT OF DUE FEES Effective date: 20180805 |
|
PG25 | Lapsed in a contracting state [announced via postgrant information from national office to epo] |
Ref country code: BE Free format text: LAPSE BECAUSE OF NON-PAYMENT OF DUE FEES Effective date: 20180831 |
|
PG25 | Lapsed in a contracting state [announced via postgrant information from national office to epo] |
Ref country code: MT Free format text: LAPSE BECAUSE OF NON-PAYMENT OF DUE FEES Effective date: 20180805 |
|
PG25 | Lapsed in a contracting state [announced via postgrant information from national office to epo] |
Ref country code: HU Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT; INVALID AB INITIO Effective date: 20130805 Ref country code: PT Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT Effective date: 20171101 |
|
PG25 | Lapsed in a contracting state [announced via postgrant information from national office to epo] |
Ref country code: MK Free format text: LAPSE BECAUSE OF NON-PAYMENT OF DUE FEES Effective date: 20171101 |
|
PG25 | Lapsed in a contracting state [announced via postgrant information from national office to epo] |
Ref country code: AL Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT Effective date: 20171101 |
|
P01 | Opt-out of the competence of the unified patent court (upc) registered |
Effective date: 20230516 |
|
PGFP | Annual fee paid to national office [announced via postgrant information from national office to epo] |
Ref country code: TR Payment date: 20230731 Year of fee payment: 11 Ref country code: IT Payment date: 20230831 Year of fee payment: 11 Ref country code: ES Payment date: 20230918 Year of fee payment: 11 |
|
PGFP | Annual fee paid to national office [announced via postgrant information from national office to epo] |
Ref country code: DE Payment date: 20240819 Year of fee payment: 12 |
|
PGFP | Annual fee paid to national office [announced via postgrant information from national office to epo] |
Ref country code: GB Payment date: 20240822 Year of fee payment: 12 |
|
PGFP | Annual fee paid to national office [announced via postgrant information from national office to epo] |
Ref country code: FR Payment date: 20240823 Year of fee payment: 12 |