WO2015104166A1 - Method and apparatus for improving the coding of side information required for coding a higher order ambisonics representation of a sound field - Google Patents

Method and apparatus for improving the coding of side information required for coding a higher order ambisonics representation of a sound field Download PDF

Info

Publication number
WO2015104166A1
WO2015104166A1 PCT/EP2014/078641 EP2014078641W WO2015104166A1 WO 2015104166 A1 WO2015104166 A1 WO 2015104166A1 EP 2014078641 W EP2014078641 W EP 2014078641W WO 2015104166 A1 WO2015104166 A1 WO 2015104166A1
Authority
WO
WIPO (PCT)
Prior art keywords
prediction
array
side information
data
indices
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Ceased
Application number
PCT/EP2014/078641
Other languages
English (en)
French (fr)
Inventor
Alexander Krueger
Sven Kordon
Oliver Wuebbolt
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Thomson Licensing SAS
Original Assignee
Thomson Licensing SAS
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Priority to KR1020227019915A priority Critical patent/KR102686291B1/ko
Priority to CN202410341175.2A priority patent/CN118248156A/zh
Priority to EP19208682.5A priority patent/EP3648102B1/en
Priority to EP25158678.0A priority patent/EP4554255A3/en
Priority to US15/110,354 priority patent/US9990934B2/en
Priority to CN202410171734.XA priority patent/CN118016077A/zh
Priority to KR1020217040165A priority patent/KR102409796B1/ko
Priority to KR1020167021560A priority patent/KR102338374B1/ko
Priority to CN201480072725.XA priority patent/CN105981100B/zh
Priority to EP14815731.6A priority patent/EP3092641B1/en
Priority to CN202010020047.XA priority patent/CN111028849B/zh
Priority to KR1020247023646A priority patent/KR20240116835A/ko
Priority to CN202010025266.7A priority patent/CN111179951B/zh
Priority to JP2016544628A priority patent/JP6530412B2/ja
Priority to CN202010019977.3A priority patent/CN111179955B/zh
Priority to EP22176389.9A priority patent/EP4089675B1/en
Application filed by Thomson Licensing SAS filed Critical Thomson Licensing SAS
Publication of WO2015104166A1 publication Critical patent/WO2015104166A1/en
Anticipated expiration legal-status Critical
Priority to US15/956,295 priority patent/US10147437B2/en
Priority to US16/189,797 priority patent/US10424312B2/en
Priority to US16/532,302 priority patent/US10553233B2/en
Priority to US16/719,806 priority patent/US10714112B2/en
Priority to US16/925,334 priority patent/US11211078B2/en
Priority to US17/558,550 priority patent/US11488614B2/en
Priority to US17/970,118 priority patent/US11869523B2/en
Priority to US18/390,546 priority patent/US12277948B2/en
Priority to US19/176,090 priority patent/US20250329338A1/en
Ceased legal-status Critical Current

Links

Classifications

    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/04Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
    • G10L19/16Vocoder architecture
    • G10L19/18Vocoders using multiple modes
    • G10L19/20Vocoders using multiple modes using sound class specific coding, hybrid encoders or object based coding
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/008Multichannel audio signal coding or decoding using interchannel correlation to reduce redundancy, e.g. joint-stereo, intensity-coding or matrixing
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04SSTEREOPHONIC SYSTEMS 
    • H04S3/00Systems employing more than two channels, e.g. quadraphonic
    • H04S3/008Systems employing more than two channels, e.g. quadraphonic in which the audio signals are in digital form, i.e. employing more than two discrete digital channels
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04SSTEREOPHONIC SYSTEMS 
    • H04S2420/00Techniques used stereophonic systems covered by H04S but not provided for in its groups
    • H04S2420/11Application of ambisonics in stereophonic audio systems

Definitions

  • the invention relates to a method and to an apparatus for improving the coding of side information required for coding a Higher Order Ambisonics representation of a sound field.
  • HOA Higher Order Ambisonics
  • WFS wave field synthesis
  • channel based approaches like the 22.2 multichannel audio format.
  • the HOA representation offers the advantage of being independent of a specific loudspeaker set-up. This flexibility, however, is at the expense of a decoding pro ⁇ cess which is required for the playback of the HOA representation on a particular loudspeaker set-up.
  • HOA signals may also be rendered to set- ups consisting of only few loudspeakers.
  • a further advantage of HOA is that the same representation can also be employed without any modification for binaural rendering to headphones .
  • HOA is based on the representation of the spatial density of complex harmonic plane wave amplitudes by a truncated Spher ⁇ ical Harmonics (SH) expansion.
  • SH Spher ⁇ ical Harmonics
  • the spatial resolution of the HOA representation improves with a growing maximum order N of the expansion.
  • the total bit rate for the transmission of HOA representation given a desired single- channel sampling rate f $ and the number of bits per sam- pie, is determined by 0 ⁇ f s ⁇ .
  • HOA sound field representations are proposed in WO 2013/171083 Al, EP 13305558.2 and PCT/EP2013/075559. These processings have in common that they perform a sound field analysis and decompose the given HOA representation into a directional component and a residual ambient compo ⁇ nent.
  • the final compressed representation is as ⁇ sumed to consist of a number of quantised signals, resulting from the perceptual coding of the directional signals and relevant coefficient sequences of the ambient HOA component.
  • a problem to be solved by the invention is to provide a more efficient way of coding side information related to that spatial prediction.
  • the inventive method is suited for improving the coding of side information required for coding a Higher Order Ambisonics representation of a sound field, denoted HOA, with input time frames of HOA coefficient sequences, wherein dominant directional signals as well as a residual ambient HOA component are determined and a prediction is used for said dominant directional signals, thereby provid ⁇ ing, for a coded frame of HOA coefficients, side information data describing said prediction, and wherein said side information data can include:
  • said method including the step:
  • the inventive apparatus is suited for improving the coding of side information required for coding a Higher Order Ambisonics representation of a sound field, denoted HOA, with input time frames of HOA coefficient sequences, wherein dominant directional signals as well as a residual ambient HOA component are determined and a prediction is used for said dominant directional signals, thereby provid- ing, for a coded frame of HOA coefficients, side information data describing said prediction, and wherein said side information data can include:
  • said apparatus including means which:
  • Fig. 1 Exemplary coding of side information related to spa- tial prediction in the HOA compression processing described in EP 13305558.2;
  • FIG. 2 Exemplary decoding of side information related to spatial prediction in the HOA decompression processing described in patent application EP 13305558.2; Fig. 3 HOA decomposition as described in patent application
  • Fig. 4 Illustration of directions (depicted as crosses) of general plane waves representing the residual signal and the directions (depicted as circles) of dominant sound sources.
  • the directions are presented in a three-dimensional coordinate system as sampling po ⁇ sitions on the unit sphere;
  • Fig. 6 Inventive coding of spatial prediction side information
  • Fig. 7 Inventive decoding of coded spatial prediction side information
  • Fig. 1 it is illustrated how the coding of side information related to spatial prediction can be embedded into the HOA compression processing described patent application EP 13305558.2.
  • a frame-wise processing with non-overlapping input frames C(/c) of HOA coeffi ⁇ cient sequences of length L is assumed, where k denotes the frame index.
  • the first step or stage 11/12 in Fig. 1 is op ⁇ tional and consists of concatenating the non-overlapping k- th and ( k— 1) -th frames of HOA coefficient sequences C(/c) in ⁇ to a long frame C(/c) as
  • C(fc): [C(fc-l) C ⁇ k) ⁇ , (1) which long frame is 50% overlapped with an adjacent long frame and which long frame is successively used for the es ⁇ timation of dominant sound source directions. Similar to the notation for C(/c), the tilde symbol is used in the following description for indicating that the respective quantity re ⁇ fers to long overlapping frames. If step/stage 11/12 is not present, the tilde symbol has no specific meaning.
  • a parameter in bold means a set of values, e.g. a matrix or a vector.
  • the long frame C(/c) is successively used in step or stage 13 for the estimation of dominant sound source directions as described in EP 13305558.2.
  • This estimation provides a data set JDIR , ACT(k) ⁇ ⁇ 1, ... , D ⁇ of indices of the related directional signals that have been detected, as well as a data set
  • step or stage 14 the current (long) frame C(/c) of HOA co- efficient sequences is decomposed (as proposed in EP 13305156.5) into a number of directional signals X mR (k— 2) belonging to the directions contained in the set Qa,Acr(k , and a residual ambient HOA component C AMB (k— 2).
  • the delay of two frames is introduced as a result of overlap-add processing in order to obtain smooth signals. It is assumed that X mR (k— 2) is con ⁇ taining a total of D channels, of which however only those corresponding to the active directional signals are non ⁇ zero. The indices specifying these channels are assumed to be output in the data set mRACT (k— 2).
  • the de- composition in step/stage 14 provides some parameters (/c— 2) which can be used at decompression side for predicting portions of the original HOA representation from the directional signals (see EP 13305156.5 for more details) .
  • the HOA decomposition is described in more detail in the below section HOA decomposition .
  • step or stage 15 the number of coefficients of the ambi ⁇ ent HOA component C AMB (k— 2) is reduced to contain only
  • N mRACT (k— 2) indicates the cardinality of the data set mRACT (k— 2), i.e. the number of active di ⁇ rectional signals in frame k— 2. Since the ambient HOA com ⁇ ponent is assumed to be always represented by a minimum num ⁇ ber ORED of HOA coefficient sequences, this problem can be actually reduced to the selection of the remaining D— N mRACT (k— 2) HOA coefficient sequences out of the possible 0— 0 REO ones. In order to obtain a smooth reduced ambient HOA representa ⁇ tion, this choice is accomplished such that, compared to the choice taken at the previous frame k— 3, as few changes as possible will occur.
  • step/stage 16 The final ambient HOA representation with the reduced number of 0 RED + N DIRiACT (/c— 2) non-zero coefficient sequences is de- noted by C AMB REO (k— 2) .
  • the indices of the chosen ambient HOA coefficient sequences are output in the data set 2) .
  • step/stage 16 the active directional signals contained in X mR (k— 2) and the HOA coefficient sequences contained in C AMBjRED (/c— 2) are assigned to the frame Y(k— 2) of / channels for individual perceptual encoding as described in EP 13305558.2.
  • Perceptual coding step/stage 17 encodes the / channels of frame Y(k— 2) and outputs an encoded frame Y(k— 2) .
  • the spa ⁇ tial prediction parameters or side information data (/c— 2) resulting from the decomposition of the HOA representation are losslessly coded in step or stage 19 in order to provide a coded data representation ⁇ 2), using the index set
  • Fig. 2 it is exemplary shown how to embed in step or stage 25 the decoding of the received encoded side infor- mation data ⁇ 2) related to spatial prediction into the HOA decompression processing described in Fig. 3 of patent application EP 13305558.2.
  • the decoding of the encoded side information data ⁇ 2) is carried out before entering its decoded version (/c— 2) into the composition of the HOA representation in step or stage 23, using the received index set mRACT (k) delayed by two frames in delay 24.
  • step or stage 21 a perceptual decoding of the / signals contained in Y(k— 2) is performed in order to obtain the / decoded signals in Y(k— 2) .
  • the perceptually decoded signals in Y(k— 2) are re-distributed in order to recreate the frame X mR (k— 2) of directional signals and the frame C AMB RED (k— 2) of the ambient HOA component.
  • the infor ⁇ mation about how to re-distribute the signals is obtained by reproducing the assigning operation performed for the HOA compression, using the index data sets mR ACT (k) an d ⁇ AMB.ACT C ⁇ — 2) .
  • composition step or stage 23 a current frame C(k— 3) of the desired total HOA representation is re-composed (accord ⁇ ing to the processing described in connection with Fig. 2b and Fig.
  • C AMBJRED (/c— 2) corresponds to component D A (k— 2) in PCT/EP2013/ 075559
  • Gn,A j (k A N CL ⁇ DIRACT C ⁇ ) correspond to A ⁇ (k) in PCT/ EP2013/075559, wherein active directional signal indices can be obtained by taking those indices of rows of A ⁇ k which contain valid elements.
  • directional signals with re ⁇ spect to uniformly distributed directions are predicted from the directional signals X mR (k— 2) using the received parame ⁇ ters (/c— 2) for such prediction, and thereafter the current decompressed frame C(k— 3) is re-composed from the frame of directional signals X mR (k— 2) , from mR ACT (k) and OO r and from the predicted portions and the reduced ambient HOA compo- nent CAMB.RED (fc - 2) .
  • the smoothed dominant directional signals X mR (k— 1) and their HOA representation C DIR (/c— 1) are computed in step or stage 31, using the long frame C(/c) of the input HOA rep- resentation, the set of directions and the set 3 ⁇ 4IR,ACT of corresponding indices of directional signals. It is as ⁇ sumed that X mR (k— 1) contains a total of D channels, of which however only those corresponding to the active directional signals are non-zero. The indices specifying these channels are assumed to be output in the set mR ACT (k— 1) .
  • step or stage 33 the residual between the original HOA representation C(k— 1) and the HOA representation C DIR (/c— 1) of the dominant directional signals is represented by a num ⁇ ber of 0 directional signals -X RES (/C— 1), which can be consid- ered as being general plane waves from uniformly distributed directions, which are referred to a uniform grid.
  • step or stage 34 these directional signals are predicted from the dominant directional signals X mR (k— 1) in order to provide the predicted signals -X RES (/C— 1) together with the respective prediction parameters (/c— 1) .
  • the dominant directional signals Xum ,d (k— 1) with indices d which are contained in the set ⁇ DIRACT C ⁇ — Or are consid ⁇ ered. The prediction is described in more detail in the be ⁇ low section Spatial prediction.
  • step or stage 35 the smoothed HOA representation C RES (/c— 2) of the predicted directional signals X RE$ (k— 1) is computed.
  • step or stage 37 the residual C AMB (/c— 2) between the orig- inal HOA representation C(k— 2) and the HOA representation C DIR (/c— 2) of the dominant directional signals together with the HOA representation C RES (/c— 2) of the predicted directional signals from uniformly distributed directions is computed and is output.
  • the required signal delays in the Fig. 3 processing are per ⁇ formed by corresponding delays 381 to 387.
  • the goal of the spatial prediction is to predict the 0 re ⁇ sidual signals
  • X mR (k - 1): [X OlR (k - 3) X OlR (k - 2) X OlR (k - 1)] (3)
  • Dp ED t 9 1, denote the indices from which di ⁇ rectional signals the prediction for the direction q has to be performed. If no prediction is to be performed for a direction q , the corresponding column of the matrix ⁇ IND C ⁇ - 1) consists of zeros. Further, if less than D PRED directional signals are used for the prediction for a di ⁇ rection q , the non-required elements in the 9-th column of P IND (/c— 1) are also zero.
  • P TYPE O -1) [1 0 0 0 0 0 2 0 0 0 0 0 0 0 0] , (7) r j , _ ⁇ ⁇ - ⁇ 1 0 0 0 0 0 0 1 0 0 0 0 0 0 0 0 01
  • the general plane wave sig ⁇ nal 3 ⁇ 4ES,GRID,I — 1) from direction ⁇ is predicted from the di- rectional signal 3 ⁇ 4IR,I( ⁇ — ⁇ from direction -QACT.I by a pure multiplication (i.e. full band) with a factor that results from de-quantising the value 40.
  • the general plane wave signal 3 ⁇ 4ES,GRID,7 — 1) from direction ⁇ 7 is predicted from the directional signals x DIR1 (/c— 1) and 3 ⁇ 4iR,4(k — 1) by a lowpass filtering and multiplication with factors that result from de-quantising the values 15 and —13.
  • B sc denotes a predefined number of bits to be used for the quantisation of the prediction factors.
  • PF ,d, q.k— 1) is assumed to be set to zero, if iND,d,q(k — 1) is equal to zero.
  • a bit array ActivePred consisting of 0 bits is creat- ed, in which the bit ActivePred [q>] indicates whether or not for the direction q a prediction is performed.
  • the number of 'ones' in this array is denoted by NumActivePred .
  • the bit array PredType of length NumActivePred is creat- ed where each bit indicates, for the directions where a pre ⁇ diction is to be performed, the kind of the prediction, i.e. full band or low pass.
  • the unsigned inte ⁇ ger array PredDirSiglds of length NumActivePred ⁇ D PRED is created, whose elements denote for each active prediction the
  • OpRED indices of the directional signals to be used If less than DpRED directional signals are to be used for the predic ⁇ tion, the indices are assumed to be set to zero.
  • Each ele ⁇ ment of the array PredDirSiglds is assumed to be represented by [log 2 (D + 1)1 bits. The number of non-zero elements in the array PredDirSiglds is denoted by NumNonZerolds .
  • the integer array QuantPredGains of length NumNonZerolds is created, whose elements are assumed to represent the quantised scaling factors f Q ,F ,d,q (k — 1) to be used in equation (17) .
  • the dequantisation to obtain the corresponding dequan- tised scaling factors P Fdq (k— l) is given in equation (10).
  • Each element of the array QuantPredGains is assumed to be represented by B sc bits.
  • the coded repre ⁇ sentation of equations (7) to (9) is used:
  • QuantPredGains [40 15 -13] . (23)
  • the state- of-the-art processing is advantageously modified.
  • PSPredictionActive is zero (or '1' as an alternative), the array ActivePred and further data related to the prediction are not to be included into the coded side information ⁇ COD ⁇ I R practise, this operation reduces over time the average bit rate for the transmission of ⁇ COD ⁇
  • NumActivePred of active prediction is often very low. In such situation, instead of using the bit array ActivePred for indicating for each direction q whether or not the prediction is performed, it can be more efficient to transmit or transfer instead the number of active predic ⁇ tions and the respective indices. In particular, this modified kind of coding the activity is more efficient in case that NumActivePred ⁇ M M , (24) where M M is the greatest integer number that satisfies
  • Equation (25) [log 2 (M M )l denotes the number of bits re ⁇ quired for coding the actual number NumActivePred of active predictions, and M M ⁇ [log 2 (0)l is the number of bits re ⁇ quired for coding the respective direction indices.
  • the right hand side of equation (25) corresponds to the num ⁇ ber of bits of the array ActivePred , which would be re ⁇ quired for coding the same information in the known way.
  • a single bit KindOfCodedPredlds can be used for indicating in which way the indices of those directions, where a prediction is supposed to be performed, are coded. If the bit
  • KindOfCodedPredlds has the value '1' (or '0' in the alterna- tive) , the number NumActivePred and the array Predlds containing the indices of directions, where a prediction is supposed to be performed, are added to the coded side in ⁇ formation ⁇ COD ⁇ Otherwise, if the bit KindOfCodedPredlds has the value '0' (or '1' in the alternative), the array
  • ActivePred is used to code the same information.
  • bits can be used for coding each element of the index ar ⁇ ray PredDirSiglds , which kind of coding is more efficient.
  • the data set 3 ⁇ 4IR,ACT is assumed to be known, and thus the decoder also knows how many bits have to be read for decoding an index of a directional signal. Note that the frame indices of ⁇ COD to be computed and the used index data set 3 ⁇ 4IR,ACT have to be identical.
  • PredGains which however contains quantised values.
  • this representation coded according to the invention requires 8 bits less.
  • the decoding of the modified side information related to spatial prediction is summarised in the example decoding processing depicted in Fig. 7 and Fig. 8 (the processing depicted in Fig. 8 is the continuation of the processing depicted in Fig. 7 ) and is explained in the following.
  • NumActivePred elements is read, where each element is assumed to be coded by [log 2 (0)l bits.
  • the elements of this array are the indices of directions, where a prediction has to be per ⁇ formed.
  • the bit array PredType of length is read, where each element is assumed to be coded by [log 2 (0)l bits.
  • the elements of this array are the indices of directions, where a prediction has to be per ⁇ formed.
  • NumActivePred is read, of which the elements indicate the kind of prediction to be performed for each one of the relevant directions.
  • the elements of the vector PTYPE are computed.
  • the array PredDirSiglds is read, which con ⁇ sists of NumActivePred ⁇ D PRED elements. Each element is assumed to be coded by log 2 (OACT)l bits. Using the information con ⁇ tained in PTYPE 3 ⁇ 4IR , ACT an d PredDirSiglds , the elements of ma ⁇ trix PJ D are set and the number NumNonZeroIds of non-zero el- ements in P IND is computed.
  • the array QuantPredGains is read, which consists of NumNonZeroIds elements, each coded by B SC bits. Using the information contained in and QuantPredGains , the elements of the matrix P QF are set.
  • inventive processing can be carried out by a single pro ⁇ cessor or electronic circuit, or by several processors or electronic circuits operating in parallel and/or operating on different parts of the inventive processing.

Landscapes

  • Engineering & Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Multimedia (AREA)
  • Acoustics & Sound (AREA)
  • Signal Processing (AREA)
  • Computational Linguistics (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Human Computer Interaction (AREA)
  • Mathematical Physics (AREA)
  • Compression, Expansion, Code Conversion, And Decoders (AREA)
  • Stereophonic System (AREA)
  • Circuit For Audible Band Transducer (AREA)
PCT/EP2014/078641 2014-01-08 2014-12-19 Method and apparatus for improving the coding of side information required for coding a higher order ambisonics representation of a sound field Ceased WO2015104166A1 (en)

Priority Applications (25)

Application Number Priority Date Filing Date Title
CN202010019977.3A CN111179955B (zh) 2014-01-08 2014-12-19 包括编码hoa表示的位流的解码方法和装置、以及介质
EP19208682.5A EP3648102B1 (en) 2014-01-08 2014-12-19 Method and apparatus for improving the coding of side information required for coding a higher order ambisonics representation of a sound field
EP25158678.0A EP4554255A3 (en) 2014-01-08 2014-12-19 Method and apparatus for improving the coding of side information required for coding a higher order ambisonics representation of a sound field
US15/110,354 US9990934B2 (en) 2014-01-08 2014-12-19 Method and apparatus for improving the coding of side information required for coding a Higher Order Ambisonics representation of a sound field
CN202410171734.XA CN118016077A (zh) 2014-01-08 2014-12-19 包括编码hoa表示的位流的解码方法和装置、以及介质
KR1020217040165A KR102409796B1 (ko) 2014-01-08 2014-12-19 사운드 필드의 고차 앰비소닉스 표현을 코딩하기 위해 요구되는 사이드 정보의 코딩을 개선하기 위한 방법 및 장치
KR1020167021560A KR102338374B1 (ko) 2014-01-08 2014-12-19 사운드 필드의 고차 앰비소닉스 표현을 코딩하기 위해 요구되는 사이드 정보의 코딩을 개선하기 위한 방법 및 장치
CN201480072725.XA CN105981100B (zh) 2014-01-08 2014-12-19 用于改善对声场的高阶高保真度立体声响复制表示进行编码所需的边信息的编码的方法和装置
EP14815731.6A EP3092641B1 (en) 2014-01-08 2014-12-19 Method and apparatus for improving the coding of side information required for coding a higher order ambisonics representation of a sound field
CN202010020047.XA CN111028849B (zh) 2014-01-08 2014-12-19 包括编码hoa表示的位流的解码方法和装置、以及介质
CN202410341175.2A CN118248156A (zh) 2014-01-08 2014-12-19 包括编码hoa表示的位流的解码方法和装置、以及介质
CN202010025266.7A CN111179951B (zh) 2014-01-08 2014-12-19 包括编码hoa表示的位流的解码方法和装置、以及介质
JP2016544628A JP6530412B2 (ja) 2014-01-08 2014-12-19 音場の高次アンビソニックス表現を符号化するために必要とされるサイド情報の符号化を改善する方法および装置
KR1020227019915A KR102686291B1 (ko) 2014-01-08 2014-12-19 사운드 필드의 고차 앰비소닉스 표현을 코딩하기 위해 요구되는 사이드 정보의 코딩을 개선하기 위한 방법 및 장치
KR1020247023646A KR20240116835A (ko) 2014-01-08 2014-12-19 사운드 필드의 고차 앰비소닉스 표현을 코딩하기 위해 요구되는 사이드 정보의 코딩을 개선하기 위한 방법 및 장치
EP22176389.9A EP4089675B1 (en) 2014-01-08 2014-12-19 Method and apparatus for improving the coding of side information required for coding a higher order ambisonics representation of a sound field
US15/956,295 US10147437B2 (en) 2014-01-08 2018-04-18 Method and apparatus for decoding a bitstream including encoding higher order ambisonics representations
US16/189,797 US10424312B2 (en) 2014-01-08 2018-11-13 Method and apparatus for decoding a bitstream including encoded higher order ambisonics representations
US16/532,302 US10553233B2 (en) 2014-01-08 2019-08-05 Method and apparatus for decoding a bitstream including encoded higher order ambisonics representations
US16/719,806 US10714112B2 (en) 2014-01-08 2019-12-18 Method and apparatus for decoding a bitstream including encoded higher order Ambisonics representations
US16/925,334 US11211078B2 (en) 2014-01-08 2020-07-10 Method and apparatus for decoding a bitstream including encoded higher order ambisonics representations
US17/558,550 US11488614B2 (en) 2014-01-08 2021-12-21 Method and apparatus for decoding a bitstream including encoded Higher Order Ambisonics representations
US17/970,118 US11869523B2 (en) 2014-01-08 2022-10-20 Method and apparatus for decoding a bitstream including encoded higher order ambisonics representations
US18/390,546 US12277948B2 (en) 2014-01-08 2023-12-20 Method and apparatus for decoding a bitstream including encoded Higher Order Ambisonics representations
US19/176,090 US20250329338A1 (en) 2014-01-08 2025-04-10 Method and apparatus for decoding a bitstream including encoded higher order ambisonics representations

Applications Claiming Priority (4)

Application Number Priority Date Filing Date Title
EP14305022 2014-01-08
EP14305022.7 2014-01-08
EP14305061 2014-01-16
EP14305061.5 2014-01-16

Related Child Applications (2)

Application Number Title Priority Date Filing Date
US15/110,354 A-371-Of-International US9990934B2 (en) 2014-01-08 2014-12-19 Method and apparatus for improving the coding of side information required for coding a Higher Order Ambisonics representation of a sound field
US15/956,295 Division US10147437B2 (en) 2014-01-08 2018-04-18 Method and apparatus for decoding a bitstream including encoding higher order ambisonics representations

Publications (1)

Publication Number Publication Date
WO2015104166A1 true WO2015104166A1 (en) 2015-07-16

Family

ID=52134201

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/EP2014/078641 Ceased WO2015104166A1 (en) 2014-01-08 2014-12-19 Method and apparatus for improving the coding of side information required for coding a higher order ambisonics representation of a sound field

Country Status (6)

Country Link
US (10) US9990934B2 (cg-RX-API-DMAC7.html)
EP (4) EP3648102B1 (cg-RX-API-DMAC7.html)
JP (5) JP6530412B2 (cg-RX-API-DMAC7.html)
KR (4) KR102338374B1 (cg-RX-API-DMAC7.html)
CN (7) CN105981100B (cg-RX-API-DMAC7.html)
WO (1) WO2015104166A1 (cg-RX-API-DMAC7.html)

Families Citing this family (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US11781416B2 (en) 2019-10-16 2023-10-10 Saudi Arabian Oil Company Determination of elastic properties of a geological formation using machine learning applied to data acquired while drilling
WO2022125771A1 (en) 2020-12-10 2022-06-16 Saudi Arabian Oil Company Determination of mechanical properties of a geological formation using deep learning applied to data acquired while drilling

Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
EP2451196A1 (en) * 2010-11-05 2012-05-09 Thomson Licensing Method and apparatus for generating and for decoding sound field data including ambisonics sound field data of an order higher than three
US20120155653A1 (en) * 2010-12-21 2012-06-21 Thomson Licensing Method and apparatus for encoding and decoding successive frames of an ambisonics representation of a 2- or 3-dimensional sound field
US20130216070A1 (en) * 2010-11-05 2013-08-22 Florian Keiler Data structure for higher order ambisonics audio data

Family Cites Families (15)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US7394903B2 (en) * 2004-01-20 2008-07-01 Fraunhofer-Gesellschaft Zur Forderung Der Angewandten Forschung E.V. Apparatus and method for constructing a multi-channel output signal or for generating a downmix signal
SE0400997D0 (sv) * 2004-04-16 2004-04-16 Cooding Technologies Sweden Ab Efficient coding of multi-channel audio
US7983922B2 (en) * 2005-04-15 2011-07-19 Fraunhofer-Gesellschaft Zur Foerderung Der Angewandten Forschung E.V. Apparatus and method for generating multi-channel synthesizer control signal and apparatus and method for multi-channel synthesizing
US7680123B2 (en) * 2006-01-17 2010-03-16 Qualcomm Incorporated Mobile terminated packet data call setup without dormancy
US8379868B2 (en) * 2006-05-17 2013-02-19 Creative Technology Ltd Spatial audio coding based on universal spatial cues
WO2009065144A1 (en) * 2007-11-16 2009-05-22 Divx, Inc. Chunk header incorporating binary flags and correlated variable-length fields
US8219409B2 (en) * 2008-03-31 2012-07-10 Ecole Polytechnique Federale De Lausanne Audio wave field encoding
WO2010090019A1 (ja) * 2009-02-04 2010-08-12 パナソニック株式会社 結合装置、遠隔通信システム及び結合方法
KR101953279B1 (ko) * 2010-03-26 2019-02-28 돌비 인터네셔널 에이비 오디오 재생을 위한 오디오 사운드필드 표현을 디코딩하는 방법 및 장치
EP2541547A1 (en) * 2011-06-30 2013-01-02 Thomson Licensing Method and apparatus for changing the relative positions of sound objects contained within a higher-order ambisonics representation
EP2637427A1 (en) * 2012-03-06 2013-09-11 Thomson Licensing Method and apparatus for playback of a higher-order ambisonics audio signal
EP2665208A1 (en) * 2012-05-14 2013-11-20 Thomson Licensing Method and apparatus for compressing and decompressing a Higher Order Ambisonics signal representation
EP2738762A1 (en) * 2012-11-30 2014-06-04 Aalto-Korkeakoulusäätiö Method for spatial filtering of at least one first sound signal, computer readable storage medium and spatial filtering system based on cross-pattern coherence
EP2743922A1 (en) * 2012-12-12 2014-06-18 Thomson Licensing Method and apparatus for compressing and decompressing a higher order ambisonics representation for a sound field
EP2800401A1 (en) * 2013-04-29 2014-11-05 Thomson Licensing Method and Apparatus for compressing and decompressing a Higher Order Ambisonics representation

Patent Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
EP2451196A1 (en) * 2010-11-05 2012-05-09 Thomson Licensing Method and apparatus for generating and for decoding sound field data including ambisonics sound field data of an order higher than three
US20130216070A1 (en) * 2010-11-05 2013-08-22 Florian Keiler Data structure for higher order ambisonics audio data
US20120155653A1 (en) * 2010-12-21 2012-06-21 Thomson Licensing Method and apparatus for encoding and decoding successive frames of an ambisonics representation of a 2- or 3-dimensional sound field

Non-Patent Citations (2)

* Cited by examiner, † Cited by third party
Title
ANONYMOUS: "Sparse array - Wikipedia, the free encyclopedia", 6 July 2012 (2012-07-06), XP055152567, Retrieved from the Internet <URL:https://web.archive.org/web/20120706211611/http://en.wikipedia.org/wiki/Sparse_array> [retrieved on 20141112] *
JOHANNES BOEHM ET AL: "RM0-HOA Working Draft Text", 106. MPEG MEETING; 28-10-2013 - 1-11-2013; GENEVA; (MOTION PICTURE EXPERT GROUP OR ISO/IEC JTC1/SC29/WG11),, no. m31408, 23 October 2013 (2013-10-23), XP030059861 *

Also Published As

Publication number Publication date
KR102686291B1 (ko) 2024-07-19
JP2025003689A (ja) 2025-01-09
US20160336021A1 (en) 2016-11-17
JP6848004B2 (ja) 2021-03-24
CN111028849B (zh) 2024-03-01
US11869523B2 (en) 2024-01-09
JP2021081753A (ja) 2021-05-27
US20180240469A1 (en) 2018-08-23
CN118016077A (zh) 2024-05-10
CN105981100A (zh) 2016-09-28
JP7258063B2 (ja) 2023-04-14
US20230108008A1 (en) 2023-04-06
US20220115027A1 (en) 2022-04-14
CN111179951B (zh) 2024-03-01
EP3648102B1 (en) 2022-06-01
US20190214033A1 (en) 2019-07-11
EP3092641A1 (en) 2016-11-16
JP2019133200A (ja) 2019-08-08
JP6530412B2 (ja) 2019-06-12
KR20220085848A (ko) 2022-06-22
CN111182443A (zh) 2020-05-19
CN111179955B (zh) 2024-04-09
US20240185872A1 (en) 2024-06-06
EP4089675A1 (en) 2022-11-16
CN105981100B (zh) 2020-02-28
EP4554255A2 (en) 2025-05-14
US10424312B2 (en) 2019-09-24
KR20240116835A (ko) 2024-07-30
EP4089675B1 (en) 2025-02-19
US20190362731A1 (en) 2019-11-28
KR102409796B1 (ko) 2022-06-22
KR20210153751A (ko) 2021-12-17
JP2023076610A (ja) 2023-06-01
EP4554255A3 (en) 2025-06-25
US20250329338A1 (en) 2025-10-23
EP3092641B1 (en) 2019-11-13
JP7589876B2 (ja) 2024-11-26
JP2017508174A (ja) 2017-03-23
US11211078B2 (en) 2021-12-28
US12277948B2 (en) 2025-04-15
US10714112B2 (en) 2020-07-14
CN118248156A (zh) 2024-06-25
US20210027795A1 (en) 2021-01-28
US11488614B2 (en) 2022-11-01
US20200126579A1 (en) 2020-04-23
US10553233B2 (en) 2020-02-04
KR102338374B1 (ko) 2021-12-13
US10147437B2 (en) 2018-12-04
CN111182443B (zh) 2021-10-22
CN111179955A (zh) 2020-05-19
EP3648102A1 (en) 2020-05-06
CN111179951A (zh) 2020-05-19
US9990934B2 (en) 2018-06-05
KR20160106692A (ko) 2016-09-12
CN111028849A (zh) 2020-04-17

Similar Documents

Publication Publication Date Title
US20250329338A1 (en) Method and apparatus for decoding a bitstream including encoded higher order ambisonics representations
HK40079041B (en) Method and apparatus for improving the coding of side information required for coding a higher order ambisonics representation of a sound field
HK40123824A (en) Method and apparatus for improving the coding of side information required for coding a higher order ambisonics representation of a sound field
HK40079041A (en) Method and apparatus for improving the coding of side information required for coding a higher order ambisonics representation of a sound field
HK40019652B (zh) 包括编码hoa表示的位流的解码方法和装置、以及介质
HK40018256B (zh) 包括编码hoa表示的位流的解码方法和装置、以及介质
HK40020236B (zh) 包括编码hoa表示的位流的解码方法和装置、以及介质
HK1227165A1 (en) Method and apparatus for improving the coding of side information required for coding a higher order ambisonics representation of a sound filed

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 14815731

Country of ref document: EP

Kind code of ref document: A1

ENP Entry into the national phase

Ref document number: 2016544628

Country of ref document: JP

Kind code of ref document: A

WWE Wipo information: entry into national phase

Ref document number: 15110354

Country of ref document: US

NENP Non-entry into the national phase

Ref country code: DE

REEP Request for entry into the european phase

Ref document number: 2014815731

Country of ref document: EP

WWE Wipo information: entry into national phase

Ref document number: 2014815731

Country of ref document: EP

ENP Entry into the national phase

Ref document number: 20167021560

Country of ref document: KR

Kind code of ref document: A