WO2012059385A1 - Data structure for higher order ambisonics audio data - Google Patents

Data structure for higher order ambisonics audio data

Info

Publication number
WO2012059385A1
WO2012059385A1 PCT/EP2011/068782 EP2011068782W WO2012059385A1 WO 2012059385 A1 WO2012059385 A1 WO 2012059385A1 EP 2011068782 W EP2011068782 W EP 2011068782W WO 2012059385 A1 WO2012059385 A1 WO 2012059385A1
Authority
WO
WIPO (PCT)
Prior art keywords
hoa
ambisonics
data
coefficients
data structure
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Ceased
Application number
PCT/EP2011/068782
Other languages
English (en)
French (fr)
Inventor
Florian Keiler
Sven Kordon
Johannes Boehm
Holger Kropp
Johann-Markus Batke
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Thomson Licensing SAS
Original Assignee
Thomson Licensing SAS
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Thomson Licensing SAS
Priority to US13/883,094 (US9241216B2)
Priority to CN201180053153.7A (CN103250207B)
Priority to JP2013537071A (JP5823529B2)
Priority to BR112013010754-5A (BR112013010754B1)
Priority to EP11776422.5A (EP2636036B1)
Priority to KR1020137011661A (KR101824287B1)
Priority to HK14102354.0A (HK1189297B)
Priority to AU2011325335A (AU2011325335B8)
Publication of WO2012059385A1

Classifications

    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04R LOUDSPEAKERS, MICROPHONES, GRAMOPHONE PICK-UPS OR LIKE ACOUSTIC ELECTROMECHANICAL TRANSDUCERS; DEAF-AID SETS; PUBLIC ADDRESS SYSTEMS
    • H04R5/00 Stereophonic arrangements
    • H04R5/02 Spatial or constructional arrangements of loudspeakers
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00 Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00 Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/008 Multichannel audio signal coding or decoding using interchannel correlation to reduce redundancy, e.g. joint-stereo, intensity-coding or matrixing
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00 Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/04 Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
    • G10L19/16 Vocoder architecture
    • G10L19/167 Audio streaming, i.e. formatting and decoding of an encoded audio signal representation into a data stream for transmission or storage purposes
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04S STEREOPHONIC SYSTEMS
    • H04S3/00 Systems employing more than two channels, e.g. quadraphonic
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04S STEREOPHONIC SYSTEMS
    • H04S2420/00 Techniques used stereophonic systems covered by H04S but not provided for in its groups
    • H04S2420/11 Application of ambisonics in stereophonic audio systems

Definitions

  • the invention relates to a data structure for Higher Order Ambisonics audio data, which includes 2D and/or 3D spatial audio content data and which is also suited for HOA audio data having an order greater than '3'.
  • 3D Audio may be realised using a sound field description by a technique called Higher Order Ambisonics (HOA) as described below.
  • HOA Higher Order Ambisonics
  • Storing HOA data requires some conventions and stipulations on how this data must be used by a special decoder to be able to create loudspeaker signals for replay at a given reproduction speaker setup. No existing storage format defines all of these stipulations for HOA.
  • the B-Format (based on the extensible 'riff/wav' structure) with its *.amb file format realisation as described as of 30 March 2009 for example in Martin Leese, "File Format for B-
  • a problem to be solved by the invention is to provide an Ambisonics file format that is capable of storing two or more sound field descriptions at once, wherein the Ambisonics order can be greater than 3. This problem is solved by the data structure disclosed in claim 1 and the method disclosed in claim 12.
  • next-generation Ambisonics decoders will require either a lot of conventions and stipulations together with stored data to be processed, or a single file format where all related parameters and data elements can be coherently stored.
  • the inventive file format for spatial sound content can store one or more HOA signals and/or directional mono signals together with directional information, wherein Ambisonics orders greater than 3 and files >4GB are feasible.
  • Furthermore the inventive file format provides additional elements which existing formats do not offer:
  • Ambisonics wave information (plane, spherical, mixture types)
  • region of interest (sources outside the listening area or within)
  • reference radius (for decoding of spherical waves)
  • Position information of these directional signals can be described either using angle and distance information or an encoding vector of Ambisonics coefficients.
  • the inventive format allows storing data related to the Ambisonics order (Ambisonics channels) with different PCM-word size resolution as well as using restricted bandwidth.
  • Meta fields allow storing accompanying information about the file like recording information for microphone signals:
  • This file format for 2D and 3D audio content covers the storage of both Higher Order Ambisonics descriptions (HOA) as well as single sources with fixed or time-varying positions, and contains all information enabling next-generation audio decoders to provide realistic 3D Audio.
  • the inventive file format is also suited for streaming of audio content.
  • content dependent side info head data
  • the inventive file format also serves as a scene description where tracks of an audio scene can start and end at any time.
  • the inventive data structure is suited for HOA audio data, which data structure includes 2D and/or 3D spatial audio content data for one or more different HOA audio data stream descriptions, and which data structure is also suited for HOA audio data that have an order greater than '3', and which data structure in addition can include single audio signal source data and/or microphone array audio data from fixed or time-varying spatial positions.
  • the inventive method is suited for audio presentation, wherein an HOA audio data stream containing at least two different HOA audio data signals is received and at least a first one of them is used for presentation with a dense loudspeaker arrangement located at a distinct area of a presentation site, and at least a second and different one of them is used for presentation with a less dense loudspeaker arrangement surrounding said presentation site.
  • Fig. 1 holophonic reproduction in cinema with dense speaker arrangements at the frontal region and coarse speaker density surrounding the listening area;
  • Fig. 2 sophisticated decoding system
  • Fig. 3 HOA content creation from microphone array recording, single source recording, simple and complex sound field generation;
  • Fig. 5 2D decoding of HOA signals for simple surround loudspeaker setup, and 3D decoding of HOA signals for a holophonic loudspeaker setup for frontal stage and a more coarse 3D surround loudspeaker setup;
  • Fig. 8 exterior domain problem, wherein the sources are inside the region of interest/validity
  • Fig. 10 example for a HOA file containing multiple frames with multiple tracks
  • Fig. 1b shows the perceived direction of arrival of reproduced frontal sound waves, wherein the direction of arrival of plane waves matches different screen positions, i.e.
  • plane waves are suitable to reproduce depth.
  • Fig. 1c shows the perceived direction of arrival of reproduced spherical waves, which lead to better consistency of perceived sound direction and 3D visual action around the screen.
  • the need for two different HOA streams arises from the fact that the main visual action in a cinema takes place in the frontal region of the listeners.
  • the perceptive precision of detecting the direction of a sound is higher for frontal sound sources than for surrounding sources.
  • Therefore the precision of frontal spatial sound reproduction needs to be higher than the spatial precision for reproduced ambient sounds.
  • Holophonic means for sound reproduction (a high number of loudspeakers, a dedicated decoder and related speaker drivers) are required for the frontal screen region, while less costly technology is needed for ambient sound reproduction (lower density of speakers surrounding the listening area and less perfect decoding technology). Due to content creation and sound reproduction technologies, it is advantageous to supply one HOA representation for the ambient sounds and one HOA representation for the foreground action sounds, cf. Fig. 4. A cinema using a simple setup with simple, coarse sound reproduction equipment can mix both streams prior to decoding (cf. Fig. 5 upper part).
  • a more sophisticated cinema equipped with fully immersive reproduction means can use two decoders - one for decoding the ambient sounds and one specialised decoder for high-accuracy positioning of virtual sound sources for the foreground main action, as shown in the sophisticated decoding system in Fig. 2 and the bottom part of Fig. 5.
  • a special HOA file contains at least two tracks which represent HOA sound fields for ambient sounds $A_n^m(t)$ and for frontal sounds related to the visual main action $C_n^m(t)$.
  • Optional streams for directional effects may be provided.
  • Two corresponding decoder systems together with a panner provide signals for a dense frontal 3D holophonic loudspeaker system 21 and a less dense (i.e. coarse) 3D surround system 22.
  • the HOA data signal of the Track 1 stream represents the ambience sounds and is converted in a HOA converter 231 for input to a Decoder1 232 specialised for reproduction of ambience.
  • HOA signal data (frontal sounds related to visual scene) is converted in a HOA converter 241 for input to a distance corrected (Eq. (26)) filter 242 for best placement of spherical sound sources around the screen area with a dedicated Decoder2 243.
  • the directional data streams are directly panned to L speakers.
  • the three speaker signals are PCM mixed for joint reproduction with the 3D speaker system.
  • Fig. 3a natural recordings of sound fields are created by using microphone arrays.
  • the capsule signals are matrixed and equalised in order to form HOA signals.
  • Higher-order signals (Ambisonics order >1) are usually band-pass filtered to reduce artefacts due to capsule distance effects: low-pass filtered to reduce spatial alias at high frequencies, and high-pass filtered to reduce excessive low frequency levels with increasing Ambisonics order n ($h_n(k r_{d_{mic}})$, see Eq. (34)).
  • Optionally distance coding filtering may be applied, see Eqs. (25) and (27).
  • HOA format information is added to the track header.
  • Artistic sound field representations are usually created using multiple directional single source streams.
  • a single source signal can be captured as a PCM recording. This can be done by close-up microphones or by using microphones with high directivity.
  • the directional parameters $(r_s, \theta_s, \phi_s)$ of the sound source relative to a virtual best listening position are recorded (HOA coordinate system, or any reference point for later mapping).
  • the distance information may also be created by artistically placing sounds when rendering scenes for movies. As shown in Fig.
  • the directional information $(\theta_s, \phi_s)$ is then used to create the encoding vector, and the directional source signal is encoded into an Ambisonics signal, see Eq. (18).
  • This is equivalent to a plane wave representation.
  • a tailing filtering process may use the distance information $r_s$ to imprint a spherical source characteristic into the Ambisonics signal (Eq. (19)), or to apply distance coding filtering, Eqs. (25), (27).
  • the HOA format information is added to the track header. More complex wave field descriptions are generated by HOA mixing Ambisonics signals as depicted in Fig. 3d. Before storage, the HOA format information is added to the track header.
  • Fig. 4: frontal sounds related to the visual action are encoded with high spatial accuracy and mixed to a HOA signal (wave field) $C_n^m(t)$ and stored as Track 2.
  • the involved encoders encode with a high spatial precision and special wave types necessary for best matching the visual scene.
  • Track 1 contains the sound field $A_n^m(t)$ which is related to encoded ambient sounds with no restriction of source direction.
  • the spatial precision of the ambient sounds need not be as high as for the frontal sounds (consequently the Ambisonics order can be smaller) and the modelling of wave type is less critical.
  • the ambient sound field can also include reverberant parts of the frontal sound signals. Both tracks are multiplexed for storage and/or exchange.
  • directional sounds can be multiplexed to the file. These sounds can be special effects sounds, dialogs or classic information like a narrative speech for the visually impaired.
  • Fig. 5 shows the principles of decoding. As depicted in the upper part, a cinema with coarse loudspeaker setup can mix both HOA signals from Track1 and Track2 before simplified HOA decoding, and may truncate the order of Track2 and reduce the dimension of both tracks to 2D. In case a directional stream is present, it is encoded to 2D HOA. Then, all three streams are mixed to form a single HOA representation which is then decoded and reproduced.
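
The simplified path in the upper part of Fig. 5 (truncate the higher-order track, then mix) can be sketched as follows. This is only an illustration under stated assumptions: coefficients are held in a dictionary keyed by (order n, degree m), both tracks use the same normalisation and are sample-synchronous, and the function names `truncate_order` and `mix_tracks` are hypothetical. The reduction to 2D and the encoding of directional streams are not shown.

```python
from typing import Dict, List, Tuple

# Hypothetical in-memory layout: (order n, degree m) -> list of samples.
Coeffs = Dict[Tuple[int, int], List[float]]


def truncate_order(track: Coeffs, max_order: int) -> Coeffs:
    """Drop all coefficient channels whose order n exceeds max_order."""
    return {(n, m): samples for (n, m), samples in track.items() if n <= max_order}


def mix_tracks(a: Coeffs, b: Coeffs) -> Coeffs:
    """Sample-wise sum of two HOA tracks (assumes equal normalisation)."""
    mixed: Coeffs = {}
    for key in set(a) | set(b):
        sa, sb = a.get(key), b.get(key)
        if sa is None:
            mixed[key] = list(sb)
        elif sb is None:
            mixed[key] = list(sa)
        else:
            if len(sa) != len(sb):
                raise ValueError("tracks must have the same number of samples")
            mixed[key] = [x + y for x, y in zip(sa, sb)]
    return mixed


# Ambient track of order 1 and frontal track of order 2, two samples each.
ambient = {(0, 0): [1.0, 1.0], (1, 1): [0.5, 0.5]}
frontal = {(0, 0): [0.25, 0.5], (1, 1): [0.5, 0.0], (2, 2): [0.125, 0.125]}
mixed = mix_tracks(ambient, truncate_order(frontal, 1))
```

Keying by (n, m) avoids committing to a particular channel ordering, which the file format defines separately in its track parameters.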
  • the bottom part corresponds to Fig. 2.
  • a cinema equipped with a holophonic system for the frontal stage and a coarser 3D surround system will use dedicated sophisticated decoders and mix the speaker feeds.
  • HOA data representing the ambience sounds is converted to Decoder1 specialised for reproduction of ambience.
  • HOA frontal sounds related to visual scene
  • Eq. (26) distance corrected for best placement of spherical sound sources around the screen area with a dedicated Decoder2.
  • the directional data streams are directly panned to L speakers.
  • the three speaker signals are PCM mixed for joint reproduction with the 3D speaker system.
  • Sound field descriptions using Higher Order Ambisonics
  • the sound pressure is a function of spherical coordinates $r, \theta, \phi$ (see Fig. 7 for their definition) and spatial frequency
  • the $A_n^m(k)$ are called Ambisonics coefficients
  • $j_n(kr)$ is the spherical Bessel function of first kind
  • $Y_n^m(\theta, \phi)$ are called Spherical Harmonics (SH)
  • n is the Ambisonics order index
  • m indicates the degree.
  • the series can be stopped at some order n and restricted to a value N with sufficient accuracy.
  • N is called the Ambisonics order, and the term 'order' is usually also used in combination with the n in Bessel $j_n(kr)$ and Hankel $h_n(kr)$ functions.
  • the $B_n^m(k)$ are again called Ambisonics coefficients and $h_n(kr)$ denotes the spherical Hankel function of first kind and $n$-th order.
  • the formula assumes orthogonal-normalised SH.
  • the spherical harmonics $Y_n^m$ may be either complex or real valued.
  • the general case for HOA uses real valued spherical harmonics.
  • a unified description of Ambisonics using real and complex spherical harmonics may be reviewed in Mark
  • $N_{n,m}$ is a normalisation term which takes the following form for an orthogonal-normalised representation (! denotes factorial):
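
The equation itself did not survive in this text. For orientation only, the normalisation term commonly used in the literature for orthonormal complex spherical harmonics is given below; it is an assumption that the patent's own equation matches this form exactly.

```latex
N_{n,m} = \sqrt{\frac{2n+1}{4\pi}\,\frac{(n-|m|)!}{(n+|m|)!}}
```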
  • Real valued SH are derived by combining complex conjugate $Y_n^m$ corresponding to opposite values of m (the term $(-1)^m$ in the definition (6) is introduced to obtain unsigned expressions for the real SH, which is the usual case in Ambisonics):
  • the total number of spherical components $S_n^m$ for a given Ambisonics order N equals $(N+1)^2$.
  • Common normalisation schemes of the real valued spherical harmonics are given in Table 3.
  • the SH degree can only take values $m \in \{-n, n\}$.
  • the total number of components for a given N reduces to 2N+1 because components representing the inclination $\theta$ become obsolete and the spherical harmonics can be replaced by the circular harmonics given in Eq. (8).
  • the normalisation has an effect on the notation describing the pressure (cf. Eqs. (1), (2)) and all derived considerations.
  • the kind of normalisation also influences the Ambisonics coefficients.
  • CH to SH conversion and vice versa can also be applied to Ambisonics coefficients, for example when decoding a 3D Ambisonics representation (recording) with a 2D decoder for a 2D loudspeaker setting.
  • STM and f>TM
  • for 3D-2D conversion is depicted in the following scheme up to an Ambisonics order of
  • the Ambisonics coefficients form the Ambisonics signal and in general are a function of discrete time.
  • Table 5 shows the relationship between dimensional representation, Ambisonics order N and number of Ambisonics coefficients (channels) :
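
Table 5 is not reproduced in this text, but the relationship follows from the counts stated above: $(N+1)^2$ components for 3D and $2N+1$ for 2D. A minimal sketch (the function name `num_hoa_coefficients` is hypothetical):

```python
def num_hoa_coefficients(order: int, three_d: bool = True) -> int:
    """Number of Ambisonics coefficients (channels) for Ambisonics order N.

    3D representations use (N+1)^2 spherical components,
    2D representations use 2N+1 circular components.
    """
    if order < 0:
        raise ValueError("Ambisonics order must be non-negative")
    return (order + 1) ** 2 if three_d else 2 * order + 1


# Channel counts for orders 1..4 in 3D and 2D.
counts_3d = [num_hoa_coefficients(n) for n in range(1, 5)]
counts_2d = [num_hoa_coefficients(n, three_d=False) for n in range(1, 5)]
```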
  • the $A_0^0(n)$ signal can be regarded as a mono representation of the Ambisonics recording, having no directional information but being a representative for the general timbre impression of the recording.
  • $A_{n,\mathrm{N3D}}^m = \sqrt{2n+1}\,A_{n,\mathrm{SN3D}}^m$ for the SN3D to N3D case.
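
The SN3D to N3D conversion scales every coefficient of order n by $\sqrt{2n+1}$. A small sketch, assuming coefficients are held in a dictionary keyed by (order n, degree m); this layout and the function name `sn3d_to_n3d` are illustrative assumptions, not part of the file format:

```python
import math
from typing import Dict, List, Tuple

Coeffs = Dict[Tuple[int, int], List[float]]  # (order n, degree m) -> samples


def sn3d_to_n3d(coeffs: Coeffs) -> Coeffs:
    """Scale SN3D-normalised coefficients to N3D: factor sqrt(2n+1) per order n."""
    return {
        (n, m): [math.sqrt(2 * n + 1) * x for x in samples]
        for (n, m), samples in coeffs.items()
    }


sn3d = {(0, 0): [1.0, -1.0], (1, -1): [1.0, 0.5], (2, 0): [2.0, 0.0]}
n3d = sn3d_to_n3d(sn3d)
```

The order-0 channel is unchanged because $\sqrt{2 \cdot 0 + 1} = 1$.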
  • the B-Format and the AMB format use additional weights (Gerzon, Furse-Malham (FuMa), MaxN weights) which are applied to the coefficients.
  • the reference normalisation then usually is SN3D, cf. Jerome Daniel, "Representation de champs acoustiques, application a la transmission et a la reproduction de scenes sonores complexes dans un contexte multimedia", PhD thesis, Universite Paris 6, 2001, and Dave Malham, "3-D acoustic space and its simulation using ambisonics", http://www.dxarts.washington.edu/courses/567
  • the coefficients $d_n^m$ can either be derived by post-processed microphone array signals or can be created synthetically using a mono signal $P_{S_0}(t)$ in which case the directional spherical harmonics $Y_n^m(\theta_s, \phi_s, t)$ can be time-dependent as well (moving source).
  • Eq. (17) is valid for each temporal sampling instance v.
  • the process of synthetic encoding can be rewritten (for every sample instance v) in vector/matrix form for a selected Ambisonics order N:
  • size(d) [
  • the encoding vector can be derived from the spherical harmonics for the specific source direction $(\theta_s, \phi_s)$ (equal to the direction of the plane wave).
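
Synthetic encoding multiplies each mono sample by the encoding vector for the source direction. The sketch below illustrates this per-sample product. As a stand-in for the patent's harmonics it uses unnormalised 2D circular harmonics (1, cos nφ, sin nφ); that choice, the component ordering and the function names are assumptions made only for this example.

```python
import math
from typing import List


def circular_encoding_vector(order: int, azimuth: float) -> List[float]:
    """Illustrative 2D encoding vector with 2N+1 entries (unnormalised)."""
    vector = [1.0]
    for n in range(1, order + 1):
        vector.append(math.cos(n * azimuth))
        vector.append(math.sin(n * azimuth))
    return vector


def encode_mono(encoding_vector: List[float], mono: List[float]) -> List[List[float]]:
    """Return one coefficient channel per encoding-vector entry.

    Channel i holds encoding_vector[i] * mono[v] for every sample instance v.
    """
    return [[weight * sample for sample in mono] for weight in encoding_vector]


psi = circular_encoding_vector(order=1, azimuth=0.0)  # source straight ahead
channels = encode_mono(psi, [0.5, -0.5, 0.25])
```

For a moving source the encoding vector would be recomputed per sample (or per block) instead of once.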
  • Ambisonics coefficients describing incoming spherical waves generated by point sources (near field sources) for $r < r_s$ are:
  • $h_0^{(2)}$ is the zeroth-order spherical Hankel function of second kind.
  • Ambisonics assumes a reproduction of the sound field by L loudspeakers which are uniformly distributed on a circle or on a sphere.
  • When assuming that the L loudspeakers are placed far enough from the listener position, a plane-wave decoding model is valid at the centre ($r_s \to \infty$).
  • the sound pressure generated by L loudspeakers is described by:
  • $p(r, \theta, \phi, k)$ (20), with $w_l$ being the signal for loudspeaker $l$ and having the unit scale of a sound pressure, 1 Pa.
  • $w_l$ is often called the driving function of loudspeaker $l$.
  • y can then be derived using a couple of known methods, e.g. mode matching, or by methods which optimise for special speaker panning functions.
  • the speaker signals $w_l$ are determined by the pressure in the origin.
  • $C_n^m$, the reference distance $r_{l\_ref}$ and an indicator that spherical distance coded coefficients are used.
  • a simple decoding processing as given in Eq. (22) is feasible as long as the real speaker distance $r_l \approx r_{l\_ref}$. If that difference is too large, a correc-
  • the normalisation of the Spherical Harmonics can have an influence on the formulation of distance coded Ambisonics, i.e. Distance Coded Ambisonics coefficients need a defined context.
  • the conversion factor a- ⁇ D_ to convert a 2D circular component
  • $G(r|r_s)$ can also be expressed in spherical harmonics for $r < r_s$, in terms of $Y_n^m(\theta, \phi)$ and $Y_n^m(\theta_s, \phi_s)$, see Eq. (33).
  • $h_n^{(2)}$ is the Hankel function of second kind. Note that the Green's function has a scale of unit meter$^{-1}$.
  • the storage format according to the invention allows storing more than one HOA representation and additional directional streams together in one data container. It enables different formats of HOA descriptions which enable decoders to optimise reproduction, and it offers an efficient data storage for sizes >4GB. Further advantages are:
  • Ambisonics coefficient packing and scaling information; Ambisonics wave type (plane, spherical), reference radius (for decoding of spherical waves);
  • Position information of these directional signals can be described using either angle and distance information or an encoding-vector of Ambisonics coefficients.
  • Metadata fields are available for associating tracks for special decoding (frontal, ambient) and for allowing storage of accompanying information about the file, like recording information for microphone signals:
  • the format is suitable for storage of multiple frames containing different tracks, allowing audio scene changes without a scene description.
  • one track contains a HOA sound field description or a single source with position information.
  • a frame is the combination of one or more parallel tracks.
  • Tracks may start at the beginning of a frame or end at the end of a frame, therefore no time code is required.
  • the format facilitates fast access of audio track data (fast-forward or jumping to cue points) and determining a time code relative to the time of the beginning of file data.
  • Table 6 summarises the parameters required to be defined for a non-ambiguous exchange of HOA signal data.
  • the definition of the spherical harmonics is fixed for the complex-valued and the real-valued cases, cf. Eqs. (3), (6).
  • the file format for storing audio scenes composed of Higher Order Ambisonics (HOA) or single sources with position information is described in detail.
  • the audio scene can contain multiple HOA sequences which can use different normalisation schemes.
  • a decoder can compute the corresponding loudspeaker signals for the desired loudspeaker setup as a superposition of all audio tracks from a current file.
  • the file contains all data required for decoding the audio content.
  • the file format according to the invention offers the feature of storing more than one HOA or single source signal in a single file.
  • the file format uses a composition of frames, each of which can contain several tracks, wherein the data of a track is stored in one or more packets called TrackPackets.
  • Constant identifiers ID which identify the beginning of a frame, track or chunk, and strings are defined as data type byte.
  • the byte order of byte arrays is most significant byte and bit first. Therefore the ID 'TRCK' is defined in a 32-bit byte field wherein the bytes are written in the physical order 'T', 'R', 'C' and 'K' (<0x54; 0x52; 0x43; 0x4b>).
  • Hexadecimal values start with '0x' (e.g. 0xAB64C5).
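
The byte layout of such an identifier can be checked with a few lines of Python. This is only a sketch of the most-significant-byte-first convention; the helper `starts_with_id` is hypothetical and not part of the format.

```python
import struct

track_id = b"TRCK"

# Physical byte order: 'T', 'R', 'C', 'K'.
as_bytes = list(track_id)

# Read as one big-endian (most significant byte first) 32-bit value.
as_uint32 = struct.unpack(">I", track_id)[0]


def starts_with_id(buffer: bytes, identifier: bytes) -> bool:
    """True if the buffer begins with the given constant identifier."""
    return buffer[: len(identifier)] == identifier
```

A reader scanning a stream could use such a comparison to locate the beginning of a track before parsing the remaining header fields.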
  • Header field names always start with the header name followed by the field name, wherein the first letter of each word is capitalised (e.g. TrackHeaderSize).
  • the HOA File Format can include more than one Frame, Packet or Track. For the discrimination of multiple header fields a number can follow the field or header name. For example, the second TrackPacket of the third Track is named
  • the HOA file format can include complex-valued fields. These complex values are stored as real and imaginary part wherein the real part is written first.
  • the complex number 1+i2 in 'int8' format would be stored as '0x01' followed by '0x02'.
  • fields or coefficients in a complex-value format type require twice the storage size as compared to the corresponding real-value format type.
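
The real-part-first rule for complex values can be illustrated with Python's `struct` module. The function names are hypothetical; only the 'int8' case from the example above is shown.

```python
import struct


def pack_complex_int8(real: int, imag: int) -> bytes:
    """Store a complex int8 value as real part followed by imaginary part."""
    return struct.pack(">bb", real, imag)


def unpack_complex_int8(data: bytes) -> complex:
    """Read back a complex int8 value written by pack_complex_int8."""
    real, imag = struct.unpack(">bb", data[:2])
    return complex(real, imag)


stored = pack_complex_int8(1, 2)  # the complex number 1+i2
restored = unpack_complex_int8(stored)
```

The packed value occupies two bytes, twice the size of a real 'int8' value.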
  • the Higher Order Ambisonics file format includes at least one FileHeader, one FrameHeader, one TrackHeader and one TrackPacket as depicted in Fig. 9, which shows a simple example HOA file format file that carries one Track in one or more Packets.
  • HOA file is one FileHeader followed by a Frame that includes at least one Track.
  • a Track always consists of a TrackHeader and one or more TrackPackets.
  • the HOA File can contain more than one Frame, wherein a Frame can contain more than one Track.
  • a new FrameHeader is used if the maximal size of a Frame is exceeded or if Tracks are added or removed from one Frame to the next.
  • the structure of a multiple Track and Frame HOA File is shown in Fig. 10.
  • the structure of a multiple Track Frame starts with the FrameHeader followed by all TrackHeaders of the Frame. Consequently, the TrackPackets of each Track are sent successively after the FrameHeader and TrackHeaders, wherein the TrackPackets are interleaved in the same order as the TrackHeaders.
  • each Track is synchronised, e.g. the samples of Track1Packet1 are synchronous to the samples of Track2Packet1.
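
One reading of this layout is: FrameHeader, then all TrackHeaders, then the first packet of every track, then the second packet of every track, and so on. The sketch below lists the elements of one frame in that order. The labels of the form `Track1Header` are hypothetical names used only for this illustration, and the interleaving per packet index is an assumption based on the description above.

```python
from typing import List


def frame_layout(num_tracks: int, packets_per_track: int) -> List[str]:
    """List the elements of one multi-track Frame in storage order."""
    layout = ["FrameHeader"]
    # All TrackHeaders follow the FrameHeader.
    layout += [f"Track{t}Header" for t in range(1, num_tracks + 1)]
    # Packets are interleaved: same packet index for all tracks, in header order.
    for p in range(1, packets_per_track + 1):
        for t in range(1, num_tracks + 1):
            layout.append(f"Track{t}Packet{p}")
    return layout


layout = frame_layout(num_tracks=2, packets_per_track=2)
```

With this ordering, packets that carry synchronous samples (for example Track1Packet1 and Track2Packet1) sit next to each other in the file.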
  • Specific TrackCodingTypes can cause a delay at decoder side, and such specific delay needs to be known at decoder side, or is to be included in the TrackCodingType dependent part of the TrackHeader, because the decoder synchronises all TrackPackets to the maximal delay of all Tracks of a Frame.
  • Metadata that refer to the complete HOA File can optionally be added after the FileHeader in MetaDataChunks.
  • Fig. 11 shows the structure of a HOA file format using several MetaDataChunks .
  • a Track of the HOA Format differentiates between a general HOATrack and a SingleSourceTrack.
  • the HOATrack includes the complete sound field coded as HOACoefficients. Therefore, a scene description, e.g. the positions of the encoded
  • the SingleSourceTrack includes only one source coded as PCM samples together with the position of the source within an audio scene. Over time, the position of the SingleSourceTrack can be fixed or variable.
  • the source position is sent as TrackHOAEncodingVector or TrackPositionVector.
  • the TrackHOAEncodingVector contains the HOA encoding values for obtaining the HOACoefficient for each sample.
  • the TrackPositionVector contains the position of the source as angle and distance with respect to the centre listening position.
  • the FileHeader includes all constant information for the complete HOA File.
  • the FileID is used for identifying the HOA File Format.
  • the sample rate is constant for all Tracks even if it is sent in the FrameHeader.
  • HOA Files that change their sample rate from one frame to another are invalid.
  • the number of Frames is indicated in the FileHeader to indicate the Frame structure to the decoder.
  • the FrameHeader holds the constant information of all Tracks of a Frame and indicates changes within the HOA File.
  • the FrameID and the FrameSize indicate the beginning of a Frame and the length of the Frame. These two fields allow an easy access of each frame and a crosscheck of the Frame structure. If the Frame length requires more than 32 bits, one Frame can be split into several Frames. Each Frame has a unique FrameNumber. The FrameNumber should start with 0 and should be incremented by one for each new Frame.
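
The constraints on FrameNumber and on the sample rate (constant across the file) lend themselves to a simple consistency check. The sketch below is illustrative only: the `FrameInfo` class and `check_frames` function are hypothetical, and the actual header field widths and parsing are not shown.

```python
from dataclasses import dataclass
from typing import List


@dataclass
class FrameInfo:
    """Hypothetical summary of the fields read from one FrameHeader."""
    frame_number: int
    sample_rate: int


def check_frames(file_sample_rate: int, frames: List[FrameInfo]) -> List[str]:
    """Return a list of human-readable problems; empty if the frames look consistent."""
    problems = []
    for index, frame in enumerate(frames):
        # FrameNumber should start with 0 and increase by one per Frame.
        if frame.frame_number != index:
            problems.append(
                f"frame {index}: FrameNumber is {frame.frame_number}, expected {index}"
            )
        # Files that change their sample rate between frames are invalid.
        if frame.sample_rate != file_sample_rate:
            problems.append(
                f"frame {index}: sample rate {frame.sample_rate} "
                f"differs from file rate {file_sample_rate}"
            )
    return problems


ok = check_frames(48000, [FrameInfo(0, 48000), FrameInfo(1, 48000)])
bad = check_frames(48000, [FrameInfo(0, 48000), FrameInfo(2, 44100)])
```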
  • the number of samples of the Frame is constant for all Tracks of a Frame. The number of Tracks within the Frame is constant for the Frame.
  • a new FrameHeader is sent for ending or starting Tracks at a desired sample position.
  • the samples of each Track are stored in Packets.
  • the size of these TrackPackets is indicated in samples and is constant for all Tracks.
  • the number of Packets is equal to the integer number that is required for storing the number of samples of the Frame. Therefore the last Packet of a Track can contain fewer samples than the indicated Packet size.
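
In other words, the packet count is the frame's sample count divided by the packet size, rounded up, and the last packet holds the remainder. A minimal sketch (the function name `packet_layout` is hypothetical):

```python
from typing import Tuple


def packet_layout(frame_samples: int, packet_size: int) -> Tuple[int, int]:
    """Return (number of packets, samples in the last packet) for one Track."""
    if frame_samples < 0 or packet_size <= 0:
        raise ValueError("frame_samples must be >= 0 and packet_size > 0")
    # Ceiling division: smallest integer number of packets holding all samples.
    num_packets = -(-frame_samples // packet_size)
    if num_packets == 0:
        return 0, 0
    last_packet_samples = frame_samples - (num_packets - 1) * packet_size
    return num_packets, last_packet_samples


partial = packet_layout(frame_samples=1000, packet_size=256)
exact = packet_layout(frame_samples=1024, packet_size=256)
```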
  • the sample rate of a frame is equal to the FileSampleRate and is indicated in the FrameHeader to allow decoding of a Frame without knowledge of the FileHeader. This can be used when decoding from the middle of a multi frame file without knowledge of the FileHeader, e.g. for streaming applications.
  • the term 'dyn' refers to a dynamic field size due to conditional fields.
  • the TrackHeader holds the constant information for the Packets of the specific Track.
  • the TrackHeader is separated into a constant part and a variable part for two TrackSourceTypes .
  • the TrackHeader starts with a constant TrackID for verification and identification of the beginning of the TrackHeader.
  • a unique TrackNumber is assigned to each Track to indicate coherent Tracks over Frame borders. Thus, a track with the same TrackNumber can occur in the following frame.
  • the TrackHeaderSize is provided for skipping to the next TrackHeader and it is indicated as an offset from the end of the TrackHeaderSize field.
  • the TrackMetaDataOffset provides the number of samples to jump directly to the beginning of the TrackMetaData field, which can be used for skipping the variable length part of the TrackHeader.
  • a TrackMetaDataOffset of zero indicates that the TrackMetaData field does not exist.
  • Depending on the TrackSourceType, the HOATrackHeader or the SingleSourceTrackHeader is provided.
  • the HOATrackHeader provides the side information for standard HOA coefficients that describe the complete sound field.
  • the SingleSourceTrackHeader holds information for the samples of a mono PCM track and the position of the source. For SingleSourceTracks the decoder has to include the Tracks into the scene.
  • TrackMetaData field which uses the XML format for providing track dependent Metadata, e.g. additional information for A-format transmission (microphone-array signals).
  • TrackRegionLastBin 16 uint16 last coded MDCT bin (upper cut-off frequency)
  • Downsampling factor IW must be a divider of
  • the HOATrackHeader is a part of the TrackHeader that holds information for decoding a HOATrack.
  • the TrackPackets of a HOATrack transfer HOA coefficients that code the entire sound field of a Track. Basically the HOATrackHeader holds all HOA parameters that are required at decoder side for decoding the HOA coefficients for the given speaker setup.
  • the TrackComplexValueFlag and the TrackSampleFormat define the format type of the HOA coefficients of each TrackPacket. For encoded or compressed coefficients the TrackSampleFormat defines the format of the decoded or uncompressed coefficients. All format types can be real or complex numbers. More information on complex numbers is provided in the above section File Format Details.
  • All HOA-dependent information is defined in the TrackHOAParams.
  • the TrackHOAParams are re-used in other TrackSourceTypes. Therefore, the fields of the TrackHOAParams are defined and described in section TrackHOAParams.
  • the TrackCodingType field indicates the coding (compression) format of the HOA coefficients.
  • the basic version of the HOA file format includes e.g. two CodingTypes.
  • the order and the normalisation of the HOA coefficients are defined in the TrackHOAParams fields.
  • a second CodingType allows changing the sample format and limiting the bandwidth of the coefficients of each HOA order.
  • the TrackBandwidthReductionType determines the type of processing that has been used to limit the bandwidth of each HOA order. If the bandwidth of all coefficients is unaltered, the bandwidth reduction can be switched off by setting the TrackBandwidthReductionType field to zero.
  • Two other bandwidth reduction processing types are defined.
  • the format includes a frequency domain MDCT processing and optionally a time domain filter processing. For more information on the MDCT processing see section Bandwidth reduction via MDCT.
  • the HOA orders can be combined into regions of the same sample format and bandwidth.
  • the TrackRegionUseBandwidthReduction indicates the usage of the bandwidth reduction processing for the coefficients of the orders of the region. If the TrackRegionUseBandwidthReduction flag is set, the bandwidth reduction side information will follow.
  • the window type and the first and last coded MDCT bin are defined. Hereby the first bin is equivalent to the lower cut-off frequency and the last bin defines the upper cut-off frequency.
  • the MDCT bins are also coded in the TrackRegionSampleFormat, cf. section Bandwidth reduction via MDCT.
  • Single Sources are subdivided into fixed position and moving position sources.
  • the source type is indicated in the TrackMovingSourceFlag.
  • the difference between the moving and the fixed position source type is that the position of a fixed source is indicated only once in the TrackHeader, whereas the position of a moving source is indicated in each TrackPacket.
  • the position of a source can be indicated explicitly with the position vector in spherical coordinates or implicitly as an HOA encoding vector.
  • the source itself is a PCM mono track that has to be encoded to HOA coefficients at decoder side if an Ambisonics decoder is used for playback.
  • the fixed position source type is defined by a TrackMovingSourceFlag of zero.
  • the second field indicates the TrackPositionType that gives the coding of the source position as a vector in spherical coordinates or as an HOA encoding vector.
  • the coding format of the mono PCM samples is indicated by the TrackSampleFormat field. If the source position is sent as TrackPositionVector, the spherical coordinates of the source position are defined in the fields TrackPositionTheta (inclination from the z-axis to the x/y-plane), TrackPositionPhi (azimuth counter-clockwise starting at the x-axis) and TrackPositionRadius.
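As a rough illustration of the coordinate convention described above (inclination measured from the z-axis towards the x/y-plane, azimuth counter-clockwise from the x-axis), the following sketch converts a position vector into Cartesian coordinates. The function name is hypothetical, and the use of radians is an assumption, since the unit of the angle fields is not stated here.

```python
import math


def position_to_cartesian(theta, phi, radius):
    """Convert (inclination, azimuth, radius) to Cartesian (x, y, z).

    theta:  inclination from the z-axis, in radians (unit assumed)
    phi:    azimuth, counter-clockwise from the x-axis, in radians (unit assumed)
    radius: distance of the source from the origin
    """
    x = radius * math.sin(theta) * math.cos(phi)
    y = radius * math.sin(theta) * math.sin(phi)
    z = radius * math.cos(theta)
    return x, y, z
```

With this convention a source at inclination pi/2 and azimuth 0 lies on the positive x-axis, and a source at inclination 0 lies on the positive z-axis.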
  • the TrackHOAParams are defined first. These parameters are defined in section TrackHOAParams and indicate the used normalisations and definitions of the HOA encoding vector.
  • the TrackEncodeVectorComplexFlag and the TrackEncodeVectorFormat field define the format type of the following TrackHOAEncodingVector.
  • the TrackHOAEncodingVector consists of TrackHOAParamNumberOfCoeffs values that are either coded in the 'float32' or 'float64' format.
  • the moving position source type is defined by a TrackMovingSourceFlag of '1'. The header is identical to the fixed source header except that the source position data fields TrackPositionTheta, TrackPositionPhi, TrackPositionRadius and TrackHOAEncodingVector are absent. For moving sources these are located in the TrackPackets to indicate the new (moving) source position in each Packet.
  • the format according to the invention allows storage of most known HOA representations.
  • the TrackHOAParams are defined to clarify which kind of normalisation and order sequence of coefficients has been used at the encoder side. These definitions have to be taken into account at decoder side for the mixing of HOA tracks and for applying the decoder matrix.
  • HOA coefficients can be applied for the complete three-dimensional sound field or only for the two-dimensional x/y-plane.
  • the dimension of the HOATrack is defined by the
  • the TrackHOAParamRegionOfInterest reflects two sound pressure expansions in series whereby the sources reside inside or outside the region of interest, and the region of interest does not contain any sources.
  • the computation of the sound pressure for the interior and exterior cases is defined in above equations (1) and (2), respectively, whereby the directional information of the HOA signal $A_n^m(k)$ is determined by the conjugated complex spherical harmonic
  • TrackHOAParamSphericalHarmonicType indicates which kind of spherical harmonic function has been applied at encoder side.
  • the spherical harmonic function is defined by the associated Legendre functions and a complex or real trigonometric function.
  • the associated Legendre functions are defined by Eq. (5).
  • the complex-valued spherical harmonic representation is
  • $N_{n,m}$ is a scaling factor (cf. Eq. (3)).
  • This complex-valued representation can be transformed into a real-valued representation using the following equation:
  • the real-valued representation of the circular harmonic is defined by .
  • the dedicated value of the TrackHOAParamSphericalHarmonicNorm field is available.
  • the scaling factor for each HOA coefficient is defined at the end of the TrackHOAParams.
  • the dedicated scaling factors TrackScalingFactors can be transmitted as real or complex 'float32' or 'float64' values.
  • the scaling factor format is defined in the TrackComplexValueScalingFlag and TrackScalingFormat fields in case of dedicated scaling.
  • the Furse-Malham normalisation can be applied additionally to the coded HOA coefficients for equalising the amplitudes of the coefficients of different HOA orders to absolute values of less than 'one' for a transmission in integer format types.
  • the Furse-Malham normalisation was designed for the SN3D real-valued spherical harmonic function up to order three coefficients. Therefore it is recommended to use the Furse-Malham normalisation only in combination with the SN3D real-valued spherical harmonic function.
  • the TrackHOAParamFurseMalhamFlag is ignored for Tracks with an HOA order greater than three.
  • the Furse-Malham normalisation has to be inverted at decoder side for decoding the HOA coefficients. Table 8 defines the Furse-Malham coefficients.
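A minimal sketch of applying and inverting such a per-coefficient normalisation is given below. The weights are the Furse-Malham factors as commonly published for SN3D-normalised coefficients in B-format channel order; they are quoted here from general Ambisonics practice as an assumption, and Table 8 remains the authoritative definition for this format. The function and constant names are hypothetical.

```python
import math

# B-format channel labels up to order three (assumed channel order).
B_FORMAT_SEQUENCE = "WXYZRSTUVKLMNOPQ"

# Commonly published Furse-Malham weights relative to SN3D (assumption;
# the values of Table 8 take precedence if they differ).
FUMA_WEIGHTS = {
    "W": 1.0 / math.sqrt(2.0),
    "X": 1.0, "Y": 1.0, "Z": 1.0,
    "R": 1.0,
    "S": 2.0 / math.sqrt(3.0), "T": 2.0 / math.sqrt(3.0),
    "U": 2.0 / math.sqrt(3.0), "V": 2.0 / math.sqrt(3.0),
    "K": 1.0,
    "L": math.sqrt(45.0 / 32.0), "M": math.sqrt(45.0 / 32.0),
    "N": 3.0 / math.sqrt(5.0), "O": 3.0 / math.sqrt(5.0),
    "P": math.sqrt(8.0 / 5.0), "Q": math.sqrt(8.0 / 5.0),
}


def furse_malham(coeffs, invert=False):
    """Scale the coefficients of one time sample (encoder), or undo it (decoder)."""
    if len(coeffs) > len(B_FORMAT_SEQUENCE):
        # Orders above three: the flag is ignored, coefficients pass unchanged.
        return list(coeffs)
    scaled = []
    for label, value in zip(B_FORMAT_SEQUENCE, coeffs):
        weight = FUMA_WEIGHTS[label]
        scaled.append(value / weight if invert else value * weight)
    return scaled
```

A decoder would call `furse_malham(coeffs, invert=True)` before any further processing, so that the remaining decoder stages see the originally normalised coefficients.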
  • the TrackHOAParamDecoderType defines which kind of decoder the encoder assumes to be present at decoder side.
  • the decoder type determines the loudspeaker model (spherical or plane wave) that is to be used at decoder side for rendering the sound field.
  • the computational complexity of the decoder can be reduced by shifting parts of the decoder equation to the encoder equation.
  • numerical issues at encoder side can be reduced.
  • the decoder can be reduced to an identical processing for all HOA coefficients because all inconsistencies at decoder side can be moved to the encoder.
  • the TrackHOAParamDecoderType normalisation of the HOA coefficients $C_n^m$ depends on the usage of the interior or exterior sound field expansion in series selected in TrackHOAParamRegionOfInterest.
  • coefficients $d_n^m$ in Eq. (18) and the following equations correspond to coefficients $C_n^m$ in the following.
  • the coefficients $C_n^m$ are determined from the coefficients $A_n^m$ or $B_n^m$ as defined in Table 9, and are stored.
  • the used normalisation is indicated in the TrackHOAParamDecoderType field of the TrackHOAParam header:
  • the HOA coefficients for one time sample comprise TrackHOAParamNumberOfCoeffs ($O$) coefficients $C_n^m$.
  • $O$ depends on the dimension of the HOA coefficients.
  • For 2D soundfields $O$ is equal to $2N + 1$, where $N$ is equal to the TrackHOAParamHorizontalOrder field from the TrackHOAParam header.
  • the mixed-order decoding will be performed. In mixed-order signals some higher-order coefficients are transmitted only in 2D.
  • the TrackHOAParamVerticalOrder field determines the vertical order where all coefficients are transmitted. From the vertical order to the TrackHOAParamHorizontalOrder only the 2D coefficients are used. Thus the TrackHOAParamHorizontalOrder is equal to or greater than the TrackHOAParamVerticalOrder.
  • An example of a mixed-order representation with a horizontal order of four and a vertical order of two is depicted in Table 11.
  • Table 11: Representation of HOA coefficients for a mixed-order representation of vertical order two and horizontal order four.
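The coefficient counts described above can be sketched as follows. The function name and arguments are illustrative only. The 2D count 2N + 1 is stated in the text and (N + 1)^2 is the usual count for a full 3D sound field of order N; the mixed-order count is derived here from the description that all coefficients are sent up to the vertical order and only the two 2D coefficients per order above it.

```python
def hoa_coeff_count(horizontal_order, vertical_order=None, is_2d=False):
    """Number of HOA coefficients per time sample.

    horizontal_order: value of the horizontal order field
    vertical_order:   value of the vertical order field (defaults to the
                      horizontal order, i.e. a full 3D representation)
    is_2d:            True for a purely two-dimensional sound field
    """
    if is_2d:
        return 2 * horizontal_order + 1
    if vertical_order is None:
        vertical_order = horizontal_order
    if vertical_order > horizontal_order:
        raise ValueError("vertical order must not exceed horizontal order")
    full_3d = (vertical_order + 1) ** 2                  # all coefficients up to the vertical order
    only_2d = 2 * (horizontal_order - vertical_order)    # two 2D coefficients per higher order
    return full_3d + only_2d
```

For the mixed-order example of horizontal order four and vertical order two this gives 9 + 4 = 13 coefficients, compared with 25 for a full 3D representation of order four.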
  • the HOA coefficients $C_n^m$ are stored in the Packets of a Track.
  • the sequence of the coefficients, i.e. which coefficient comes first and which follow, has been defined differently in the past. Therefore, the field TrackHOAParamCoeffSequence indicates three types of coefficient sequences. The three sequences are derived from the HOA coefficient arrangement of Table 10.
  • the B-Format sequence uses a special wording for the HOA coefficients up to the order of three as shown in Table 12:
  • the HOA coefficients are transmitted from the lowest to the highest order, wherein the HOA coefficients of each order are transmitted in alphabetic order.
  • the coefficients of a 3D setup of the HOA order three are stored in the sequence W, X, Y, Z, R, S, T, U, V, K, L, M, N, O, P and Q.
  • the B-format is defined up to the third HOA order only.
  • the supplemental 3D coefficients are ignored, e.g. W, X, Y, U, V, P, Q.
  • This Packet contains the HOA coefficients in the order defined in the TrackHOAParamCoeffSequence, wherein all coefficients of one time sample are transmitted successively.
  • This Packet is used for standard HOA Tracks with a TrackSourceType of zero and a TrackCodingType of zero.
  • the dynamic resolution Packet is used for a TrackSourceType of 'zero' and a TrackCodingType of 'one'.
  • the different resolutions of the TrackOrderRegions lead to different storage sizes for each TrackOrderRegion. Therefore, the HOA coefficients are stored in a de-interleaved manner, i.e. all coefficients of one HOA order are stored successively.
  • the Single Source fixed Position Packet is used for a TrackSourceType of 'one' and a TrackMovingSourceFlag of 'zero'.
  • the Packet holds the PCM samples of a mono source.
  • the Single Source moving Position Packet is used for a TrackSourceType of 'one' and a TrackMovingSourceFlag of 'one'. It holds the mono PCM samples and the position information for the samples of the TrackPacket.
  • the PacketDirectionFlag indicates if the direction of the Packet has been changed or the direction of the previous Packet should be used. To ensure decoding from the beginning of each Frame, the PacketDirectionFlag equals 'one' for the first moving source TrackPacket of a Frame.
  • the direction information of the following PCM sample source is transmitted.
  • the direction information is sent as TrackPositionVector in spherical coordinates or as TrackHOAEncodingVector with the defined TrackEncodingVectorFormat.
  • the TrackEncodingVector generates HOA coefficients that conform to the HOAParamHeader field definitions.
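The handling of the PacketDirectionFlag described above can be sketched as a small state machine. This is only an illustration under stated assumptions: the packet representation as `(flag, direction)` tuples and the function name are hypothetical, and a flag value of 'one' is taken to mean that the Packet carries new direction information.

```python
def resolve_directions(packets):
    """Return the direction that applies to each moving-source TrackPacket of one Frame.

    packets: list of (packet_direction_flag, direction) tuples, where direction
             is any position description and is None when the flag is 0.
    """
    current = None
    resolved = []
    for flag, direction in packets:
        if flag == 1:
            current = direction          # new direction transmitted in this Packet
        elif current is None:
            # The first moving-source Packet of a Frame must carry a direction,
            # otherwise decoding could not start at the Frame border.
            raise ValueError("first TrackPacket of a Frame has no direction")
        resolved.append(current)         # flag 0: re-use the previous direction
    return resolved
```

Because the state is reset per Frame, a decoder that starts in the middle of a file never depends on a direction transmitted in an earlier Frame.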
  • HOA signals can be derived from Soundfield recordings with microphone arrays.
  • the Eigenmike disclosed in WO 03/061336 A1 can be used for obtaining HOA recordings of order three.
  • the finite size of the microphone arrays leads to restrictions for the recorded HOA coefficients.
  • In WO 03/061336 A1 and in the above-mentioned article "Three-dimensional surround sound systems based on spherical harmonics", issues caused by finite microphone arrays are discussed.
  • the distance of the microphone capsules results in an upper frequency boundary given by the spatial sampling theorem.
  • the microphone array cannot produce correct HOA coefficients.
  • the finite distance of the microphone from the HOA listening position requires an equalisation filter.
  • These filters have high gains at low frequencies, which increase further with each HOA order.
  • In WO 03/061336 A1 a lower cut-off frequency for the higher order coefficients is introduced in order to handle the dynamic range of the equalisation filter. This shows that the bandwidth of HOA coefficients of different HOA orders can differ. Therefore the HOA file format offers the TrackRegionBandwidthReduction that enables the transmission of only the required frequency bandwidth for each HOA order. Due to the high dynamic range of the equalisation filter and due to the fact that the zero order coefficient is basically the sum of all microphone signals, the coefficients of different HOA orders can have different dynamic ranges.
  • the HOA file format also offers the feature of adapting the format type to the dynamic range of each HOA order.
  • the interleaved HOA coefficients are fed into the first de-interleaving step or stage 1211, which is assigned to the first TrackRegion and separates all HOA coefficients of the TrackRegion into de-interleaved buffers of FramePacketSize samples.
  • the coefficients of the TrackRegion are derived from the TrackRegionLastOrder and TrackRegionFirstOrder fields of the HOA Track Header.
  • De-interleaving means that coefficients $C_n^m$ for one combination of n and m are grouped into one buffer. From the de-interleaving step or stage 1211 the de-interleaved HOA coefficients are passed to the TrackRegion encoding section.
  • the remaining interleaved HOA coefficients are passed to the following TrackRegion de-interleave step or stage, and so on until de-interleaving step or stage 121N.
  • the number N of de-interleaving steps or stages is equal to TrackNumberOfOrderRegions plus 'one'.
  • the additional de-interleaving step or stage 125 de-interleaves the remaining coefficients that are not part of the TrackRegion into a standard processing path including a format conversion step or stage 126.
  • the TrackRegion encoding path includes an optional bandwidth reduction step or stage 1221 and a format conversion step or stage 1231 and performs a parallel processing for each HOA coefficient buffer.
  • the bandwidth reduction is performed if the TrackRegionUseBandwidthReduction field is set to 'one'.
  • a processing is selected that limits the frequency range of the HOA coefficients and critically downsamples them. This is performed in order to reduce the number of HOA coefficients to the minimum required number of samples.
  • the format conversion converts the current HOA coefficient format to the TrackRegionSampleFormat defined in the HOATrack header. This is the only step/stage in the standard processing path that converts the HOA coefficients to the indicated TrackSampleFormat of the HOA Track Header.
  • the multiplexer TrackPacket step or stage 124 multiplexes the HOA coefficient buffers into the TrackPacket data file stream as defined in the selected TrackHOAParamCoeffSequence field, wherein the coefficients $C_n^m$ for one combination of n and m indices stay de-interleaved (within one buffer).
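The de-interleaving into TrackRegions described above can be sketched as follows. This is not the exact processing of the format: the function name is hypothetical, a full 3D representation is assumed, and the coefficients are assumed to be sorted by ascending order n, so that order n occupies the indices n^2 to (n + 1)^2 - 1 within one time sample. The actual position of each coefficient depends on the signalled coefficient sequence.

```python
def split_into_regions(packet, hoa_order, regions):
    """De-interleave one packet and group the buffers by TrackRegion.

    packet:    flat list in which all coefficients of one time sample
               are stored successively (interleaved layout)
    hoa_order: HOA order of the (full 3D) Track
    regions:   list of (first_order, last_order) tuples, one per TrackRegion
    Returns (region_buffers, remaining), each mapping a coefficient index
    to the buffer holding its temporally successive values.
    """
    num_coeffs = (hoa_order + 1) ** 2
    if len(packet) % num_coeffs:
        raise ValueError("packet length is not a multiple of the coefficient count")
    # One buffer per coefficient: every num_coeffs-th value belongs together.
    buffers = [packet[c::num_coeffs] for c in range(num_coeffs)]
    covered = set()
    region_buffers = []
    for first_order, last_order in regions:
        indices = range(first_order ** 2, (last_order + 1) ** 2)
        covered.update(indices)
        region_buffers.append({c: buffers[c] for c in indices})
    # Orders not covered by any region go to the standard processing path.
    remaining = {c: buffers[c] for c in range(num_coeffs) if c not in covered}
    return region_buffers, remaining
```

The function returns one group per TrackRegion plus one group of remaining coefficients, which mirrors the count of TrackNumberOfOrderRegions plus 'one' processing paths.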
  • the decoding processing is inverse to the encoding processing.
  • the de-multiplexer step or stage 134 de-multiplexes the TrackPacket data file or stream from the indicated TrackHOAParamCoeffSequence into de-interleaved HOA coefficient buffers (not depicted).
  • Each buffer contains FramePacketLength coefficients $C_n^m$ for one combination of n and m.
  • Step/stage 134 initialises TrackNumberOfOrderRegion plus 'one' processing paths and passes the content of the de-interleaved HOA coefficient buffers to the appropriate processing path.
  • the coefficients of each TrackRegion are defined by the TrackRegionLastOrder and TrackRegionFirstOrder fields of the HOA Track Header.
  • HOA orders that are not covered by the selected TrackRegions are processed in the standard processing path including a format conversion step or stage 136 and a remaining coefficients interleaving step or stage 135.
  • the standard processing path corresponds to a TrackProcessing path without a bandwidth reduction step or stage.
  • a format conversion step/stage 1331 to 133N converts the HOA coefficients that are encoded in the TrackRegionSampleFormat into the data format that is used for the processing of the decoder.
  • an optional bandwidth reconstruction step or stage 1321 to 132N follows in which the band-limited and critically sampled HOA coefficients are reconstructed to the full bandwidth of the Track.
  • the kind of reconstruction processing is defined in the TrackBandwidthReductionType field of the HOA Track Header.
  • the content of the de-interleaved buffers of HOA coefficients are interleaved by grouping HOA coefficients of one time sample, and the HOA coefficients of the current TrackRegion are combined with the HOA coefficients of the previous
  • the resulting sequence of the HOA coefficients can be adapted to the processing of the Track. Furthermore, the interleaving steps/stages deal with the delays between the TrackRegions using bandwidth reduction and TrackRegions not using bandwidth reduction, which delay depends on the selected TrackBandwidthReductionType processing. For example, the MDCT processing adds a delay of FramePacketSize samples and therefore the interleaving steps/stages of processing paths without bandwidth reduction will delay their output by one packet.
  • Fig. 14 shows bandwidth reduction using MDCT (modified discrete cosine transform) processing.
  • Each HOA coefficient of the TrackRegion of FramePacketSize samples passes via a buffer 1411 to 141M to a corresponding MDCT window adding step or stage 1421 to 142M.
  • Each input buffer contains the temporally successive HOA coefficients $C_n^m$ of one combination of n and m, i.e., one buffer is defined as
  • the number M of buffers is the same as the number of Ambisonics components ($(N+1)^2$ for a full 3D sound field of order $N$).
  • the buffer handling performs a 50% overlap for the following MDCT processing by combining the previous buffer content with the current buffer content into a new content for the MDCT processing in corresponding steps or stages 1431 to 143M, and it stores the current buffer content for the processing of the following buffer content.
  • the MDCT processing re-starts at the beginning of each Frame, which means that all coefficients of a Track of the current Frame can be decoded without knowledge of the previous Frame, and following the last buffer content of the current Frame an additional buffer content of zeros is processed. Therefore the MDCT processed TrackRegions produce one extra TrackPacket.
  • the corresponding buffer content is multiplied with the selected window function w(t), which is defined in the HOATrack header field TrackRegionWindowType for each TrackRegion.
  • the Modified Discrete Cosine Transform is first mentioned in J.P. Princen, A.B. Bradley, "Analysis/Synthesis Filter Bank Design Based on Time Domain Aliasing Cancellation", IEEE Transactions on Acoustics, Speech and Signal Processing, vol. ASSP-34, no. 5, pages 1153-1161, October 1986.
  • the MDCT can be considered as representing a critically sampled filter bank of FramePacketSize subbands, and it requires a 50% input buffer overlap.
  • the input buffer has a length of twice the subband size.
  • the MDCT is defined by the following equation with T equal to FramePacketSize:
  • the coefficients $C'^{m}_{n}(k)$ are called MDCT bins.
  • the MDCT computation can be implemented using the Fast Fourier Transform.
  • the bandwidth reduction is performed by removing all MDCT bins $C'^{m}_{n}(k)$ with k < TrackRegionFirstBin and k > TrackRegionLastBin, for the reduction of the buffer length to TrackRegionLastBin - TrackRegionFirstBin + 1, wherein TrackRegionFirstBin is the lower cut-off frequency for the TrackRegion and TrackRegionLastBin is the upper cut-off frequency.
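To make the relation between cut-off frequencies and bin indices more concrete, the sketch below picks a first and last bin for a desired frequency band and then removes the other bins. It assumes that the FramePacketSize subbands divide the range from 0 Hz to half the sample rate uniformly, so that bin k covers roughly k to k + 1 times the bin width; the function names and the rounding policy are hypothetical and not taken from the format definition.

```python
import math


def bins_for_band(f_low_hz, f_high_hz, sample_rate_hz, frame_packet_size):
    """Choose (first_bin, last_bin) so that the band [f_low, f_high) is kept."""
    bin_width_hz = sample_rate_hz / (2.0 * frame_packet_size)
    first_bin = int(math.floor(f_low_hz / bin_width_hz))
    last_bin = int(math.ceil(f_high_hz / bin_width_hz)) - 1
    last_bin = min(last_bin, frame_packet_size - 1)
    if not 0 <= first_bin <= last_bin:
        raise ValueError("empty or invalid frequency band")
    return first_bin, last_bin


def reduce_bandwidth(mdct_bins, first_bin, last_bin):
    """Keep only the bins first_bin..last_bin (inclusive) of one buffer."""
    return list(mdct_bins[first_bin:last_bin + 1])
```

For a sample rate of 48 kHz and 1024 bins per packet, keeping the band up to 12 kHz selects bins 0 to 511, which halves the number of values stored per packet.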
  • Fig. 15 shows bandwidth decoding or reconstruction using MDCT processing, in which HOA coefficients of bandwidth limited TrackRegions are reconstructed to the full bandwidths of the Track.
  • This bandwidth reconstruction processes buffer content of temporally de-interleaved HOA coefficients in parallel, wherein each buffer contains TrackRegionLastBin - TrackRegionFirstBin + 1 MDCT bins of coefficients $C'^{m}_{n}(k)$.
  • the missing frequency regions adding steps or stages 1541 to 154M reconstruct the complete MDCT buffer content of size FramePacketLength by complementing the received MDCT bins with the missing MDCT bins k < TrackRegionFirstBin and
  • Inverse MDCT can be interpreted as a synthesis filter bank wherein FramePacketLength MDCT bins are converted to two times FramePacketLength time domain coefficients.
  • the complete reconstruction of the time domain samples requires a multiplication with the window function w(t) used in the encoder and an overlap-add of the first half of the current buffer content with the second half of the previous buffer content.
  • the inverse MDCT is defined by the following equation:
  • the inverse MDCT can be implemented using the inverse Fast Fourier Transform.
  • the MDCT window adding steps or stages 1521 to 152M multiply the reconstructed time domain coefficients with the window function defined by the TrackRegionWindowType.
  • the following buffers 1511 to 151M add the first half of the current TrackPacket buffer content to the second half of the last TrackPacket buffer content in order to reconstruct FramePacketSize time domain coefficients.
  • the second half of the current TrackPacket buffer content is stored for the processing of the following TrackPacket, which overlap-add processing removes the contrary aliasing components of both buffer contents.
  • the encoder is prohibited from using the last buffer content of the previous Frame for the overlap-add procedure at the beginning of a new Frame. Therefore at Frame borders or at the beginning of a new Frame the overlap-add buffer content is missing, and the reconstruction of the first TrackPacket of a Frame can be performed at the second TrackPacket, whereby a delay of one FramePacket and decoding of one extra TrackPacket is introduced as compared to the processing paths without bandwidth reduction. This delay is handled by the interleaving steps/stages described in connection with Fig. 13.
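The per-Frame processing described above (restart at the Frame border, one extra zero buffer, bin removal, zero-filling, windowing and overlap-add) can be sketched for a single coefficient buffer as follows. This is a minimal sketch under stated assumptions, not the equations of the format: it uses the textbook MDCT and inverse MDCT, whose phase and scaling may differ from the definitions given for this format, a sine window as one possible window type, an all-zero initial buffer at the Frame start, and a direct sum instead of an FFT. All function names are hypothetical.

```python
import math


def sine_window(T):
    # Length-2T window with w(t)^2 + w(t+T)^2 = 1 (assumed window type).
    return [math.sin(math.pi * (t + 0.5) / (2 * T)) for t in range(2 * T)]


def _basis(t, k, T):
    return math.cos(math.pi / T * (t + 0.5 + T / 2) * (k + 0.5))


def mdct(block, window):
    # 2T windowed time samples -> T MDCT bins (textbook form).
    T = len(block) // 2
    return [sum(window[t] * block[t] * _basis(t, k, T) for t in range(2 * T))
            for k in range(T)]


def imdct(bins, window):
    # T MDCT bins -> 2T windowed, time-aliased samples.
    T = len(bins)
    return [window[t] * (2.0 / T) * sum(bins[k] * _basis(t, k, T) for k in range(T))
            for t in range(2 * T)]


def encode_frame(samples, T, first_bin, last_bin):
    """Encode one Frame; len(samples) must be a multiple of T."""
    window = sine_window(T)
    chunks = [list(samples[i:i + T]) for i in range(0, len(samples), T)]
    chunks.append([0.0] * T)          # extra zero buffer -> one extra TrackPacket
    previous = [0.0] * T              # restart at the Frame border (zeros assumed)
    packets = []
    for current in chunks:
        bins = mdct(previous + current, window)      # 50% overlap with previous buffer
        packets.append(bins[first_bin:last_bin + 1])  # remove bins outside the band
        previous = current
    return packets


def decode_frame(packets, T, first_bin, last_bin):
    """Decode one Frame; output starts with the second packet (one packet delay)."""
    window = sine_window(T)
    stored = None
    output = []
    for reduced in packets:
        # Complement the received bins with zeros for the removed bins.
        bins = [0.0] * first_bin + list(reduced) + [0.0] * (T - 1 - last_bin)
        block = imdct(bins, window)
        if stored is not None:
            # Overlap-add: first half of current block + stored second half.
            output.extend(a + b for a, b in zip(block[:T], stored))
        stored = block[T:]
    return output
```

When no bins are removed (first bin 0, last bin T - 1), encoding a Frame of two packets yields three packets, and decoding them returns the original samples up to rounding error, which illustrates the aliasing cancellation of the overlap-add step.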

Landscapes

  • Engineering & Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Acoustics & Sound (AREA)
  • Signal Processing (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Health & Medical Sciences (AREA)
  • Computational Linguistics (AREA)
  • Human Computer Interaction (AREA)
  • Multimedia (AREA)
  • Mathematical Physics (AREA)
  • Stereophonic System (AREA)
  • Obtaining Desirable Characteristics In Audible-Bandwidth Transducers (AREA)
  • Circuit For Audible Band Transducer (AREA)
PCT/EP2011/068782 2010-11-05 2011-10-26 Data structure for higher order ambisonics audio data Ceased WO2012059385A1 (en)

Priority Applications (8)

Application Number Priority Date Filing Date Title
US13/883,094 US9241216B2 (en) 2010-11-05 2011-10-26 Data structure for higher order ambisonics audio data
CN201180053153.7A CN103250207B (zh) 2010-11-05 2011-10-26 高阶高保真度立体声响复制音频数据的数据结构
JP2013537071A JP5823529B2 (ja) 2010-11-05 2011-10-26 高次アンビソニックス・オーディオ・データ用のデータ構造
BR112013010754-5A BR112013010754B1 (pt) 2010-11-05 2011-10-26 Estrutura de dados para dados de áudio ambisonics de ordens elevadas, método para codificar e dispor dados para uma estrutura de dados, método para apresentação de áudio e aparelho para apresentação de áudio
EP11776422.5A EP2636036B1 (en) 2010-11-05 2011-10-26 Data structure for higher order ambisonics audio data
KR1020137011661A KR101824287B1 (ko) 2010-11-05 2011-10-26 고차 앰비소닉 오디오 데이터를 위한 데이터 구조
HK14102354.0A HK1189297B (en) 2010-11-05 2011-10-26 Data structure for higher order ambisonics audio data
AU2011325335A AU2011325335B8 (en) 2010-11-05 2011-10-26 Data structure for Higher Order Ambisonics audio data

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
EP10306211.3 2010-11-05
EP10306211A EP2450880A1 (en) 2010-11-05 2010-11-05 Data structure for Higher Order Ambisonics audio data

Publications (1)

Publication Number Publication Date
WO2012059385A1 true WO2012059385A1 (en) 2012-05-10

Family

ID=43806783

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/EP2011/068782 Ceased WO2012059385A1 (en) 2010-11-05 2011-10-26 Data structure for higher order ambisonics audio data

Country Status (9)

Country Link
US (1) US9241216B2 (en)
EP (2) EP2450880A1 (en)
JP (1) JP5823529B2 (en)
KR (1) KR101824287B1 (en)
CN (1) CN103250207B (en)
AU (1) AU2011325335B8 (en)
BR (1) BR112013010754B1 (en)
PT (1) PT2636036E (en)
WO (1) WO2012059385A1 (en)

Cited By (26)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
EP2733963A1 (en) 2012-11-14 2014-05-21 Thomson Licensing Method and apparatus for facilitating listening to a sound signal for matrixed sound signals
WO2014134462A3 (en) * 2013-03-01 2014-11-13 Qualcomm Incorporated Specifying spherical harmonic and/or higher order ambisonics coefficients in bitstreams
US20140358558A1 (en) * 2013-05-29 2014-12-04 Qualcomm Incorporated Identifying sources from which higher order ambisonic audio data is generated
JP2015525897A (ja) * 2012-07-15 2015-09-07 クゥアルコム・インコーポレイテッドQualcomm Incorporated 後方互換性のあるオーディオ符号化のためのシステム、方法、装置、およびコンピュータ可読媒体
KR20150134336A (ko) * 2013-03-22 2015-12-01 톰슨 라이센싱 1차 앰비소닉스 신호의 지향성을 강화하기 위한 방법 및 장치
KR20160002846A (ko) * 2013-04-29 2016-01-08 톰슨 라이센싱 고차 앰비소닉스 표현을 압축 및 압축해제하기 위한 방법 및 장치
CN105325015A (zh) * 2013-05-29 2016-02-10 高通股份有限公司 经旋转高阶立体混响的双耳化
JP2016509812A (ja) * 2013-02-08 2016-03-31 トムソン ライセンシングThomson Licensing 音場の高次アンビソニクス表現における無相関な音源の方向を決定する方法及び装置
JP2016524883A (ja) * 2013-06-18 2016-08-18 ドルビー ラボラトリーズ ライセンシング コーポレイション オーディオ・レンダリングのためのベース管理
US9451363B2 (en) 2012-03-06 2016-09-20 Dolby Laboratories Licensing Corporation Method and apparatus for playback of a higher-order ambisonics audio signal
CN105981100A (zh) * 2014-01-08 2016-09-28 杜比国际公司 用于改善对声场的高阶高保真度立体声响复制表示进行编码所需的边信息的编码的方法和装置
US9466305B2 (en) 2013-05-29 2016-10-11 Qualcomm Incorporated Performing positional analysis to code spherical harmonic coefficients
US9489955B2 (en) 2014-01-30 2016-11-08 Qualcomm Incorporated Indicating frame parameter reusability for coding vectors
US9620137B2 (en) 2014-05-16 2017-04-11 Qualcomm Incorporated Determining between scalar and vector quantization in higher order ambisonic coefficients
US9641834B2 (en) 2013-03-29 2017-05-02 Qualcomm Incorporated RTP payload format designs
US9747910B2 (en) 2014-09-26 2017-08-29 Qualcomm Incorporated Switching between predictive and non-predictive quantization techniques in a higher order ambisonics (HOA) framework
CN107180637A (zh) * 2012-05-14 2017-09-19 杜比国际公司 压缩和解压缩高阶高保真度立体声响复制信号表示的方法及装置
US9852737B2 (en) 2014-05-16 2017-12-26 Qualcomm Incorporated Coding vectors decomposed from higher-order ambisonics audio signals
CN107533843A (zh) * 2015-01-30 2018-01-02 Dts公司 用于捕获、编码、分布和解码沉浸式音频的系统和方法
US9922656B2 (en) 2014-01-30 2018-03-20 Qualcomm Incorporated Transitioning of ambient higher-order ambisonic coefficients
US9936321B2 (en) 2014-03-24 2018-04-03 Dolby Laboratories Licensing Corporation Method and device for applying dynamic range compression to a higher order ambisonics signal
CN108632736A (zh) * 2013-10-23 2018-10-09 杜比国际公司 用于音频信号呈现的方法和装置
CN109545235A (zh) * 2012-12-12 2019-03-29 杜比国际公司 对声场的高阶立体混响表示进行压缩和解压缩的方法和设备
JP2019113858A (ja) * 2013-07-11 2019-07-11 ドルビー・インターナショナル・アーベー Hoa信号の係数領域表現からこのhoa信号の混合した空間/係数領域表現を生成する方法および装置
US10542364B2 (en) 2014-03-21 2020-01-21 Dolby Laboratories Licensing Corporation Methods, apparatus and systems for decompressing a higher order ambisonics (HOA) signal
US10770087B2 (en) 2014-05-16 2020-09-08 Qualcomm Incorporated Selecting codebooks for coding vectors decomposed from higher-order ambisonic audio signals

Families Citing this family (87)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
EP2469741A1 (en) * 2010-12-21 2012-06-27 Thomson Licensing Method and apparatus for encoding and decoding successive frames of an ambisonics representation of a 2- or 3-dimensional sound field
DE102012200512B4 (de) * 2012-01-13 2013-11-14 Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. Vorrichtung und Verfahren zum Berechnen von Lautsprechersignalen für eine Mehrzahl von Lautsprechern unter Verwendung einer Verzögerung im Frequenzbereich
EP2645748A1 (en) 2012-03-28 2013-10-02 Thomson Licensing Method and apparatus for decoding stereo loudspeaker signals from a higher-order Ambisonics audio signal
EP2688066A1 (en) * 2012-07-16 2014-01-22 Thomson Licensing Method and apparatus for encoding multi-channel HOA audio signals for noise reduction, and method and apparatus for decoding multi-channel HOA audio signals for noise reduction
KR102597573B1 (ko) 2012-07-16 2023-11-02 돌비 인터네셔널 에이비 오디오 재생을 위한 오디오 음장 표현을 렌더링하는 방법 및 장치
EP2875511B1 (en) 2012-07-19 2018-02-21 Dolby International AB Audio coding for improving the rendering of multi-channel audio signals
EP2898506B1 (en) * 2012-09-21 2018-01-17 Dolby Laboratories Licensing Corporation Layered approach to spatial audio coding
KR102115345B1 (ko) * 2013-01-16 2020-05-26 돌비 인터네셔널 에이비 Hoa 라우드니스 레벨을 측정하기 위한 방법 및 hoa 라우드니스 레벨을 측정하기 위한 장치
US9736609B2 (en) * 2013-02-07 2017-08-15 Qualcomm Incorporated Determining renderers for spherical harmonic coefficients
US9609452B2 (en) 2013-02-08 2017-03-28 Qualcomm Incorporated Obtaining sparseness information for higher order ambisonic audio renderers
US9883310B2 (en) 2013-02-08 2018-01-30 Qualcomm Incorporated Obtaining symmetry information for higher order ambisonic audio renderers
US10178489B2 (en) * 2013-02-08 2019-01-08 Qualcomm Incorporated Signaling audio rendering information in a bitstream
JP5734329B2 (ja) * 2013-02-28 2015-06-17 日本電信電話株式会社 音場収音再生装置、方法及びプログラム
JP5734327B2 (ja) * 2013-02-28 2015-06-17 日本電信電話株式会社 音場収音再生装置、方法及びプログラム
JP5734328B2 (ja) * 2013-02-28 2015-06-17 日本電信電話株式会社 音場収音再生装置、方法及びプログラム
US9412385B2 (en) * 2013-05-28 2016-08-09 Qualcomm Incorporated Performing spatial masking with respect to spherical harmonic coefficients
CN105340008B (zh) * 2013-05-29 2019-06-14 高通股份有限公司 声场的经分解表示的压缩
JP6186900B2 (ja) 2013-06-04 2017-08-30 ソニー株式会社 固体撮像装置、電子機器、レンズ制御方法、および撮像モジュール
CN105264595B (zh) * 2013-06-05 2019-10-01 杜比国际公司 用于编码和解码音频信号的方法和装置
EP2830332A3 (en) 2013-07-22 2015-03-11 Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. Method, signal processing unit, and computer program for mapping a plurality of input channels of an input channel configuration to output channels of an output channel configuration
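JP6458738B2 (ja) * 2013-11-19 2019-01-30 ソニー株式会社 Sound field reproduction apparatus and method, and program
CN103618986B (zh) * 2013-11-19 2015-09-30 深圳市新一代信息技术研究院有限公司 Method and apparatus for extracting the sound image body of a sound source in 3D space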
EP2879408A1 (en) * 2013-11-28 2015-06-03 Thomson Licensing Method and apparatus for higher order ambisonics encoding and decoding using singular value decomposition
US10020000B2 (en) * 2014-01-03 2018-07-10 Samsung Electronics Co., Ltd. Method and apparatus for improved ambisonic decoding
US20150243292A1 (en) * 2014-02-25 2015-08-27 Qualcomm Incorporated Order format signaling for higher-order ambisonic audio data
CN109410962B (zh) * 2014-03-21 2023-06-06 杜比国际公司 Method, apparatus, and storage medium for decoding a compressed HOA signal
EP4089674B1 (en) * 2014-03-21 2024-10-30 Dolby International AB Method for decompressing a compressed HOA signal and apparatus for decompressing a compressed HOA signal
US10412522B2 (en) * 2014-03-21 2019-09-10 Qualcomm Incorporated Inserting audio channels into descriptions of soundfields
EP2928216A1 (en) 2014-03-26 2015-10-07 Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. Apparatus and method for screen related audio object remapping
WO2015152666A1 (ko) * 2014-04-02 2015-10-08 삼성전자 주식회사 Method and apparatus for decoding an audio signal including an HOA signal
US20150332682A1 (en) * 2014-05-16 2015-11-19 Qualcomm Incorporated Spatial relation coding for higher order ambisonic coefficients
RU2699406C2 (ru) * 2014-05-30 2019-09-05 Сони Корпорейшн Information processing apparatus and information processing method
CN110827839B (zh) * 2014-05-30 2023-09-19 高通股份有限公司 Apparatus and method for rendering higher-order ambisonic coefficients
EP3855766A1 (en) * 2014-06-27 2021-07-28 Dolby International AB Coded HOA data frame representation that includes non-differential gain values associated with channel signals of specific ones of the data frames of an HOA data frame representation
JP6641303B2 (ja) * 2014-06-27 2020-02-05 ドルビー・インターナショナル・アーベー Apparatus for determining, for the compression of an HOA data frame representation, a lowest integer number of bits required for representing non-differential gain values
CN117612540A (zh) * 2014-06-27 2024-02-27 杜比国际公司 Method for decoding a Higher Order Ambisonics (HOA) representation of a sound or sound field
EP2960903A1 (en) 2014-06-27 2015-12-30 Thomson Licensing Method and apparatus for determining for the compression of an HOA data frame representation a lowest integer number of bits required for representing non-differential gain values
CN113851139A (zh) * 2014-06-30 2021-12-28 索尼公司 Information processing apparatus and information processing method
US9838819B2 (en) * 2014-07-02 2017-12-05 Qualcomm Incorporated Reducing correlation between higher order ambisonic (HOA) background channels
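JP2017523454A (ja) * 2014-07-02 2017-08-17 ドルビー・インターナショナル・アーベー Method and apparatus for encoding/decoding directions of dominant directional signals within subbands of an HOA signal representation
CN106463131B (zh) * 2014-07-02 2020-12-08 杜比国际公司 Method and apparatus for encoding/decoding directions of dominant directional signals within subbands of an HOA signal representation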
US9536531B2 (en) * 2014-08-01 2017-01-03 Qualcomm Incorporated Editing of higher-order ambisonic audio data
US9847088B2 (en) * 2014-08-29 2017-12-19 Qualcomm Incorporated Intermediate compression for higher order ambisonic audio data
US9875745B2 (en) 2014-10-07 2018-01-23 Qualcomm Incorporated Normalization of ambient higher order ambisonic audio data
EP3007167A1 (en) * 2014-10-10 2016-04-13 Thomson Licensing Method and apparatus for low bit rate compression of a Higher Order Ambisonics HOA signal representation of a sound field
US10140996B2 (en) 2014-10-10 2018-11-27 Qualcomm Incorporated Signaling layers for scalable coding of higher order ambisonic audio data
GB2532034A (en) * 2014-11-05 2016-05-11 Lee Smiles Aaron A 3D visual-audio data comprehension method
US9712936B2 (en) * 2015-02-03 2017-07-18 Qualcomm Incorporated Coding higher-order ambisonic audio data with motion stabilization
WO2016182184A1 (ko) * 2015-05-08 2016-11-17 삼성전자 주식회사 Stereophonic sound reproduction method and apparatus
JP6466251B2 (ja) * 2015-05-20 2019-02-06 アルパイン株式会社 Sound field reproduction system
TWI607655B (zh) 2015-06-19 2017-12-01 Sony Corp Coding apparatus and method, decoding apparatus and method, and program
US9961475B2 (en) 2015-10-08 2018-05-01 Qualcomm Incorporated Conversion from object-based audio to HOA
US9961467B2 (en) 2015-10-08 2018-05-01 Qualcomm Incorporated Conversion from channel-based audio to HOA
US10249312B2 (en) * 2015-10-08 2019-04-02 Qualcomm Incorporated Quantization of spatial vectors
CN105895111A (zh) * 2015-12-15 2016-08-24 乐视致新电子科技(天津)有限公司 Android-based audio content processing method and device
CN108496221B (zh) 2016-01-26 2020-01-21 杜比实验室特许公司 Adaptive quantization
EP3209036A1 (en) 2016-02-19 2017-08-23 Thomson Licensing Method, computer readable storage medium, and apparatus for determining a target sound scene at a target position from two or more source sound scenes
EP3232688A1 (en) 2016-04-12 2017-10-18 Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. Apparatus and method for providing individual sound zones
US10074012B2 (en) 2016-06-17 2018-09-11 Dolby Laboratories Licensing Corporation Sound and video object tracking
CN106340301B (zh) * 2016-09-13 2020-01-24 广州酷狗计算机科技有限公司 Audio playback method and apparatus
US11032663B2 (en) 2016-09-29 2021-06-08 The Trustees Of Princeton University System and method for virtual navigation of sound fields through interpolation of signals from an array of microphone assemblies
US10158963B2 (en) * 2017-01-30 2018-12-18 Google Llc Ambisonic audio with non-head tracked stereo based on head position and time
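KR20180090022A (ko) * 2017-02-02 2018-08-10 한국전자통신연구원 Method for providing virtual reality based on multiple omnidirectional cameras and microphones, and sound signal processing apparatus and image signal processing apparatus performing the method for providing virtual reality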
EP3627850A4 (en) * 2017-05-16 2020-05-06 Sony Corporation SPEAKER ARRAY AND SIGNAL PROCESSOR
US10390166B2 (en) * 2017-05-31 2019-08-20 Qualcomm Incorporated System and method for mixing and adjusting multi-input ambisonics
CN114895785A (zh) * 2017-06-15 2022-08-12 杜比国际公司 System comprising an apparatus for reproducing and storing media content, and related apparatus
US10405126B2 (en) 2017-06-30 2019-09-03 Qualcomm Incorporated Mixed-order ambisonics (MOA) audio data for computer-mediated reality systems
JP7122793B2 (ja) * 2017-07-14 2022-08-22 フラウンホーファー-ゲゼルシャフト・ツール・フェルデルング・デル・アンゲヴァンテン・フォルシュング・アインゲトラーゲネル・フェライン Concept for generating an enhanced sound field description or a modified sound field description using a depth-extended DirAC technique or other techniques
MY204183A (en) * 2017-07-14 2024-08-14 Fraunhofer Ges Zur Förderung Der Angewandten Forschung E V Concept for generating an enhanced sound field description or a modified sound field description using a multi-point sound field description
KR102652670B1 (ko) * 2017-07-14 2024-04-01 프라운호퍼 게젤샤프트 쭈르 푀르데룽 데어 안겐반텐 포르슝 에. 베. Concept for generating an enhanced sound field description or a modified sound field description using a multi-layer description
CN109756683B (zh) * 2017-11-02 2024-06-04 深圳市裂石影音科技有限公司 Panoramic audio and video recording method, apparatus, storage medium, and computer device
CN107920303B (zh) * 2017-11-21 2019-12-24 北京时代拓灵科技有限公司 Audio capture method and apparatus
US10595146B2 (en) * 2017-12-21 2020-03-17 Verizon Patent And Licensing Inc. Methods and systems for extracting location-diffused ambient sound from a real-world scene
US10264386B1 (en) * 2018-02-09 2019-04-16 Google Llc Directional emphasis in ambisonics
KR102637876B1 (ko) * 2018-04-10 2024-02-20 가우디오랩 주식회사 Method and apparatus for processing audio signals using metadata
GB2574238A (en) 2018-05-31 2019-12-04 Nokia Technologies Oy Spatial audio parameter merging
GB2576769A (en) 2018-08-31 2020-03-04 Nokia Technologies Oy Spatial parameter signalling
KR102323529B1 (ko) 2018-12-17 2021-11-09 한국전자통신연구원 Method and apparatus for processing audio signals using mixed-order ambisonics
GB2582910A (en) * 2019-04-02 2020-10-14 Nokia Technologies Oy Audio codec extension
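CN114127843B (zh) 2019-07-02 2023-08-11 杜比国际公司 Methods, apparatus, and systems for representation, encoding, and decoding of discrete directivity data
JP2022541291A (ja) 2019-07-19 2022-09-22 エヴァテック・アーゲー Piezoelectric coating and deposition process
JP7285434B2 (ja) * 2019-08-08 2023-06-02 日本電信電話株式会社 Speaker array, signal processing apparatus, signal processing method, and signal processing program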
US10735887B1 (en) * 2019-09-19 2020-08-04 Wave Sciences, LLC Spatial audio array processing system and method
US11430451B2 (en) * 2019-09-26 2022-08-30 Apple Inc. Layered coding of audio with discrete objects
RU2751440C1 (ru) * 2020-10-19 2021-07-13 Федеральное государственное бюджетное образовательное учреждение высшего образования «Московский государственный университет имени М.В.Ломоносова» (МГУ) System for holographic recording and reproduction of audio information
CN115226001B (zh) * 2021-11-24 2024-05-03 广州汽车集团股份有限公司 Sound energy compensation method, apparatus, and computer device
US20240298130A1 (en) * 2023-03-03 2024-09-05 Sony Interactive Entertainment Inc. Systems and methods for generating and applying audio-based basis functions

Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US4042779A (en) 1974-07-12 1977-08-16 National Research Development Corporation Coincident microphone simulation covering three dimensional space and yielding various directional outputs
WO2003061336A1 (en) 2002-01-11 2003-07-24 Mh Acoustics, Llc Audio system based on at least second-order eigenbeams
EP2205007A1 (en) * 2008-12-30 2010-07-07 Fundació Barcelona Media Universitat Pompeu Fabra Method and apparatus for three-dimensional acoustic field encoding and optimal reconstruction

Family Cites Families (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5956674A (en) 1995-12-01 1999-09-21 Digital Theater Systems, Inc. Multi-channel predictive subband audio coder using psychoacoustic adaptive bit allocation in frequency, time and over the multiple channels
FR2858403B1 (fr) 2003-07-31 2005-11-18 Remy Henri Denis Bruno System and method for determining a representation of an acoustic field
CN1677490A (zh) * 2004-04-01 2005-10-05 北京宫羽数字技术有限责任公司 Enhanced audio encoding and decoding apparatus and method
JP5023662B2 (ja) * 2006-11-06 2012-09-12 ソニー株式会社 Signal processing system, signal transmitting apparatus, signal receiving apparatus, and program
EP2451196A1 (en) * 2010-11-05 2012-05-09 Thomson Licensing Method and apparatus for generating and for decoding sound field data including ambisonics sound field data of an order higher than three

Non-Patent Citations (14)

* Cited by examiner, † Cited by third party
Title
"Associated Legendre polynomials", WIKIPEDIA, 12 October 2010 (2010-10-12), Retrieved from the Internet <URL:http://en.wikipedia .org/W/7/index.php?title=Associated Legendre polynomials&oldid =363001511>
CHRIS TRAVIS, FOUR CANDIDATE COMPONENT SEQUENCES, 2008, Retrieved from the Internet <URL:http://ambisonics.googlegroups.com/web/Four +candidate+component+sequences+V09.pdf>
DANIEL J ET AL: "Further Investigations of High Order Ambisonics and Wavefield Synthesis for Holophonic Sound Imaging", 114TH AES CONVENTION, AUDIO ENGINEERING SOCIETY, 22 March 2003 (2003-03-22) - 24 March 2003 (2003-03-24), XP040372092 *
DAVE MALHAM, 3-D ACOUSTIC SPACE AND ITS SIMULATION USING AMBISONICS, Retrieved from the Internet <URL:http://www.dxarts.washington.edu/courses/567 /current/malham 3d.pdf.>
EARL G. WILLIAMS: "Fourier Acoustics", 1999, ACADEMIC PRESS
J.P. PRINCEN, A.B. BRADLEY: "Analysis/Synthesis Filter Bank Design Based on Time Domain Aliasing Cancellation", IEEE TRANSACTIONS ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, vol. ASSP-34, no. 5, October 1986 (1986-10-01), pages 1153 - 1161, XP001617002
JENS AHRENS, SASCHA SPORS: "Analytical driving functions for higher order ambisonics", PROCEEDINGS OF THE ICASSP, 2008, pages 373 - 376, XP031250566
JÉRÔME DANIEL: "Représentation de champs acoustiques, application à la transmission et à la reproduction de scènes sonores complexes dans un contexte multimédia", PHD THESIS, 2001
JÉRÔME DANIEL: "Spatial sound encoding including near field effect: Introducing distance coding filters and a viable, new ambisonic format", AES 23RD INTERNATIONAL CONFERENCE, May 2003 (2003-05-01)
M.A. GERSON: "General metatheory of auditory localisation", 92ND AES CONVENTION, 1992, pages 3306
M.A. POLETTI: "Three-dimensional surround sound systems based on spherical harmonics", JOURNAL OF AUDIO ENGINEERING SOCIETY, vol. 53, no. 11, November 2005 (2005-11-01), pages 1004 - 1025
MARK POLETTI: "Unified description of Ambisonics using real and complex spherical harmonics", PROCEEDINGS OF THE AMBISONICS SYMPOSIUM 2009, June 2009 (2009-06-01)
MILLER R E: "Scalable Tri-play Recording for Stereo, ITU 5.1/6.1 2D, and Periphonic 3D (with Height) Compatible Surround Sound Reproduction", 115TH AES CONVENTION, AUDIO ENGINEERING SOCIETY, 10 October 2003 (2003-10-10) - 13 October 2003 (2003-10-13), XP040372301 *
WILLIAM H. PRESS, SAUL A. TEUKOLSKY, WILLIAM T. VETTERLING, BRIAN P. FLANNERY: "Numerical Recipes in C", 1992, CAMBRIDGE UNIVERSITY PRESS

Cited By (132)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US11228856B2 (en) 2012-03-06 2022-01-18 Dolby Laboratories Licensing Corporation Method and apparatus for screen related adaptation of a higher-order ambisonics audio signal
US9451363B2 (en) 2012-03-06 2016-09-20 Dolby Laboratories Licensing Corporation Method and apparatus for playback of a higher-order ambisonics audio signal
US11895482B2 (en) 2012-03-06 2024-02-06 Dolby Laboratories Licensing Corporation Method and apparatus for screen related adaptation of a Higher-Order Ambisonics audio signal
JP2017175632A (ja) * 2012-03-06 2017-09-28 ドルビー・インターナショナル・アーベー Method and apparatus for playback of a higher-order ambisonics audio signal
US11570566B2 (en) 2012-03-06 2023-01-31 Dolby Laboratories Licensing Corporation Method and apparatus for screen related adaptation of a Higher-Order Ambisonics audio signal
JP2024156988A (ja) * 2012-03-06 2024-11-06 ドルビー・インターナショナル・アーベー Method and apparatus for playback of a higher-order ambisonics audio signal
JP7678917B2 (ja) 2012-03-06 2025-05-16 ドルビー・インターナショナル・アーベー Method and apparatus for playback of a higher-order ambisonics audio signal
JP2019193292A (ja) * 2012-03-06 2019-10-31 ドルビー・インターナショナル・アーベー Method and apparatus for playback of a higher-order ambisonics audio signal
US12317059B2 (en) 2012-03-06 2025-05-27 Dolby Laboratories Licensing Corporation Method and apparatus for screen related adaptation of a higher-order ambisonics audio signal
JP2018137799A (ja) * 2012-03-06 2018-08-30 ドルビー・インターナショナル・アーベー Method and apparatus for playback of a higher-order ambisonics audio signal
US10771912B2 (en) 2012-03-06 2020-09-08 Dolby Laboratories Licensing Corporation Method and apparatus for screen related adaptation of a higher-order ambisonics audio signal
US10299062B2 (en) 2012-03-06 2019-05-21 Dolby Laboratories Licensing Corporation Method and apparatus for playback of a higher-order ambisonics audio signal
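JP2019133175A (ja) * 2012-05-14 2019-08-08 ドルビー・インターナショナル・アーベー Method or apparatus for compressing or decompressing a higher-order ambisonics signal representation
JP7090119B2 (ja) 2012-05-14 2022-06-23 ドルビー・インターナショナル・アーベー Method or apparatus for compressing or decompressing a higher-order ambisonics signal representation
CN107180637A (zh) * 2012-05-14 2017-09-19 杜比国际公司 Method and apparatus for compressing and decompressing a higher-order ambisonics signal representation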
US11792591B2 (en) 2012-05-14 2023-10-17 Dolby Laboratories Licensing Corporation Method and apparatus for compressing and decompressing a higher order Ambisonics signal representation
JP2020144384A (ja) * 2012-05-14 2020-09-10 ドルビー・インターナショナル・アーベー Method or apparatus for compressing or decompressing a higher-order ambisonics signal representation
US10390164B2 (en) 2012-05-14 2019-08-20 Dolby Laboratories Licensing Corporation Method and apparatus for compressing and decompressing a higher order ambisonics signal representation
US11234091B2 (en) 2012-05-14 2022-01-25 Dolby Laboratories Licensing Corporation Method and apparatus for compressing and decompressing a Higher Order Ambisonics signal representation
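CN107180638B (zh) * 2012-05-14 2021-01-15 杜比国际公司 Method and apparatus for compressing and decompressing a higher-order ambisonics signal representation
JP2018025808A (ja) * 2012-05-14 2018-02-15 ドルビー・インターナショナル・アーベー Method or apparatus for compressing or decompressing a higher-order ambisonics signal representation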
US12245012B2 (en) 2012-05-14 2025-03-04 Dolby Laboratories Licensing Corporation Method and apparatus for compressing and decompressing a higher order ambisonics signal representation
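CN107180637B (zh) * 2012-05-14 2021-01-12 杜比国际公司 Method and apparatus for compressing and decompressing a higher-order ambisonics signal representation
CN107180638A (zh) * 2012-05-14 2017-09-19 杜比国际公司 Method and apparatus for compressing and decompressing a higher-order ambisonics signal representation
JP2015525897A (ja) * 2012-07-15 2015-09-07 クゥアルコム・インコーポレイテッドQualcomm Incorporated Systems, methods, apparatus, and computer-readable media for backward-compatible audio coding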
US9788133B2 (en) 2012-07-15 2017-10-10 Qualcomm Incorporated Systems, methods, apparatus, and computer-readable media for backward-compatible audio coding
EP2733963A1 (en) 2012-11-14 2014-05-21 Thomson Licensing Method and apparatus for facilitating listening to a sound signal for matrixed sound signals
US9723424B2 (en) 2012-11-14 2017-08-01 Dolby Laboratories Licensing Corporation Making available a sound signal for higher order ambisonics signals
WO2014075934A1 (en) 2012-11-14 2014-05-22 Thomson Licensing Making available a sound signal for higher order ambisonics signals
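CN109545235B (zh) * 2012-12-12 2023-11-17 杜比国际公司 Method and device for compressing and decompressing a higher-order ambisonics representation of a sound field
CN109545235A (zh) * 2012-12-12 2019-03-29 杜比国际公司 Method and device for compressing and decompressing a higher-order ambisonics representation of a sound field
JP2016509812A (ja) * 2013-02-08 2016-03-31 トムソン ライセンシングThomson Licensing Method and apparatus for determining directions of uncorrelated sound sources in a higher-order ambisonics representation of a sound field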
US9685163B2 (en) 2013-03-01 2017-06-20 Qualcomm Incorporated Transforming spherical harmonic coefficients
US9959875B2 (en) 2013-03-01 2018-05-01 Qualcomm Incorporated Specifying spherical harmonic and/or higher order ambisonics coefficients in bitstreams
WO2014134462A3 (en) * 2013-03-01 2014-11-13 Qualcomm Incorporated Specifying spherical harmonic and/or higher order ambisonics coefficients in bitstreams
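JP2016513811A (ja) * 2013-03-01 2016-05-16 クゥアルコム・インコーポレイテッドQualcomm Incorporated Transforming spherical harmonic coefficients
TWI646847B (zh) * 2013-03-22 2019-01-01 瑞典商杜比國際公司 Method and apparatus for enhancing directivity of an input signal that is a 1st-order ambisonics signal having 0th-order and 1st-order coefficients
KR20150134336A (ko) * 2013-03-22 2015-12-01 톰슨 라이센싱 Method and apparatus for enhancing directivity of a 1st-order ambisonics signal
KR102208258B1 (ko) 2013-03-22 2021-01-27 돌비 인터네셔널 에이비 Method and apparatus for enhancing directivity of a 1st-order ambisonics signal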
US9641834B2 (en) 2013-03-29 2017-05-02 Qualcomm Incorporated RTP payload format designs
US11758344B2 (en) 2013-04-29 2023-09-12 Dolby Laboratories Licensing Corporation Methods and apparatus for compressing and decompressing a higher order ambisonics representation
CN107293304A (zh) * 2013-04-29 2017-10-24 杜比国际公司 Method and apparatus for compressing and decompressing a higher-order ambisonics representation
US11895477B2 (en) 2013-04-29 2024-02-06 Dolby Laboratories Licensing Corporation Methods and apparatus for compressing and decompressing a higher order ambisonics representation
US12317055B2 (en) 2013-04-29 2025-05-27 Dolby Laboratories Licensing Corporation Methods and apparatus for compressing and decompressing a higher order ambisonics representation
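KR20160002846A (ko) * 2013-04-29 2016-01-08 톰슨 라이센싱 Method and apparatus for compressing and decompressing a higher-order ambisonics representation
CN107293304B (zh) * 2013-04-29 2021-01-05 杜比国际公司 Method and apparatus for compressing and decompressing a higher-order ambisonics representation
KR102232486B1 (ko) 2013-04-29 2021-03-29 돌비 인터네셔널 에이비 Method and apparatus for compressing and decompressing a higher-order ambisonics representation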
US10999688B2 (en) 2013-04-29 2021-05-04 Dolby Laboratories Licensing Corporation Methods and apparatus for compressing and decompressing a higher order ambisonics representation
US10623878B2 (en) 2013-04-29 2020-04-14 Dolby Laboratories Licensing Corporation Methods and apparatus for compressing and decompressing a higher order ambisonics representation
JP2016520864A (ja) * 2013-04-29 2016-07-14 トムソン ライセンシングThomson Licensing Method and apparatus for compressing and decompressing a higher-order ambisonics representation
US10264382B2 (en) 2013-04-29 2019-04-16 Dolby Laboratories Licensing Corporation Methods and apparatus for compressing and decompressing a higher order ambisonics representation
US11284210B2 (en) 2013-04-29 2022-03-22 Dolby Laboratories Licensing Corporation Methods and apparatus for compressing and decompressing a higher order ambisonics representation
US11146903B2 (en) 2013-05-29 2021-10-12 Qualcomm Incorporated Compression of decomposed representations of a sound field
CN105340009B (zh) * 2013-05-29 2019-08-09 高通股份有限公司 Compression of decomposed representations of a sound field
US9495968B2 (en) * 2013-05-29 2016-11-15 Qualcomm Incorporated Identifying sources from which higher order ambisonic audio data is generated
US9769586B2 (en) 2013-05-29 2017-09-19 Qualcomm Incorporated Performing order reduction with respect to higher order ambisonic coefficients
US20140358558A1 (en) * 2013-05-29 2014-12-04 Qualcomm Incorporated Identifying sources from which higher order ambisonic audio data is generated
JP2016524727A (ja) * 2013-05-29 2016-08-18 クゥアルコム・インコーポレイテッドQualcomm I Compression of decomposed representations of a sound field
US9980074B2 (en) 2013-05-29 2018-05-22 Qualcomm Incorporated Quantization step sizes for compression of spatial components of a sound field
US9502044B2 (en) 2013-05-29 2016-11-22 Qualcomm Incorporated Compression of decomposed representations of a sound field
US11962990B2 (en) 2013-05-29 2024-04-16 Qualcomm Incorporated Reordering of foreground audio objects in the ambisonics domain
US9716959B2 (en) 2013-05-29 2017-07-25 Qualcomm Incorporated Compensating for error in decomposed representations of sound fields
US9774977B2 (en) 2013-05-29 2017-09-26 Qualcomm Incorporated Extracting decomposed representations of a sound field based on a second configuration mode
US9466305B2 (en) 2013-05-29 2016-10-11 Qualcomm Incorporated Performing positional analysis to code spherical harmonic coefficients
CN105340009A (zh) * 2013-05-29 2016-02-17 高通股份有限公司 Compression of decomposed representations of a sound field
US9883312B2 (en) 2013-05-29 2018-01-30 Qualcomm Incorporated Transformed higher order ambisonics audio data
CN105325015A (zh) * 2013-05-29 2016-02-10 高通股份有限公司 Binauralization of rotated higher-order ambisonics
US10499176B2 (en) 2013-05-29 2019-12-03 Qualcomm Incorporated Identifying codebooks to use when coding spatial components of a sound field
US9763019B2 (en) 2013-05-29 2017-09-12 Qualcomm Incorporated Analysis of decomposed representations of a sound field
US9854377B2 (en) 2013-05-29 2017-12-26 Qualcomm Incorporated Interpolation for decomposed representations of a sound field
US9749768B2 (en) 2013-05-29 2017-08-29 Qualcomm Incorporated Extracting decomposed representations of a sound field based on a first configuration mode
US9723425B2 (en) 2013-06-18 2017-08-01 Dolby Laboratories Licensing Corporation Bass management for audio rendering
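JP2016524883A (ja) * 2013-06-18 2016-08-18 ドルビー ラボラトリーズ ライセンシング コーポレイション Bass management for audio rendering
JP7158452B2 (ja) 2013-07-11 2022-10-21 ドルビー・インターナショナル・アーベー Method and apparatus for generating, from a coefficient-domain representation of an HOA signal, a mixed spatial/coefficient-domain representation of that HOA signal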
US11863958B2 (en) 2013-07-11 2024-01-02 Dolby Laboratories Licensing Corporation Methods and apparatus for decoding encoded HOA signals
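JP7772487B2 (ja) 2013-07-11 2025-11-18 ドルビー・インターナショナル・アーベー Method and apparatus for generating, from a coefficient-domain representation of an HOA signal, a mixed spatial/coefficient-domain representation of that HOA signal
TWI779381B (zh) * 2013-07-11 2022-10-01 瑞典商杜比國際公司 Method, apparatus, and non-transitory computer-readable storage medium for decoding a higher-order ambisonics representation
JP2019113858A (ja) * 2013-07-11 2019-07-11 ドルビー・インターナショナル・アーベー Method and apparatus for generating, from a coefficient-domain representation of an HOA signal, a mixed spatial/coefficient-domain representation of that HOA signal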
US12245013B2 (en) 2013-07-11 2025-03-04 Dolby Laboratories Licensing Corporation Methods and apparatus for decoding encoded HOA signals
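TWI871529B (zh) * 2013-07-11 2025-02-01 瑞典商杜比國際公司 Method, apparatus, and non-transitory computer-readable storage medium for decoding a higher-order ambisonics representation
JP2024113161A (ja) * 2013-07-11 2024-08-21 ドルビー・インターナショナル・アーベー Method and apparatus for generating, from a coefficient-domain representation of an HOA signal, a mixed spatial/coefficient-domain representation of that HOA signal
JP7504174B2 (ja) 2013-07-11 2024-06-21 ドルビー・インターナショナル・アーベー Method and apparatus for generating, from a coefficient-domain representation of an HOA signal, a mixed spatial/coefficient-domain representation of that HOA signal
JP2022185105A (ja) * 2013-07-11 2022-12-13 ドルビー・インターナショナル・アーベー Method and apparatus for generating, from a coefficient-domain representation of an HOA signal, a mixed spatial/coefficient-domain representation of that HOA signal
JP2021036333A (ja) * 2013-07-11 2021-03-04 ドルビー・インターナショナル・アーベー Method and apparatus for generating, from a coefficient-domain representation of an HOA signal, a mixed spatial/coefficient-domain representation of that HOA signal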
US11750996B2 (en) 2013-10-23 2023-09-05 Dolby Laboratories Licensing Corporation Method for and apparatus for decoding/rendering an Ambisonics audio soundfield representation for audio playback using 2D setups
US11770667B2 (en) 2013-10-23 2023-09-26 Dolby Laboratories Licensing Corporation Method for and apparatus for decoding/rendering an ambisonics audio soundfield representation for audio playback using 2D setups
CN108632736A (zh) * 2013-10-23 2018-10-09 杜比国际公司 Method and apparatus for audio signal rendering
US10986455B2 (en) 2013-10-23 2021-04-20 Dolby Laboratories Licensing Corporation Method for and apparatus for decoding/rendering an ambisonics audio soundfield representation for audio playback using 2D setups
US12245014B2 (en) 2013-10-23 2025-03-04 Dolby Laboratories Licensing Corporation Method for and apparatus for decoding/rendering an Ambisonics audio soundfield representation for audio playback using 2D setups
CN108632736B (zh) * 2013-10-23 2021-06-01 杜比国际公司 Method and apparatus for audio signal rendering
US10694308B2 (en) 2013-10-23 2020-06-23 Dolby Laboratories Licensing Corporation Method for and apparatus for decoding/rendering an ambisonics audio soundfield representation for audio playback using 2D setups
US11451918B2 (en) 2013-10-23 2022-09-20 Dolby Laboratories Licensing Corporation Method for and apparatus for decoding/rendering an Ambisonics audio soundfield representation for audio playback using 2D setups
US11488614B2 (en) 2014-01-08 2022-11-01 Dolby Laboratories Licensing Corporation Method and apparatus for decoding a bitstream including encoded Higher Order Ambisonics representations
US12277948B2 (en) * 2014-01-08 2025-04-15 Dolby Laboratories Licensing Corporation Method and apparatus for decoding a bitstream including encoded Higher Order Ambisonics representations
US9990934B2 (en) * 2014-01-08 2018-06-05 Dolby Laboratories Licensing Corporation Method and apparatus for improving the coding of side information required for coding a Higher Order Ambisonics representation of a sound field
US20220115027A1 (en) * 2014-01-08 2022-04-14 Dolby Laboratories Licensing Corporation Method and apparatus for decoding a bitstream including encoded higher order ambisonics representations
US10714112B2 (en) 2014-01-08 2020-07-14 Dolby Laboratories Licensing Corporation Method and apparatus for decoding a bitstream including encoded higher order Ambisonics representations
US20240185872A1 (en) * 2014-01-08 2024-06-06 Dolby Laboratories Licensing Corporation Method and apparatus for decoding a bitstream including encoded higher order ambisonics representations
US11211078B2 (en) 2014-01-08 2021-12-28 Dolby Laboratories Licensing Corporation Method and apparatus for decoding a bitstream including encoded higher order ambisonics representations
US10147437B2 (en) 2014-01-08 2018-12-04 Dolby Laboratories Licensing Corporation Method and apparatus for decoding a bitstream including encoding higher order ambisonics representations
US11869523B2 (en) 2014-01-08 2024-01-09 Dolby Laboratories Licensing Corporation Method and apparatus for decoding a bitstream including encoded higher order ambisonics representations
US10553233B2 (en) 2014-01-08 2020-02-04 Dolby Laboratories Licensing Corporation Method and apparatus for decoding a bitstream including encoded higher order ambisonics representations
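CN105981100A (zh) * 2014-01-08 2016-09-28 杜比国际公司 Method and apparatus for improving the coding of side information required for coding a higher-order ambisonics representation of a sound field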
US20160336021A1 (en) * 2014-01-08 2016-11-17 Dolby International Ab Method and apparatus for improving the coding of side information required for coding a higher order ambisonics representation of a sound field
US20230108008A1 (en) * 2014-01-08 2023-04-06 Dolby Laboratories Licensing Corporation Method and apparatus for decoding a bitstream including encoded higher order ambisonics representations
US10424312B2 (en) 2014-01-08 2019-09-24 Dolby Laboratories Licensing Corporation Method and apparatus for decoding a bitstream including encoded higher order ambisonics representations
US9502045B2 (en) 2014-01-30 2016-11-22 Qualcomm Incorporated Coding independent frames of ambient higher-order ambisonic coefficients
US9489955B2 (en) 2014-01-30 2016-11-08 Qualcomm Incorporated Indicating frame parameter reusability for coding vectors
US9754600B2 (en) 2014-01-30 2017-09-05 Qualcomm Incorporated Reuse of index of huffman codebook for coding vectors
US9747911B2 (en) 2014-01-30 2017-08-29 Qualcomm Incorporated Reuse of syntax element indicating vector quantization codebook used in compressing vectors
US9922656B2 (en) 2014-01-30 2018-03-20 Qualcomm Incorporated Transitioning of ambient higher-order ambisonic coefficients
US9747912B2 (en) 2014-01-30 2017-08-29 Qualcomm Incorporated Reuse of syntax element indicating quantization mode used in compressing vectors
US9653086B2 (en) 2014-01-30 2017-05-16 Qualcomm Incorporated Coding numbers of code vectors for independent frames of higher-order ambisonic coefficients
US10542364B2 (en) 2014-03-21 2020-01-21 Dolby Laboratories Licensing Corporation Methods, apparatus and systems for decompressing a higher order ambisonics (HOA) signal
US12069465B2 (en) 2014-03-21 2024-08-20 Dolby Laboratories Licensing Corporation Methods, apparatus and systems for decompressing a Higher Order Ambisonics (HOA) signal
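TWI697893B (zh) * 2014-03-21 2020-07-01 瑞典商杜比國際公司 Method for compressing a higher-order ambisonics signal, method for decompressing a compressed higher-order ambisonics signal, apparatus for compressing a higher-order ambisonics signal, and apparatus for decompressing a compressed higher-order ambisonics signal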
US11722830B2 (en) 2014-03-21 2023-08-08 Dolby Laboratories Licensing Corporation Methods, apparatus and systems for decompressing a Higher Order Ambisonics (HOA) signal
US11395084B2 (en) 2014-03-21 2022-07-19 Dolby Laboratories Licensing Corporation Methods, apparatus and systems for decompressing a higher order ambisonics (HOA) signal
US10779104B2 (en) 2014-03-21 2020-09-15 Dolby Laboratories Licensing Corporation Methods, apparatus and systems for decompressing a higher order ambisonics (HOA) signal
US11838738B2 (en) 2014-03-24 2023-12-05 Dolby Laboratories Licensing Corporation Method and device for applying Dynamic Range Compression to a Higher Order Ambisonics signal
US10567899B2 (en) 2014-03-24 2020-02-18 Dolby Laboratories Licensing Corporation Method and device for applying dynamic range compression to a higher order ambisonics signal
US10893372B2 (en) 2014-03-24 2021-01-12 Dolby Laboratories Licensing Corporation Method and device for applying dynamic range compression to a higher order ambisonics signal
US10638244B2 (en) 2014-03-24 2020-04-28 Dolby Laboratories Licensing Corporation Method and device for applying dynamic range compression to a higher order ambisonics signal
US12273696B2 (en) 2014-03-24 2025-04-08 Dolby Laboratories Licensing Corporation Method and device for applying dynamic range compression to a higher order ambisonics signal
RU2658888C2 (ru) * 2014-03-24 2018-06-25 Долби Интернэшнл Аб Method and device for applying dynamic range compression to a higher-order ambisonics signal
US10362424B2 (en) 2014-03-24 2019-07-23 Dolby Laboratories Licensing Corporation Method and device for applying dynamic range compression to a higher order ambisonics signal
US9936321B2 (en) 2014-03-24 2018-04-03 Dolby Laboratories Licensing Corporation Method and device for applying dynamic range compression to a higher order ambisonics signal
US9620137B2 (en) 2014-05-16 2017-04-11 Qualcomm Incorporated Determining between scalar and vector quantization in higher order ambisonic coefficients
US10770087B2 (en) 2014-05-16 2020-09-08 Qualcomm Incorporated Selecting codebooks for coding vectors decomposed from higher-order ambisonic audio signals
US9852737B2 (en) 2014-05-16 2017-12-26 Qualcomm Incorporated Coding vectors decomposed from higher-order ambisonics audio signals
US9747910B2 (en) 2014-09-26 2017-08-29 Qualcomm Incorporated Switching between predictive and non-predictive quantization techniques in a higher order ambisonics (HOA) framework
CN107533843A (zh) * 2015-01-30 2018-01-02 Dts公司 System and method for capturing, encoding, distributing, and decoding immersive audio

Also Published As

Publication number Publication date
EP2636036B1 (en) 2014-08-27
PT2636036E (pt) 2014-10-13
US20130216070A1 (en) 2013-08-22
AU2011325335A8 (en) 2015-06-04
BR112013010754A2 (pt) 2018-05-02
BR112013010754B1 (pt) 2021-06-15
EP2450880A1 (en) 2012-05-09
AU2011325335B8 (en) 2015-06-04
EP2636036A1 (en) 2013-09-11
BR112013010754A8 (pt) 2018-06-12
AU2011325335A1 (en) 2013-05-09
JP2013545391A (ja) 2013-12-19
KR101824287B1 (ko) 2018-01-31
US9241216B2 (en) 2016-01-19
CN103250207B (zh) 2016-01-20
AU2011325335B2 (en) 2015-05-21
HK1189297A1 (en) 2014-05-30
KR20140000240A (ko) 2014-01-02
CN103250207A (zh) 2013-08-14
JP5823529B2 (ja) 2015-11-25

Similar Documents

Publication Publication Date Title
WO2012059385A1 (en) Data structure for higher order ambisonics audio data
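TWI646847B (zh) Method and apparatus for enhancing directivity of an input signal that is a 1st-order ambisonics signal having 0th-order and 1st-order coefficients
KR102201713B1 (ko) Method and device for improving the rendering of multi-channel audio signals
CN110459229B (zh) Method for decoding a Higher Order Ambisonics (HOA) representation of a sound or sound field
CN105766002B (zh) Method and apparatus for compressing and decompressing sound field data of a region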
US12424229B2 (en) Methods and apparatus for determining for decoding a compressed HOA sound representation
EP3161821B1 (en) Method for determining for the compression of an hoa data frame representation a lowest integer number of bits required for representing non-differential gain values
US20240404531A1 (en) Method and System for Coding Audio Data
HK1189297B (en) Data structure for higher order ambisonics audio data
HK40041126B (en) Method and apparatus for determining for the decompression of an hoa data frame representation a lowest integer number of bits representing non-differential gain values
HK1233043A1 (en) Method for determining for the compression of an hoa data frame representation a lowest integer number of bits required for representing non-differential gain values
HK1233104A1 (en) Apparatus for determining for the compression of an hoa data frame representation a lowest integer number of bits required for representing non-differential gain values

Legal Events

Date Code Title Description
121 EP: the EPO has been informed by WIPO that EP was designated in this application. Ref document number: 11776422; Country of ref document: EP; Kind code of ref document: A1
ENP Entry into the national phase. Ref document number: 2013537071; Country of ref document: JP; Kind code of ref document: A
WWE WIPO information: entry into national phase. Ref document number: 13883094; Country of ref document: US
ENP Entry into the national phase. Ref document number: 20137011661; Country of ref document: KR; Kind code of ref document: A
NENP Non-entry into the national phase. Ref country code: DE
WWE WIPO information: entry into national phase. Ref document number: 2011776422; Country of ref document: EP
ENP Entry into the national phase. Ref document number: 2011325335; Country of ref document: AU; Date of ref document: 20111026; Kind code of ref document: A
REG Reference to national code. Ref country code: BR; Ref legal event code: B01A; Ref document number: 112013010754; Country of ref document: BR
ENP Entry into the national phase. Ref document number: 112013010754; Country of ref document: BR; Kind code of ref document: A2; Effective date: 20130430