WO2015152666A1 - Procédé et dispositif de décodage de signal audio comprenant un signal hoa - Google Patents

Procédé et dispositif de décodage de signal audio comprenant un signal hoa Download PDF

Info

Publication number
WO2015152666A1
WO2015152666A1 PCT/KR2015/003334 KR2015003334W WO2015152666A1 WO 2015152666 A1 WO2015152666 A1 WO 2015152666A1 KR 2015003334 W KR2015003334 W KR 2015003334W WO 2015152666 A1 WO2015152666 A1 WO 2015152666A1
Authority
WO
WIPO (PCT)
Prior art keywords
hoa
signal
frequency domain
domain
audio
Prior art date
Application number
PCT/KR2015/003334
Other languages
English (en)
Korean (ko)
Inventor
전상배
김선민
Original Assignee
삼성전자 주식회사
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 삼성전자 주식회사 filed Critical 삼성전자 주식회사
Publication of WO2015152666A1 publication Critical patent/WO2015152666A1/fr

Links

Images

Classifications

    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04SSTEREOPHONIC SYSTEMS 
    • H04S3/00Systems employing more than two channels, e.g. quadraphonic
    • H04S3/008Systems employing more than two channels, e.g. quadraphonic in which the audio signals are in digital form, i.e. employing more than two discrete digital channels
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/008Multichannel audio signal coding or decoding using interchannel correlation to reduce redundancy, e.g. joint-stereo, intensity-coding or matrixing
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04SSTEREOPHONIC SYSTEMS 
    • H04S2400/00Details of stereophonic systems covered by H04S but not provided for in its groups
    • H04S2400/01Multi-channel, i.e. more than two input channels, sound reproduction with two speakers wherein the multi-channel information is substantially preserved
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04SSTEREOPHONIC SYSTEMS 
    • H04S2420/00Techniques used stereophonic systems covered by H04S but not provided for in its groups
    • H04S2420/11Application of ambisonics in stereophonic audio systems

Definitions

  • the present invention relates to a method and apparatus for decoding an audio signal comprising a higher order ambisonincs (HOA) signal.
  • HOA ambisonincs
  • high quality multichannel audio signals such as 7.1 channels, 10.2 channels, 13.2 channels, and 22.2 channels are used, which have more channels than the 5.1 audio signals.
  • high-quality multi-channel audio signals are often heard through two-channel stereo speakers or headphones through a personal terminal such as a smartphone or a PC.
  • binaural rendering may be used, which downmixes the multichannel audio signal into the stereo audio signal so that a high quality multichannel audio signal can be listened to in two channels of stereo speakers or headphones.
  • the present invention relates to a method and apparatus for decoding an audio signal including a HOA signal for reducing the amount of computation during binaural rendering.
  • the complexity in the audio decoding stage may be reduced.
  • FIG. 1 is a block diagram illustrating an internal structure of an audio decoder including a HOA decoder according to an embodiment.
  • FIG. 2 is a block diagram illustrating an internal structure of an audio decoder according to an embodiment.
  • FIG. 3 is a flowchart illustrating a method of decoding an audio signal including a HOA signal according to an embodiment.
  • FIG. 4 is a flowchart illustrating a method of decoding an audio signal including a HOA signal according to a processing domain of a HOA decoder according to an embodiment.
  • FIG. 5 is a block diagram illustrating an internal structure of an audio decoder according to an embodiment.
  • An apparatus for decoding an audio signal including a HOA signal comprising: an audio core codec for decoding a bitstream including an audio signal and outputting the HOA signal in a frequency domain or a time domain; And a HOA decoder for rendering and outputting the HOA signal of the frequency domain in the frequency domain.
  • In the frequency domain further comprises a binaural renderer for binaural rendering the signal mixed by the mixer.
  • the audio core codec outputs a HOA signal in the frequency domain when the processing domain of the HOA decoder is neutral or frequency domain, and the HOA decoder renders and outputs a HOA signal in the frequency domain in the frequency domain. .
  • the processing domain of the HOA decoder is determined as a neutral or frequency domain.
  • a method of decoding an audio signal including a HOA signal comprising: decoding the bitstream including an audio signal and outputting the HOA signal in a frequency domain or a time domain; In the frequency domain, rendering and outputting the HOA signal of the frequency domain.
  • any part of the specification is to “include” any component, this means that it may further include other components, except to exclude other components unless otherwise stated.
  • a part is “connected” with another part, this includes not only the case where it is “directly connected” but also the case where it is “electrically connected” with another element between them.
  • part refers to a hardware component, such as software, FPGA or ASIC, and “part” plays certain roles. However, “part” is not meant to be limited to software or hardware.
  • the “unit” may be configured to be in an addressable storage medium and may be configured to play one or more processors.
  • a “part” refers to components such as software components, object-oriented software components, class components, and task components, processes, functions, properties, procedures, Subroutines, segments of program code, drivers, firmware, microcode, circuits, data, databases, data structures, tables, arrays and variables.
  • the functionality provided within the components and “parts” may be combined into a smaller number of components and “parts” or further separated into additional components and “parts”.
  • an audio object refers to each of sound components included in an audio signal.
  • One audio signal may include various audio objects.
  • the audio signal generated by recording the performance of an orchestra includes a plurality of audio objects generated from a plurality of musical instruments such as guitar, violin, and oboe.
  • the HOA signal refers to a signal in which the audio signal is represented by coefficients representing a three-dimensional sound field.
  • the HOA signal is one of content types for representing an audio signal such as an object and a channel.
  • the HOA signal may be included in the bitstream in addition to the information about the channel and the object, and may be rendered as a channel through which the audio signal is output by the HOA decoder.
  • the method of rendering the HOA signal is described as a reference, but it is not limited thereto, and the exemplary embodiments described herein may be applied to a method of rendering various types of audio signals.
  • the processing domain refers to a domain in which a corresponding component is operated.
  • the processing domain can be set to one of time domain, frequency domain and neutral.
  • the processing domain of components that can operate in either the time domain or the frequency domain may be set to neutral.
  • the described technique is described based on the Moving Picture Experts Group-H (MPEG-H) standard, but is not limited thereto and may be applied to other audio coding techniques.
  • MPEG-H Moving Picture Experts Group-H
  • FIG. 1 is a block diagram illustrating an internal structure of an audio decoder including a HOA decoder according to an embodiment.
  • the audio decoder 100 of FIG. 1 may include an audio core codec 110, a format converter 120, an object renderer 130, a HOA decoder 140, a mixer 150, and a binaural renderer 160. Can be.
  • the audio decoder 100 is not limited to the components shown in FIG. 1, and may further include other components as necessary.
  • the audio core codec 110 may extract a plurality of channels, objects, and a HOA signal by decoding a bitstream including an audio signal.
  • the audio core codec 110 may be a unified speech and audio coding (USAC) core decoder.
  • the audio core codec 110 may be various types of decoders for decoding a bitstream including an audio signal.
  • the audio core codec 110 may decode the audio signal using a spectral band replication (SBR) technique that obtains a high band signal from a low band signal and a parameter.
  • SBR spectral band replication
  • the audio core codec 110 may output a decoded audio signal in the frequency domain.
  • the audio core codec 110 may output the HOA signal in the frequency domain by decoding the bitstream.
  • the audio core codec 110 demuxes the bit stream and performs T (time) / frequency (F) conversion on the demuxed data to extract channels, objects, HOA signals, etc. from the bit stream in the frequency domain.
  • the audio core codec 110 may convert the HOA signal extracted as a result of performing the main process into the time domain and output the HOA signal of the frequency domain without domain conversion.
  • the audio core codec 110 may include the HOA decoder 140.
  • the HOA signal in the frequency domain may be converted into a value in the time domain according to the processing domain of and output.
  • the processing domain of the HOA decoder 140 may be determined according to whether the domain conversion methods of the audio core codec 110 and the binaural renderer 160 are the same.
  • the audio core codec 110 converts the domain of the HOA signal with QMF synthesis and the binaural renderer 160 converts the domain of the audio signal with QMF analysis corresponding to QMF synthesis
  • the same method is used. It may be determined that domain conversion is performed. Inverse fast fourier transform (IFFT) and fast fourier transform (FFT) methods may be determined to be the same domain transformation method.
  • IFFT Inverse fast fourier transform
  • FFT fast fourier transform
  • domain conversion is performed in different ways. Can be judged.
  • the HOA decoder 140 When it is determined that the domain transformation is performed in the same manner, respectively, the domain transformation process of the audio core codec 110 and the domain transformation process of the binaural renderer 160 are omitted, and the HOA decoder 140 performs the HOA in the frequency domain.
  • the processing domain of the HOA decoder 140 is the time domain
  • the audio core codec 110 may convert the HOA signal of the decoded frequency domain into a value of the time domain and output the converted signal. If the domain conversion method of the audio core codec 110 and the domain conversion method of the binaural renderer 160 do not correspond to each other, the HOA decoder 140 may operate in the time domain.
  • the audio core codec 110 may output a decoded HOA signal of a frequency domain without domain conversion.
  • the format converter 120 may convert the audio signal to be output to each channel according to the output environment in which the audio signal is to be output.
  • the audio signal to be output through each channel may be input to the format converter 120 as channel information and pre-rendered object information among data output by the audio core codec 110.
  • the output environment in which the audio signal is output may include layout information, performance information, and the like of the speaker to which the audio signal is output. Since the output environment in which the audio signal is output is different from the output environment assumed at the time of encoding, the format converter 120 may convert the audio signal based on the environment information in which the audio signal is actually output.
  • the object renderer 130 may render the audio object at a predetermined spatial position based on metadata regarding the audio object.
  • the HOA decoder 140 may render a HOA signal including the HOA coefficients and the HOA side information output by the audio core codec 110 in multiple channels.
  • HOA coefficients are values representing an audio signal in a three-dimensional sound field space. Based on the HOA side information, the HOA signal can be rendered in multiple channels.
  • the HOA decoder 140 may render the HOA signal and output the rendered HOA signal.
  • the HOA decoder may be referred to as a renderer, a HOA renderer, or the like.
  • the processing domain of HOA decoder 140 may be time domain, frequency domain or neutral.
  • the processing domain of the HOA decoder 140 may be determined according to whether the domain conversion methods of the audio core codec 110 and the binaural renderer 160 are the same.
  • the HOA decoder 140 may receive the HOA signal of the time domain. As the domain conversion method of the audio core codec 110 and the binaural renderer 160 is determined to be different, the processing domain of the HOA decoder 140 may be determined as the time domain. In addition, the HOA decoder 140 may render the HOA signal in the time domain.
  • the HOA decoder 140 in which the processing domain is the neutral or frequency domain can be operated in the frequency domain. Therefore, the audio core codec 110 may output the HOA signal in the frequency domain to the HOA decoder 140 without domain conversion.
  • the processing domain of the HOA decoder 140 may be determined as the frequency domain or the neutral.
  • the HOA decoder 140 may render the HOA signal in the frequency domain in the frequency domain and output the rendered HOA signal in the frequency domain.
  • the mixer 150 may mix a plurality of audio signals such as a rendered object, a rendered HOA signal, and channel information.
  • the mixer 150 is not limited to the example described above, and may mix various types of audio signals.
  • the mixer 150 may appropriately convert and mix the volume, tone, and the like of the rendered audio signals.
  • the mixer 150 may output a mixed audio signal that may be output to each channel.
  • the mixer 150 may output the mixed audio signal of the frequency domain or the time domain according to the input signal. When the mixer 150 mixes the audio signal of the frequency domain, the mixer 150 may output the mixed audio signal of the frequency domain.
  • the mixer 150 may output the mixed audio signals in the time domain.
  • the binaural renderer 160 may downmix the audio signal mixed by the mixer 150 and binaurally render the 2-channel signal.
  • the binaural renderer 160 performs binaural rendering of the multi-channel mixed audio signal so that the multi-channel mixed audio signal can be output to two-channel stereo speakers or headphones through a terminal device such as a smartphone or a PC. can do.
  • the binaural renderer 160 converts the multi-channel audio signal in the time domain into an audio signal in the frequency domain.
  • the processing domain of the HOA decoder 140 may be determined as the time domain, and the binaural renderer 160 may have time.
  • the audio signal of the domain may be input.
  • the binaural renderer 160 may perform binaural rendering on the audio signal of the domain transformed frequency domain.
  • the binaural renderer 160 may perform binaural rendering on the audio signal of the frequency domain without domain conversion.
  • the processing domain of the HOA decoder 140 may be determined as the frequency domain or the neutral, and the binaural renderer 160 is determined.
  • An audio signal in a frequency domain may be input to the channel.
  • the binaural renderer 160 may generate two channels of binaural signals in the frequency domain by performing binaural rendering.
  • the binaural renderer 160 may convert the domain of the binaural signal into the time domain and output an audio signal of the time domain.
  • the rendered HOA signal output from the HOA decoder 140 is a signal of the frequency domain. Accordingly, the mixer 150 may output the mixed audio signal in the frequency domain, and the audio signal in the frequency domain may be input to the binaural renderer 160. Accordingly, since the binaural renderer 160 may perform binaural rendering without converting the audio signal of the time domain into the frequency domain, the complexity of the audio decoding stage may be reduced.
  • the audio core codec 110 converts the decoded HOA signal of the decoded frequency domain into a time domain value and the binaural renderer 160 converts the time domain audio signal into the frequency domain in the same manner.
  • the HOA decoder 140 may render the HOA signal in the frequency domain. For example, when the audio core codec 110 converts the domain of the HOA signal by QMF synthesis, when the binaural renderer 160 converts the domain of the audio signal by QMF analysis corresponding to QMF synthesis, the audio core The domain conversion process in the codec 110 and the binaural renderer 160 may be omitted, and the HOA decoder 140 may render the HOA signal in the frequency domain.
  • the HOA signal rendered by the HOA decoder 140 is a signal in the frequency domain, and the rendered channels and objects are also signals in the frequency domain.
  • a unified interface in the frequency domain may be provided for the post-rendering process for signals such as HOA signals, channels, objects, and the like.
  • the binaural renderer 160 may binaurally render not only the mixed HOA signal but also the signals of the mixed channel and the object.
  • the mixed channel and object signals that may be input to the binaural renderer 160 may be signals in a frequency domain.
  • the binaural renderer 160 may binaurally render the signals of the mixed channels and objects in the frequency domain.
  • the binaural renderer 240 converts the domain of the mixed HOA signal into the frequency domain, unlike the signals of the channel and the object. After that, you can perform binaural rendering, which is the main process.
  • the binaural renderer 160 receives the mixed HOA signal in the frequency domain. can do. Accordingly, the binaural renderer 160 performs the binaural rendering in the frequency domain on the mixed HOA signal in the frequency domain without mixing the other audio signals in the frequency domain, for example, the channel and object signals. Can be done.
  • the binaural signal in the frequency domain may be converted into a value in the time domain according to F / T conversion.
  • the binaural signal in the frequency domain may be converted into the time domain such that the binaural signal is sequentially output through the audio output device over time.
  • the converted binaural signal of the time domain may be finally output through a two-channel audio output device such as a speaker or a headphone.
  • FIG. 2 is a block diagram illustrating an internal structure of an audio decoder according to an embodiment.
  • the audio decoder 200 may be a terminal device that can be used by a user.
  • the audio decoder 200 may include a smart television (television), ultra high definition (UHD) TV, a monitor, a personal computer (PC), a notebook computer, a mobile phone, a tablet PC, a navigation terminal, Smart phones, personal digital assistants (PDAs), portable multimedia players (PMPs), and digital broadcast receivers.
  • the audio decoder 200 may include an audio core codec 210 and a HOA decoder 220.
  • the audio core codec 210 and the HOA decoder 220 of FIG. 2 may correspond to the audio core codec 110 and the HOA decoder 140 of FIG. 1, respectively.
  • the audio core codec 210 may obtain a HOA signal by decoding a bitstream including an audio signal. When decoding data using the SBR technology, the audio core codec 210 may obtain a decoded audio signal in the frequency domain. The audio core codec 210 may convert and output the HOA signal in the frequency domain into a value in the time domain according to the processing domain of the HOA decoder 220. The processing domain of the HOA renderer 220 may be determined depending on whether the domain conversion methods of the audio core codec 210 and the binaural renderer 160 are the same.
  • the audio core codec 210 may convert and output the HOA signal of the decoded frequency domain into a value of the time domain.
  • the audio core codec 210 may output the decoded HOA signal of the frequency domain without domain conversion.
  • the HOA decoder 220 may render the HOA signal output by the audio core codec 210.
  • the processing domain of the HOA decoder 220 is a neutral or frequency domain
  • the HOA signal of the frequency domain input from the audio core codec 210 may be rendered.
  • the HOA decoder 220 may render the HOA signal in the frequency domain and output the HOA signal in the rendered frequency domain.
  • the HOA signal rendered by the HOA decoder 220 may be mixed with other audio signals in the frequency domain and then binaurally rendered and finally output.
  • FIG. 3 is a flowchart illustrating a method of decoding an audio signal including a HOA signal according to an embodiment.
  • the audio core codec 210 may decode a bit stream and output a HOA signal in a frequency domain or a time domain.
  • the HOA signal may include a HOA coefficient and HOA side information.
  • the audio core codec 210 obtains a decoded audio signal in the frequency domain, and decodes the decoded HOA signal in the time domain or the frequency domain according to the processing domain of the HOA decoder 220. You can print
  • the audio core codec 210 may output the time domain HOA signal by converting the decoded HOA signal into a value of the time domain.
  • the audio core codec 210 may output the decoded HOA signal of the frequency domain without domain conversion.
  • the processing domain of the HOA decoder 220 may be determined according to whether the domain conversion methods of the audio core codec 110 and the binaural renderer are the same.
  • the HOA decoder 220 may render the HOA signal in a plurality of channels in the frequency domain or the time domain according to the processing domain of the HOA decoder 220.
  • the HOA decoder 220 may receive a time domain HOA signal from the audio core codec 210.
  • the HOA decoder 220 may output the rendered HOA signal in the time domain.
  • the HOA decoder 220 may receive a HOA signal in the frequency domain from the audio core codec 210.
  • the HOA decoder 220 may render the HOA signal in the frequency domain and output the HOA signal in the rendered frequency domain. Therefore, unlike the HOA decoder 220 in the time domain, the HOA decoder 220 may directly output the rendered HOA signal in the frequency domain without performing a domain conversion process.
  • FIG. 4 is a flowchart illustrating a method of decoding an audio signal including a HOA signal according to a processing domain of a HOA decoder according to an embodiment.
  • the audio core codec 210 may obtain a HOA signal in a frequency domain by decoding a bit stream including an audio signal.
  • the audio core codec 210 may obtain a decoded audio signal in the frequency domain.
  • the processing domain of the HOA decoder 220 may be determined depending on whether the audio core codec and the binaural renderer have the same domain conversion method.
  • the processing domain of HOA decoder 220 may be set to one of time domain, frequency domain, and neutral.
  • the processing domain of the HOA decoder 220 may be determined as a frequency domain or a neutral. Therefore, the audio core codec 210 may output the decoded HOA signal of the frequency domain without domain conversion.
  • the HOA decoder 220 may render a plurality of channels in the frequency domain with respect to the HOA signal in the frequency domain output by the audio core codec 210.
  • the processing domain of the HOA decoder 220 may be determined as the time domain.
  • the HOA decoder 220 may render the HOA signal decoded in the time domain.
  • the audio core codec 210 may convert and output a HOA signal in a frequency domain into a HOA signal in a time domain.
  • the HOA decoder 220 may output the HOA signal in the time domain by rendering the HOA signal in the time domain input from the audio core codec 210 in the time domain.
  • the audio decoder 200 may mix the HOA signal of the frequency domain or time domain rendered by the HOA decoder 220 with another audio signal.
  • the audio decoder 200 may further include a mixer 150 for mixing the plurality of audio signals.
  • the audio decoder 200 may binaurally render the mixed signal in the frequency domain or the time domain in the frequency domain.
  • the audio decoder 200 may further include a binaural renderer 160 for performing binaural rendering. Since the binaural renderer 160 may perform binaural rendering in the frequency domain, when the mixed signal of the time domain is received, the binaural renderer 160 may further perform processing for converting the time domain signal into the frequency domain. . However, when the binaural renderer 160 receives the mixed signal of the frequency domain, the binaural renderer may perform binaural rendering without performing a domain conversion process. The binaural renderer 160 may receive the mixed signal of the frequency domain when the HOA decoder 140 operates in the frequency domain.
  • the HOA signal rendered by the HOA decoder 220 is a signal in the frequency domain, and the rendered channels and objects are also signals in the frequency domain.
  • a uniform interface in the frequency domain can be provided for tasks performed by mixer 150 and binaural renderer 160 using audio signals, such as rendered HOA signals, channels, objects, and the like.
  • the computation amount of the task performed in the frequency domain is less than the computation amount of the task performed in the time domain, the computation amount of the task after rendering may be reduced.
  • the binaural rendered binaural signal may be converted to the time domain and output to a device capable of outputting an audio signal such as a speaker or a headphone.
  • FIG. 5 is a block diagram illustrating an internal structure of an audio decoder according to an embodiment.
  • the audio decoder 500 may include an audio core codec 510, a HOA decoder 520, and a binaural renderer 530.
  • the audio core codec 510 demuxes 511 the bit stream and performs T (time) / frequency (F) conversion on the demuxed data (512), so that channels, objects,
  • the main process 513 for extracting the HOA signal and the like may be performed.
  • the audio core codec 110 may output the HOA signal of the frequency domain extracted as a result of the main process.
  • the HOA decoder 520 may render the HOA signal output by the audio core codec 110 in the frequency domain.
  • the HOA decoder 520 may output the HOA signal in the rendered frequency domain.
  • the binaural renderer 530 may perform binaural rendering in the frequency domain, which is a main process 531 for re-rendering the HOA signal in the rendered frequency domain in two channels.
  • the binaural renderer 530 performs binaural rendering, and then performs F / T conversion (532) to convert a signal in the frequency domain to the time domain and output the signal to a two-channel output device such as a speaker or a headphone. Can be.
  • a domain conversion method for converting the HOA signal output from the audio core codec 510 into the time domain and a domain conversion method for converting the HOA signal of the time domain input from the binaural renderer 530 into the frequency domain 5 the F / T conversion process of the audio core codec 510 and the T / F conversion process of the binaural renderer 530 may be omitted.
  • the HOA decoder 520 may render in the frequency domain instead of rendering in the time domain. Therefore, according to an embodiment, since some processes of the audio core codec 510 and the binaural renderer 530 may be omitted, the amount of computation at the decoding stage may be reduced.
  • the complexity in the audio decoding stage may be reduced.
  • the method according to some embodiments may be embodied in the form of program instructions that may be executed by various computer means and recorded on a computer readable medium.
  • the computer readable medium may include program instructions, data files, data structures, etc. alone or in combination.
  • Program instructions recorded on the media may be those specially designed and constructed for the purposes of the present invention, or they may be of the kind well-known and available to those having skill in the computer software arts.
  • Examples of computer-readable recording media include magnetic media such as hard disks, floppy disks, and magnetic tape, optical media such as CD-ROMs, DVDs, and magnetic disks, such as floppy disks.
  • Examples of program instructions include not only machine code generated by a compiler, but also high-level language code that can be executed by a computer using an interpreter or the like.

Landscapes

  • Engineering & Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Signal Processing (AREA)
  • Acoustics & Sound (AREA)
  • Multimedia (AREA)
  • Mathematical Physics (AREA)
  • Computational Linguistics (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Human Computer Interaction (AREA)
  • Stereophonic System (AREA)

Abstract

L'invention concerne un dispositif de décodage d'un signal audio comprenant un signal HOA, comprenant : un codec audio de noyau destiné à décoder un train de bits comprenant un signal audio pour délivrer en sortie le signal HOA d'un domaine fréquentiel ou d'un domaine temporel ; et un décodeur HOA destiné à restituer et à délivrer en sortie un signal HOA du domaine fréquentiel dans le domaine fréquentiel.
PCT/KR2015/003334 2014-04-02 2015-04-02 Procédé et dispositif de décodage de signal audio comprenant un signal hoa WO2015152666A1 (fr)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
US201461974081P 2014-04-02 2014-04-02
US61/974,081 2014-04-02

Publications (1)

Publication Number Publication Date
WO2015152666A1 true WO2015152666A1 (fr) 2015-10-08

Family

ID=54240887

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/KR2015/003334 WO2015152666A1 (fr) 2014-04-02 2015-04-02 Procédé et dispositif de décodage de signal audio comprenant un signal hoa

Country Status (1)

Country Link
WO (1) WO2015152666A1 (fr)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN110099351A (zh) * 2019-04-01 2019-08-06 中车青岛四方机车车辆股份有限公司 一种声场回放方法、装置和系统

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20090012796A1 (en) * 2006-02-07 2009-01-08 Lg Electronics Inc. Apparatus and Method for Encoding/Decoding Signal
US20110264456A1 (en) * 2008-10-07 2011-10-27 Fraunhofer-Gesellschaft Zur Foerderung Der Angewandten Forschung E.V. Binaural rendering of a multi-channel audio signal
US20130216070A1 (en) * 2010-11-05 2013-08-22 Florian Keiler Data structure for higher order ambisonics audio data
US20140023196A1 (en) * 2012-07-20 2014-01-23 Qualcomm Incorporated Scalable downmix design with feedback for object-based surround codec

Patent Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20090012796A1 (en) * 2006-02-07 2009-01-08 Lg Electronics Inc. Apparatus and Method for Encoding/Decoding Signal
US20110264456A1 (en) * 2008-10-07 2011-10-27 Fraunhofer-Gesellschaft Zur Foerderung Der Angewandten Forschung E.V. Binaural rendering of a multi-channel audio signal
US20130216070A1 (en) * 2010-11-05 2013-08-22 Florian Keiler Data structure for higher order ambisonics audio data
US20140023196A1 (en) * 2012-07-20 2014-01-23 Qualcomm Incorporated Scalable downmix design with feedback for object-based surround codec

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
SEO, JEONG IL ET AL.: "Current State of Audio Standardization for Realistic Boradcasting", KOREA SOCIETY BROADCAST ENGINEERS MAGAZINE, vol. 19, 1 January 2014 (2014-01-01), pages 37 - 46, Retrieved from the Internet <URL:http://libra.kaist.ac.kr/42493d2/_Lib_Proxy_Url/www.dbpia.co.kr/Journal/ArticleDetail/3361153> *

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN110099351A (zh) * 2019-04-01 2019-08-06 中车青岛四方机车车辆股份有限公司 一种声场回放方法、装置和系统
CN110099351B (zh) * 2019-04-01 2020-11-03 中车青岛四方机车车辆股份有限公司 一种声场回放方法、装置和系统

Similar Documents

Publication Publication Date Title
US10674262B2 (en) Merging audio signals with spatial metadata
JP6105062B2 (ja) 後方互換性のあるオーディオ符号化のためのシステム、方法、装置、およびコンピュータ可読媒体
KR101759005B1 (ko) 3d 오디오 계층적 코딩을 이용한 라우드스피커 포지션 보상
WO2015105393A1 (fr) Procédé et appareil de reproduction d&#39;un contenu audio tridimensionnel
WO2011021845A2 (fr) Procédé et appareil destinés à coder un signal audio multicanal et procédé et appareil destinés à décoder un signal audio multicanal
US20140350944A1 (en) Encoding and reproduction of three dimensional audio soundtracks
WO2015152661A1 (fr) Procédé et appareil pour restituer un objet audio
US11924627B2 (en) Ambience audio representation and associated rendering
KR20120082738A (ko) 다채널 멀티트랙 오디오 시스템 및 오디오 처리 방법
WO2014021587A1 (fr) Dispositif et procédé de traitement de signal audio
CN112567765B (zh) 空间音频捕获、传输和再现
EP3836571A1 (fr) Appareil de sortie audio et son procédé de commande
CN111316353A (zh) 确定空间音频参数编码和相关联的解码
Goodwin et al. Binaural 3-D audio rendering based on spatial audio scene coding
WO2010150635A1 (fr) Système de traitement de signal acoustique, dispositif de décodage de signal acoustique, procédé de traitement et programme associé
WO2015152666A1 (fr) Procédé et dispositif de décodage de signal audio comprenant un signal hoa
WO2014021586A1 (fr) Procédé et dispositif de traitement de signal audio
WO2015126228A1 (fr) Procédé et dispositif de classification de signal, et procédé et dispositif de codage audio les utilisant
WO2012087042A2 (fr) Appareil de transmission de programme audiovisuel et procédé de transmission de programme audiovisuel pour fournir un signal audio basé objet, et appareil de lecture de programme audiovisuel et procédé de lecture de programme audiovisuel
WO2014171791A1 (fr) Appareil et procédé de traitement de signal audio multicanal
CN108028988A (zh) 处理低复杂度格式转换的内部声道的设备和方法
WO2014058275A1 (fr) Dispositif et méthode de production de données audios, et dispositif et méthode de lecture de données audios
WO2014112793A1 (fr) Appareil de codage/décodage pour traiter un signal de canal et procédé pour celui-ci
WO2016108655A1 (fr) Procédé de codage de signal audio multicanal, et dispositif de codage pour exécuter le procédé de codage, et procédé de décodage de signal audio multicanal, et dispositif de décodage pour exécuter le procédé de décodage
CN112133316A (zh) 空间音频表示和渲染

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 15772919

Country of ref document: EP

Kind code of ref document: A1

NENP Non-entry into the national phase
122 Ep: pct application non-entry in european phase

Ref document number: 15772919

Country of ref document: EP

Kind code of ref document: A1