WO2006052188A1 - Procede et dispositif de traitement du son enveloppant - Google Patents

Procede et dispositif de traitement du son enveloppant Download PDF

Info

Publication number
WO2006052188A1
WO2006052188A1 PCT/SE2005/001659 SE2005001659W WO2006052188A1 WO 2006052188 A1 WO2006052188 A1 WO 2006052188A1 SE 2005001659 W SE2005001659 W SE 2005001659W WO 2006052188 A1 WO2006052188 A1 WO 2006052188A1
Authority
WO
WIPO (PCT)
Prior art keywords
format
virtual
signals
coincident
microphone
Prior art date
Application number
PCT/SE2005/001659
Other languages
English (en)
Other versions
WO2006052188B1 (fr
Inventor
Bengt-Inge DALENBÄCK
Original Assignee
Catt (Computer Aided Theatre Technique)
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Catt (Computer Aided Theatre Technique) filed Critical Catt (Computer Aided Theatre Technique)
Publication of WO2006052188A1 publication Critical patent/WO2006052188A1/fr
Publication of WO2006052188B1 publication Critical patent/WO2006052188B1/fr

Links

Classifications

    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04SSTEREOPHONIC SYSTEMS 
    • H04S3/00Systems employing more than two channels, e.g. quadraphonic
    • H04S3/02Systems employing more than two channels, e.g. quadraphonic of the matrix type, i.e. in which input signals are combined algebraically, e.g. after having been phase shifted with respect to each other
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04RLOUDSPEAKERS, MICROPHONES, GRAMOPHONE PICK-UPS OR LIKE ACOUSTIC ELECTROMECHANICAL TRANSDUCERS; DEAF-AID SETS; PUBLIC ADDRESS SYSTEMS
    • H04R5/00Stereophonic arrangements
    • H04R5/027Spatial or constructional arrangements of microphones, e.g. in dummy heads
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04SSTEREOPHONIC SYSTEMS 
    • H04S2420/00Techniques used stereophonic systems covered by H04S but not provided for in its groups
    • H04S2420/11Application of ambisonics in stereophonic audio systems

Definitions

  • the present invention relates to processing sound signals and especially transcoding of recorded or artificially created sound, directionally encoded using essentially coincident microphones, via an intermediate transformation stage, to mono, stereo or multi-channel surround sound for loudspeaker replay.
  • the transcoding process extends the possibilities to vary the properties of the replayed sound as compared to currently available techniques, and especially can offer a more stable frontal image and a wider listening area.
  • Non-coincident (spaced) microphone arrays such as INA5, OCT, Fukada Tree, Decca Tree [1] (the latter three augmented with surround microphones), Williams MMA [8], and numerous other possible configurations each with their unique properties.
  • Non-coincident arrays that utilize some kind of physical object separating the microphones such as a Schoeps KFM360 surround sphere [9] or the Holophone [10].
  • the coincident B-format offers many technical advantages such as 3D transformations and mixing and can be ambisonically decoded to a range of different loudspeaker layouts including those with height [5].
  • an irregular loudspeaker layout a layout where loudspeakers are not placed at equal relative angles
  • an ambisonic decode appears to have severe difficulties, typically resulting in a small listening area (small "sweetspot"), frontal image instability and head movement sound coloration [12] [13], and is unusual in current surround post- processing practice.
  • Spaced microphone arrays appear to offer a wide listening area, frontal image stability [12] [13], and an overall sound that is perceived as more spacious.
  • each of the spaced microphone arrays gives a different result when the recorded signals are replayed over a loudspeaker array, typically the ITU recommendation [11], see Fig. 3.
  • Results differ regarding useful listening area size, frontal image stability, coloration, reproduced image size and localization, overall sound character, and front/rear/side balance.
  • several variants are possible such as a variation of the main recording angles, cf. [I].
  • a good example of the many variants that are claimed to be possible is a conference paper about the Williams MMA describing over 220 ways to configure 5 cardioid microphones [8]. This paper, in itself, clearly indicates the need for an approach that does not necessitate a decision on the microphone array already at the recording stage where an unfortunate choice may lead to unsatisfactory results or even to a costly second recording event.
  • Reference [14] relates to techniques of making a recording of or transmitting a sound field from either multiple monaural or directional sound signals that reproduce through multiple discrete loud speakers a sound field with spatial harmonics that substantially exactly match those of the original sound field.
  • the present invention does relate to creating the B-format, only how it is further processed for loudspeaker replay. Indeed, the present invention can be applied on B-format signals created in a way described in [14]. Further [14] emphasizes that the spatial harmonics should be preserved and that is not the case with the present invention since as soon as delays (via the spaced microphones) have been introduced in the processing a discussion of spatial harmonics, that assumes coincident signals, is not relevant. Instead, with spaced microphones, sound localization is partly based on time delays between sound arrivals.
  • the present invention offers an opportunity to overcome actual and perceived problems with coincident microphone arrays so that, after a transcode as described herein, the benefits of spaced arrays are essentially achieved, in particular the stable frontal image and wide listening area but also the more spacious sound, while making it possible to post-pone the decision about exactly which spaced microphone array properties to use until the post-processing stage.
  • the present invention explores the inherent generality and orthogonality of B-format signals and teaches ways to transcode from B-format to nearly any other type of coincident or spaced microphone array so that after the actual B-format recording, or after B-format signals having been artificially generated e.g.
  • B-format consists of highly correlated signals that can, by the transcoding process described, subsequently at the post-processing stage, be decorrelated in a chosen manner by use of a synthesized primary virtual coincident microphone array that creates feeds for a virtual loudspeaker replay (here called L- format) the outputs of which are virtually recorded by a secondary virtual spaced or coincident microphone array of chosen type, as illustrated in Fig. 2.
  • L- format virtual loudspeaker replay
  • the process described represents a virtual re-recording of the original sound field where the choice of microphone array is postponed until the post-processing stage in a similar way as current coincident-only methods, e.g.
  • the process introduces certain artefacts consisting of additional sound pickup by the secondary virtual microphone array due to the omni-directional components in the primary virtual microphones, as follows directly from the description herein, but these artefacts can generally be perceived as beneficial rather than detrimental in comparison to a decode using purely coincident methods.
  • Physical spaced microphone arrays always create comb-filtering, objectively an artefact but that by many are perceived as giving a more spacious sound.
  • the transcode process described introduces additional comb-filtering but not to such a degree that it adversely affects the listening experience.
  • the final loudspeaker replay, see Fig 3 is closer in sound to that of an actual physical microphone array of the same type than to that of a purely coincident decode such as ambisonic.
  • a further benefit of the present invention is that the properties of the final replay can be changed in real time by varying the primary virtual microphone configuration, as well as individual secondary virtual microphone properties, aims and locations, while listening, thus also offering additional "artistic" options at the post-processing stage, options that are of a wider range and also different character than those of current coincident-only techniques.
  • the present invention may be used to transform between any two different spaced arrays by first performing a virtual B-format recording of the original spaced array signals and then applying the invention on the thus obtained B-format signals, but for reasons stated the results will always be limited since the signals from the original spaced microphone array are too decorrelated.
  • the present invention comprises a transcoder arrangement provided for transcoding from directionally encoded audio signals in form of a B-format (W, X, Y, Z) or B-format-like signals to audio signals reproducible via loudspeakers over a listening area.
  • the arrangement comprises: inputs for receiving the directionally encoded audio signals; primary matrix means; inter-connection means; secondary matrix means; and output means for subsequent amplification and replay of transcoded audio signals.
  • the directionally encoded audio signals comprises one or several of: recorded B-format; artificially created B-format; measured B-format room impulse responses; artificially created B-format room impulse responses; recorded B-format-like signals; artificially created B-format-like signals; measured B-format-like room impulse responses; artificially created B-format-like room impulse responses.
  • the directional encoded B-format or B-format-like audio signals are those of substantially perfectly coincident microphones consisting of at least one omni-directional microphone and three orthogonal figure-of-8 microphones.
  • B-format-like is meant B-format that has not to fulfill all qualitative and mathematical demands of a perfect B-format.
  • the directionally encoded audio signals may comprise of B-format signals limited to omni-directional and horizontal components.
  • the directionally encoded audio signals may also comprise B-format or B-format-like signals that have undergone a spatial transform.
  • the primary matrix means comprises signed linear coefficients a / , b j , C j configured to create virtual individual polar microphone directivities derived from said B-format signals;
  • U j ( ⁇ ) h j ( OfJ ⁇ W( ⁇ ) + b j cos( ⁇ j) X( ⁇ ) + c j sin( ⁇ j) Y(n) )
  • K is an arbitrary positive constant.
  • the number (N) of primary virtual microphones and primary matrix output channels are ⁇ 3.
  • the time-delay of signals from the N virtual loudspeakers to the chosen center of the secondary microphone array are effectively and essentially the same.
  • One embodiment may comprise control means for adjusting coefficients of primary matrix means thus altering primary virtual microphone types.
  • a control means may be provided for adjusting coefficients and delays of secondary matrix means, thus altering secondary virtual microphone types, aims, locations, filtering means and delays.
  • the primary and secondary matrices may be integrated.
  • the invention also relates to a method of transcoding a sound signal where directionally encoded audio signals in form of a B-format (W, X, Y, Z) or B-format-like signals are via a primary matrix, synthesizing virtual coincident polar microphones, encoded to intermediate channels used as virtual loudspeaker feeds.
  • the method comprises feeding the output of said virtual loudspeakers to a secondary matrix implementing multiple coincident or non-coincident secondary virtual non-polar or polar microphones thereby transcoding to multiple output channels suitable for subsequent mono, stereo or surround sound loudspeaker replay.
  • the directionally encoded audio signals comprises one or several of: recorded B-format; artificially created B-format; measured B-format room impulse responses; artificially created B-format room impulse responses; recorded B-format-like signals; artificially created B-format-like signals; measured B-format-like room impulse responses; and artificially created B-format-like room impulse responses.
  • the invention also relates to a computer program, preferably realized as a plug-in, for transcoding a digital surround sound in form of a B-format (W, X, Y, Z) or B-format- like signals where directionally encoded audio signals are via a primary matrix, synthesizing virtual polar microphones, encoded to intermediate channels used as virtual loudspeaker feeds, the program comprising a procedure for feeding virtual loudspeakers via a secondary matrix transcoding to multiple channels corresponding to coincident or non-coincident virtual non-polar or polar microphone signals suitable for subsequent mono, stereo or surround sound loudspeaker replay.
  • a computer program preferably realized as a plug-in, for transcoding a digital surround sound in form of a B-format (W, X, Y, Z) or B-format- like signals where directionally encoded audio signals are via a primary matrix, synthesizing virtual polar microphones, encoded to intermediate channels used as virtual loudspeaker feeds
  • the program comprising a procedure for feeding
  • the invention also relates to a computer readable product having there on an instruction set for transcoding to a digital surround sound in form of a B-format (W, X, Y, Z) or B- format-like signals where directionally encoded audio signals are via a primary matrix, synthesizing virtual polar microphones, encoded to intermediate channels used as virtual loudspeaker feeds, the instruction set comprises feeding virtual loudspeakers via a secondary matrix transcoding to multiple channels corresponding to coincident or non-coincident virtual non-polar or polar microphone signals suitable for subsequent mono, stereo or surround sound loudspeaker replay.
  • a computer readable product having there on an instruction set for transcoding to a digital surround sound in form of a B-format (W, X, Y, Z) or B- format-like signals where directionally encoded audio signals are via a primary matrix, synthesizing virtual polar microphones, encoded to intermediate channels used as virtual loudspeaker feeds
  • the instruction set comprises feeding virtual loudspeakers via a secondary matrix trans
  • Figure 1 is a general schematic of a transcoding system
  • Figure 2 is a sample schematic illustration of a 3 x 4 x 5 transcode for ITU surround, see Fig. 3;
  • Figure 3 is a schematic illustration of an actual surround replay example for ITU surround
  • Figure 4 illustrates sound sample delays of individual secondary virtual microphone signals at a large distance r from virtual loudspeakers
  • Figure 6 shows schematically the main steps of a transcoding system according to the invention.
  • the examples are only based on the omni-directional component (W) and horizontal components (X, Y).
  • the example uses the common 5-channel surround system but may as well use, e.g. 6, 7 or more channels or stereo.
  • Figure 1 illustrates a surround processing (transcoding) system comprising: input means (1), primary matrix means (2), virtual loudspeaker feeds (3), secondary matrix means (4), and virtual microphone signals and output means (5).
  • the transcode for ITU surround as illustrated in Fig. 2 comprises: W 5 X 5 Y input signals (6), primary matrix means (7), synthesized primary microphone pattern (8), secondary matrix means (9), virtual loudspeaker (10), secondary virtual microphone (11), surround output (12), and coordinate system origin (13).
  • Output channels of the primary matrix (7) are created by N primary coincident virtual microphones (8) synthesized from the W,X,Y input (6), and are connected to N virtual loudspeakers (10), typically placed horizontally and equidistant at a typically large radius r, see Fig. 2. With air absorption neglected, and except for a common propagation constant and delay, the feed to a virtual loudspeaker , / is then approximately given by:
  • N 4 being the most practical application giving high channel separation j primary virtual microphone and virtual loudspeaker index, 1 ⁇ j ⁇ N h j variable gain factor for primary virtual microphone j, typically h j — 1 Win)
  • B-format W sample n (traditionally, a W signal is that of an omni ⁇ directional microphone divided by the factor 4 ⁇ , otherwise this factor is removed from Eq. 1)
  • Y( ⁇ ) B-format Y sample n (signal from a figure-of-8 microphone in the lateral left direction) ci j , b j , C j primary matrix coefficients for calculation of feeds to virtual loudspeaker j-
  • Fractional delay techniques may be applied as an alternative embodiment s sampling frequency, Hz co speed of sound in air, m/s
  • Fi j ideal first-order polar directivity of secondary virtual microphone i evaluated in direction of virtual loudspeaker ⁇ :
  • N-channel L-format (uj) as a distribution format, from which many Af-channel surround formats can be derived, but that also allows for recreation of the original B-format in cases where e.g. an ambisonic decode [5] is desired or where the encoded sound-field is to be pre-processed, e.g. via B-format transforms [5], before final decode for loudspeaker replay.
  • uj N-channel L-format
  • Examples of useful transcoding to stereo are when the virtual secondary microphones are configured as ORTF, NOS, RAI, DIN or spaced omni [I].
  • the claimed invention does not directly address the problem of surround replay to a loudspeaker layout differing from the ITU recommendation [11], but it offers the possibility to choose a secondary virtual microphone array that gives a result less dependent on exact loudspeaker layout or that is designed for another loudspeaker layout.
  • L-format is not a psycho-acoustic decode, that attempts to take into account aspects of the human hearing and that is aimed for direct physical loudspeaker feeds, but represents intermediate signals suitable for a virtual re-recording of the originally recorded sound field, and the secondary virtual microphones applied do not register energy but pressure including phase.
  • the main steps of a method according to one preferred embodiment of the invention are summarised in the block diagram of figure 6.
  • the directionally encoded audio signals (24) are via a primary matrix (25), which synthesizes virtual coincident polar microphones, encoded to intermediate channels used as virtual loudspeaker feeds (26).
  • the outputs from the virtual loudspeakers are fed to a secondary matrix (27), which implements multiple coincident or non-coincident secondary virtual non-polar or polar microphones.
  • the multiple output channels (28) are suitable for subsequent mono, stereo or surround sound loudspeaker replay.
  • Example 1 A virtual loudspeaker distance dependence may be included so that the ⁇ y delays instead are calculated as:
  • Example 2 The geometrical spread from each virtual loudspeaker, assuming it to behave as a pure point source, may be taken into account where Eq. 2 instead will be calculated as:
  • Virtual loudspeakers may apply directivities G ⁇ so that the level at secondary virtual microphones located far off axis of each virtual loudspeaker is attenuated, see Fig. 5, in which (21) a virtual loudspeaker, (22) secondary virtual microphone, and (23) axis of secondary virtual microphone. Eq. 2 will then instead be calculated as:
  • Example 5 For some applications it can be beneficial to transcode low and high frequency sound using different methods since the human directional hearing has different properties below and above ca. 500 Hz (head size related to the wavelength of sound).
  • two parallel transcoders as described herein may be used.
  • the input (1) of the first transcoder is fed by directionally encoded low-pass filtered signals and the input (1) of the second transcoder is fed with high-pass filtered signals.
  • the first transcoder uses primary and secondary matrices whose settings are optimized or adjusted for low frequency localization while the second transcoder uses matrices optimized or adjusted for high frequency localization.
  • the actual cross-over frequency may vary depending on application, typically between 300 and 700 Hz.
  • the outputs (4) of both transcoders are then summed to form new signals for subsequent replay to loudspeakers for listening.
  • Example 6 A useful alternative to direct recording of music or other performances in B-format is to measure, or artificially create, B-format room impulse responses using one of the many available impulse response measurement methods (Maximum Length)
  • B-format signals obtained in this manner can also be used with the claimed invention but it is equally possible to use the invention on the B-format impulse responses themselves treating them as the input signals (1) of the transcoder.
  • the output of the transcoder will then instead be a new set of room impulse responses corresponding to the secondary virtual microphones. This technique is useful for creating several different types of multi-channel or stereo impulse responses, for reverberation purposes, based on the same measured or simulated B-format impulse responses.

Landscapes

  • Physics & Mathematics (AREA)
  • Engineering & Computer Science (AREA)
  • Algebra (AREA)
  • General Physics & Mathematics (AREA)
  • Mathematical Analysis (AREA)
  • Mathematical Optimization (AREA)
  • Mathematical Physics (AREA)
  • Pure & Applied Mathematics (AREA)
  • Theoretical Computer Science (AREA)
  • Acoustics & Sound (AREA)
  • Signal Processing (AREA)
  • Stereophonic System (AREA)

Abstract

L'invention porte sur un dispositif de traitement du son enveloppant dans lequel des signaux audio codés de manière directionnelle sous un format B (W, X, Y, Z) ou en signaux analogues au format B sont codés, via une matrice primaire synthétisant des microphones arbitraires virtuels, en canaux intermédiaires utilisés comme sources de haut-parleurs virtuels. Ces sources de haut-parleurs virtuels sont transcodées, via une matrice secondaire, en plusieurs canaux correspondant à des signaux de microphones arbitraires virtuels coïncidents ou non coïncidents, appropriés à une réexécution ultérieure par le haut-parleur de son mono, stéréo ou enveloppant. Le procédé de transcodage précité permet la variation des propriétés des paramètres concernant le champ sonore reproduit, tels que localisation, profondeur, taille de l'image, extension, enveloppement et, notamment, peut offrir une image frontale stable et une grande dimension de zone d'écoute.
PCT/SE2005/001659 2004-11-12 2005-11-07 Procede et dispositif de traitement du son enveloppant WO2006052188A1 (fr)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
SE0402772A SE528706C2 (sv) 2004-11-12 2004-11-12 Anordning och processmetod för surroundljud
SE0402772-8 2004-11-12

Publications (2)

Publication Number Publication Date
WO2006052188A1 true WO2006052188A1 (fr) 2006-05-18
WO2006052188B1 WO2006052188B1 (fr) 2006-12-28

Family

ID=33488235

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/SE2005/001659 WO2006052188A1 (fr) 2004-11-12 2005-11-07 Procede et dispositif de traitement du son enveloppant

Country Status (2)

Country Link
SE (1) SE528706C2 (fr)
WO (1) WO2006052188A1 (fr)

Cited By (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
EP2469741A1 (fr) * 2010-12-21 2012-06-27 Thomson Licensing Procédé et appareil pour coder et décoder des trames successives d'une représentation d'ambiophonie d'un champ sonore bi et tridimensionnel
US9332372B2 (en) 2010-06-07 2016-05-03 International Business Machines Corporation Virtual spatial sound scape
US9830918B2 (en) 2013-07-05 2017-11-28 Dolby International Ab Enhanced soundfield coding using parametric component generation
CN109756683A (zh) * 2017-11-02 2019-05-14 深圳市裂石影音科技有限公司 全景音视频录制方法、装置、存储介质和计算机设备
WO2020087678A1 (fr) * 2018-11-01 2020-05-07 华南理工大学 Procédé de lecture virtuelle d'environnement sonore dans un espace tridimensionnel à canaux multiples
US20220030370A1 (en) * 2013-03-12 2022-01-27 Dolby Laboratories Licensing Corporation Method of rendering one or more captured audio soundfields to a listener

Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5671287A (en) * 1992-06-03 1997-09-23 Trifield Productions Limited Stereophonic signal processor
US6072878A (en) * 1997-09-24 2000-06-06 Sonic Solutions Multi-channel surround sound mastering and reproduction techniques that preserve spatial harmonics
US20040054689A1 (en) * 2002-02-25 2004-03-18 Oak Technology, Inc. Transcoding media system

Patent Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5671287A (en) * 1992-06-03 1997-09-23 Trifield Productions Limited Stereophonic signal processor
US6072878A (en) * 1997-09-24 2000-06-06 Sonic Solutions Multi-channel surround sound mastering and reproduction techniques that preserve spatial harmonics
US20040054689A1 (en) * 2002-02-25 2004-03-18 Oak Technology, Inc. Transcoding media system

Cited By (23)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US9332372B2 (en) 2010-06-07 2016-05-03 International Business Machines Corporation Virtual spatial sound scape
KR20180115652A (ko) * 2010-12-21 2018-10-23 돌비 인터네셔널 에이비 2차원 또는 3차원 음장의 앰비소닉스 표현의 연속 프레임을 인코딩 및 디코딩하는 방법 및 장치
JP2012133366A (ja) * 2010-12-21 2012-07-12 Thomson Licensing 二次元または三次元音場のアンビソニックス表現の一連のフレームをエンコードおよびデコードする方法および装置
KR102010914B1 (ko) 2010-12-21 2019-08-14 돌비 인터네셔널 에이비 2차원 또는 3차원 음장의 앰비소닉스 표현의 연속 프레임을 인코딩 및 디코딩하는 방법 및 장치
EP2469742A3 (fr) * 2010-12-21 2012-09-05 Thomson Licensing Procédé et appareil de codage et de décodage de cadres successifs d'une représentation d'ambiophonie de champ sonore bi ou tridimensionnel
KR20120070521A (ko) * 2010-12-21 2012-06-29 톰슨 라이센싱 2차원 또는 3차원 음장의 앰비소닉스 표현의 연속 프레임을 인코딩 및 디코딩하는 방법 및 장치
US9397771B2 (en) 2010-12-21 2016-07-19 Dolby Laboratories Licensing Corporation Method and apparatus for encoding and decoding successive frames of an ambisonics representation of a 2- or 3-dimensional sound field
JP2016224472A (ja) * 2010-12-21 2016-12-28 ドルビー・インターナショナル・アーベー 二次元または三次元音場のアンビソニックス表現の一連のフレームをエンコードおよびデコードする方法および装置
JP7342091B2 (ja) 2010-12-21 2023-09-11 ドルビー・インターナショナル・アーベー 二次元または三次元音場のアンビソニックス表現の一連のフレームをエンコードおよびデコードする方法および装置
KR20190096318A (ko) * 2010-12-21 2019-08-19 돌비 인터네셔널 에이비 2차원 또는 3차원 음장의 앰비소닉스 표현의 연속 프레임을 인코딩 및 디코딩하는 방법 및 장치
EP2469741A1 (fr) * 2010-12-21 2012-06-27 Thomson Licensing Procédé et appareil pour coder et décoder des trames successives d'une représentation d'ambiophonie d'un champ sonore bi et tridimensionnel
JP2022016544A (ja) * 2010-12-21 2022-01-21 ドルビー・インターナショナル・アーベー 二次元または三次元音場のアンビソニックス表現の一連のフレームをエンコードおよびデコードする方法および装置
CN102547549A (zh) * 2010-12-21 2012-07-04 汤姆森特许公司 编码解码2或3维声场环绕声表示的连续帧的方法和装置
KR101909573B1 (ko) 2010-12-21 2018-10-19 돌비 인터네셔널 에이비 2차원 또는 3차원 음장의 앰비소닉스 표현의 연속 프레임을 인코딩 및 디코딩하는 방법 및 장치
KR102131748B1 (ko) 2010-12-21 2020-07-08 돌비 인터네셔널 에이비 2차원 또는 3차원 음장의 앰비소닉스 표현의 연속 프레임을 인코딩 및 디코딩하는 방법 및 장치
JP2020079961A (ja) * 2010-12-21 2020-05-28 ドルビー・インターナショナル・アーベー 二次元または三次元音場のアンビソニックス表現の一連のフレームをエンコードおよびデコードする方法および装置
US20220030370A1 (en) * 2013-03-12 2022-01-27 Dolby Laboratories Licensing Corporation Method of rendering one or more captured audio soundfields to a listener
US11770666B2 (en) * 2013-03-12 2023-09-26 Dolby Laboratories Licensing Corporation Method of rendering one or more captured audio soundfields to a listener
US9830918B2 (en) 2013-07-05 2017-11-28 Dolby International Ab Enhanced soundfield coding using parametric component generation
CN109756683B (zh) * 2017-11-02 2024-06-04 深圳市裂石影音科技有限公司 全景音视频录制方法、装置、存储介质和计算机设备
CN109756683A (zh) * 2017-11-02 2019-05-14 深圳市裂石影音科技有限公司 全景音视频录制方法、装置、存储介质和计算机设备
WO2020087678A1 (fr) * 2018-11-01 2020-05-07 华南理工大学 Procédé de lecture virtuelle d'environnement sonore dans un espace tridimensionnel à canaux multiples
US11962995B2 (en) 2018-11-01 2024-04-16 South China University Of Technology Virtual playback method for surround-sound in multi-channel three-dimensional space

Also Published As

Publication number Publication date
SE0402772D0 (sv) 2004-11-12
SE528706C2 (sv) 2007-01-30
SE0402772L (sv) 2006-07-13
WO2006052188B1 (fr) 2006-12-28

Similar Documents

Publication Publication Date Title
JP6510021B2 (ja) オーディオ装置及びそのオーディオ提供方法
US9584912B2 (en) Spatial audio rendering and encoding
Theile Multichannel natural music recording based on psychoacoustic principles
US8705750B2 (en) Device and method for converting spatial audio signal
EP2279628B1 (fr) Génération d'un son ambiophonique à partir d'un ensemble microphone
US20080298610A1 (en) Parameter Space Re-Panning for Spatial Audio
US8605914B2 (en) Nonlinear filter for separation of center sounds in stereophonic audio
WO2012025580A1 (fr) Procédé et dispositif de reproduction de champ sonore améliorée de signaux d'entrée audio spatialement codés
EP3895451B1 (fr) Procédé et appareil de traitement d'un signal stéréo
Pulkki et al. First‐Order Directional Audio Coding (DirAC)
CN102461213A (zh) 中心声道呈现
WO2006052188A1 (fr) Procede et dispositif de traitement du son enveloppant
Politis et al. Parametric spatial audio effects
WO2017165968A1 (fr) Système et procédé pour créer un audio binaural tridimensionnel à partir de sources sonores stéréo, mono et multicanaux
McCormack et al. Parametric spatial audio effects based on the multi-directional decomposition of ambisonic sound scenes
Kim Height channels
Kim et al. Multi-channel audio recording based on superdirective beamforming for portable multimedia recording devices
Theile Natural 5.1 music recording based on psychoacoustic principals
US20200045419A1 (en) Stereo unfold technology
Pulkki Evaluating spatial sound with binaural auditory model
US20240056735A1 (en) Stereo headphone psychoacoustic sound localization system and method for reconstructing stereo psychoacoustic sound signals using same
Pfanzagl-Cardone Comparative 3D Audio Microphone Array Tests
WO2024081957A1 (fr) Traitement d'externalisation binaurale
Politis et al. Spatial audio coding with spaced microphone arrays for music recording and reproduction.
Pekonen Microphone techniques for spatial sound

Legal Events

Date Code Title Description
AK Designated states

Kind code of ref document: A1

Designated state(s): AE AG AL AM AT AU AZ BA BB BG BR BW BY BZ CA CH CN CO CR CU CZ DE DK DM DZ EC EE EG ES FI GB GD GE GH GM HR HU ID IL IN IS JP KE KG KM KN KP KR KZ LC LK LR LS LT LU LV LY MA MD MG MK MN MW MX MZ NA NG NI NO NZ OM PG PH PL PT RO RU SC SD SE SG SK SL SM SY TJ TM TN TR TT TZ UA UG US UZ VC VN YU ZA ZM ZW

AL Designated countries for regional patents

Kind code of ref document: A1

Designated state(s): BW GH GM KE LS MW MZ NA SD SL SZ TZ UG ZM ZW AM AZ BY KG KZ MD RU TJ TM AT BE BG CH CY CZ DE DK EE ES FI FR GB GR HU IE IS IT LT LU LV MC NL PL PT RO SE SI SK TR BF BJ CF CG CI CM GA GN GQ GW ML MR NE SN TD TG

121 Ep: the epo has been informed by wipo that ep was designated in this application
122 Ep: pct application non-entry in european phase

Ref document number: 05801475

Country of ref document: EP

Kind code of ref document: A1