WO2006052188A1 - Procede et dispositif de traitement du son enveloppant - Google Patents
Procede et dispositif de traitement du son enveloppant Download PDFInfo
- Publication number
- WO2006052188A1 WO2006052188A1 PCT/SE2005/001659 SE2005001659W WO2006052188A1 WO 2006052188 A1 WO2006052188 A1 WO 2006052188A1 SE 2005001659 W SE2005001659 W SE 2005001659W WO 2006052188 A1 WO2006052188 A1 WO 2006052188A1
- Authority
- WO
- WIPO (PCT)
- Prior art keywords
- format
- virtual
- signals
- coincident
- microphone
- Prior art date
Links
- 238000000034 method Methods 0.000 title claims abstract description 31
- 238000012545 processing Methods 0.000 title claims abstract description 20
- 239000011159 matrix material Substances 0.000 claims abstract description 43
- 230000005236 sound signal Effects 0.000 claims abstract description 30
- 230000002194 synthesizing effect Effects 0.000 claims abstract description 7
- 230000001934 delay Effects 0.000 claims description 11
- 238000001914 filtration Methods 0.000 claims description 7
- 230000003321 amplification Effects 0.000 claims description 3
- 238000004364 calculation method Methods 0.000 claims description 3
- 238000003199 nucleic acid amplification method Methods 0.000 claims description 3
- 238000004590 computer program Methods 0.000 claims description 2
- 230000004807 localization Effects 0.000 abstract description 5
- 238000003491 array Methods 0.000 description 16
- 230000000875 corresponding effect Effects 0.000 description 6
- 238000012805 post-processing Methods 0.000 description 6
- 230000008901 benefit Effects 0.000 description 5
- 230000009466 transformation Effects 0.000 description 4
- 230000001788 irregular Effects 0.000 description 3
- 238000012360 testing method Methods 0.000 description 3
- 238000000844 transformation Methods 0.000 description 3
- 230000009286 beneficial effect Effects 0.000 description 2
- 230000001419 dependent effect Effects 0.000 description 2
- 238000000926 separation method Methods 0.000 description 2
- 238000004088 simulation Methods 0.000 description 2
- 240000008042 Zea mays Species 0.000 description 1
- 235000005824 Zea mays ssp. parviglumis Nutrition 0.000 description 1
- 235000002017 Zea mays subsp mays Nutrition 0.000 description 1
- 238000010521 absorption reaction Methods 0.000 description 1
- 230000002411 adverse Effects 0.000 description 1
- 238000013459 approach Methods 0.000 description 1
- 230000002238 attenuated effect Effects 0.000 description 1
- 230000003190 augmentative effect Effects 0.000 description 1
- 238000005094 computer simulation Methods 0.000 description 1
- 235000005822 corn Nutrition 0.000 description 1
- 230000002596 correlated effect Effects 0.000 description 1
- 230000001627 detrimental effect Effects 0.000 description 1
- 238000010586 diagram Methods 0.000 description 1
- 230000004886 head movement Effects 0.000 description 1
- 238000009434 installation Methods 0.000 description 1
- 238000004519 manufacturing process Methods 0.000 description 1
- 238000005259 measurement Methods 0.000 description 1
- 238000000691 measurement method Methods 0.000 description 1
- 230000000630 rising effect Effects 0.000 description 1
- 238000005070 sampling Methods 0.000 description 1
- 230000000007 visual effect Effects 0.000 description 1
Classifications
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04S—STEREOPHONIC SYSTEMS
- H04S3/00—Systems employing more than two channels, e.g. quadraphonic
- H04S3/02—Systems employing more than two channels, e.g. quadraphonic of the matrix type, i.e. in which input signals are combined algebraically, e.g. after having been phase shifted with respect to each other
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04R—LOUDSPEAKERS, MICROPHONES, GRAMOPHONE PICK-UPS OR LIKE ACOUSTIC ELECTROMECHANICAL TRANSDUCERS; DEAF-AID SETS; PUBLIC ADDRESS SYSTEMS
- H04R5/00—Stereophonic arrangements
- H04R5/027—Spatial or constructional arrangements of microphones, e.g. in dummy heads
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04S—STEREOPHONIC SYSTEMS
- H04S2420/00—Techniques used stereophonic systems covered by H04S but not provided for in its groups
- H04S2420/11—Application of ambisonics in stereophonic audio systems
Definitions
- the present invention relates to processing sound signals and especially transcoding of recorded or artificially created sound, directionally encoded using essentially coincident microphones, via an intermediate transformation stage, to mono, stereo or multi-channel surround sound for loudspeaker replay.
- the transcoding process extends the possibilities to vary the properties of the replayed sound as compared to currently available techniques, and especially can offer a more stable frontal image and a wider listening area.
- Non-coincident (spaced) microphone arrays such as INA5, OCT, Fukada Tree, Decca Tree [1] (the latter three augmented with surround microphones), Williams MMA [8], and numerous other possible configurations each with their unique properties.
- Non-coincident arrays that utilize some kind of physical object separating the microphones such as a Schoeps KFM360 surround sphere [9] or the Holophone [10].
- the coincident B-format offers many technical advantages such as 3D transformations and mixing and can be ambisonically decoded to a range of different loudspeaker layouts including those with height [5].
- an irregular loudspeaker layout a layout where loudspeakers are not placed at equal relative angles
- an ambisonic decode appears to have severe difficulties, typically resulting in a small listening area (small "sweetspot"), frontal image instability and head movement sound coloration [12] [13], and is unusual in current surround post- processing practice.
- Spaced microphone arrays appear to offer a wide listening area, frontal image stability [12] [13], and an overall sound that is perceived as more spacious.
- each of the spaced microphone arrays gives a different result when the recorded signals are replayed over a loudspeaker array, typically the ITU recommendation [11], see Fig. 3.
- Results differ regarding useful listening area size, frontal image stability, coloration, reproduced image size and localization, overall sound character, and front/rear/side balance.
- several variants are possible such as a variation of the main recording angles, cf. [I].
- a good example of the many variants that are claimed to be possible is a conference paper about the Williams MMA describing over 220 ways to configure 5 cardioid microphones [8]. This paper, in itself, clearly indicates the need for an approach that does not necessitate a decision on the microphone array already at the recording stage where an unfortunate choice may lead to unsatisfactory results or even to a costly second recording event.
- Reference [14] relates to techniques of making a recording of or transmitting a sound field from either multiple monaural or directional sound signals that reproduce through multiple discrete loud speakers a sound field with spatial harmonics that substantially exactly match those of the original sound field.
- the present invention does relate to creating the B-format, only how it is further processed for loudspeaker replay. Indeed, the present invention can be applied on B-format signals created in a way described in [14]. Further [14] emphasizes that the spatial harmonics should be preserved and that is not the case with the present invention since as soon as delays (via the spaced microphones) have been introduced in the processing a discussion of spatial harmonics, that assumes coincident signals, is not relevant. Instead, with spaced microphones, sound localization is partly based on time delays between sound arrivals.
- the present invention offers an opportunity to overcome actual and perceived problems with coincident microphone arrays so that, after a transcode as described herein, the benefits of spaced arrays are essentially achieved, in particular the stable frontal image and wide listening area but also the more spacious sound, while making it possible to post-pone the decision about exactly which spaced microphone array properties to use until the post-processing stage.
- the present invention explores the inherent generality and orthogonality of B-format signals and teaches ways to transcode from B-format to nearly any other type of coincident or spaced microphone array so that after the actual B-format recording, or after B-format signals having been artificially generated e.g.
- B-format consists of highly correlated signals that can, by the transcoding process described, subsequently at the post-processing stage, be decorrelated in a chosen manner by use of a synthesized primary virtual coincident microphone array that creates feeds for a virtual loudspeaker replay (here called L- format) the outputs of which are virtually recorded by a secondary virtual spaced or coincident microphone array of chosen type, as illustrated in Fig. 2.
- L- format virtual loudspeaker replay
- the process described represents a virtual re-recording of the original sound field where the choice of microphone array is postponed until the post-processing stage in a similar way as current coincident-only methods, e.g.
- the process introduces certain artefacts consisting of additional sound pickup by the secondary virtual microphone array due to the omni-directional components in the primary virtual microphones, as follows directly from the description herein, but these artefacts can generally be perceived as beneficial rather than detrimental in comparison to a decode using purely coincident methods.
- Physical spaced microphone arrays always create comb-filtering, objectively an artefact but that by many are perceived as giving a more spacious sound.
- the transcode process described introduces additional comb-filtering but not to such a degree that it adversely affects the listening experience.
- the final loudspeaker replay, see Fig 3 is closer in sound to that of an actual physical microphone array of the same type than to that of a purely coincident decode such as ambisonic.
- a further benefit of the present invention is that the properties of the final replay can be changed in real time by varying the primary virtual microphone configuration, as well as individual secondary virtual microphone properties, aims and locations, while listening, thus also offering additional "artistic" options at the post-processing stage, options that are of a wider range and also different character than those of current coincident-only techniques.
- the present invention may be used to transform between any two different spaced arrays by first performing a virtual B-format recording of the original spaced array signals and then applying the invention on the thus obtained B-format signals, but for reasons stated the results will always be limited since the signals from the original spaced microphone array are too decorrelated.
- the present invention comprises a transcoder arrangement provided for transcoding from directionally encoded audio signals in form of a B-format (W, X, Y, Z) or B-format-like signals to audio signals reproducible via loudspeakers over a listening area.
- the arrangement comprises: inputs for receiving the directionally encoded audio signals; primary matrix means; inter-connection means; secondary matrix means; and output means for subsequent amplification and replay of transcoded audio signals.
- the directionally encoded audio signals comprises one or several of: recorded B-format; artificially created B-format; measured B-format room impulse responses; artificially created B-format room impulse responses; recorded B-format-like signals; artificially created B-format-like signals; measured B-format-like room impulse responses; artificially created B-format-like room impulse responses.
- the directional encoded B-format or B-format-like audio signals are those of substantially perfectly coincident microphones consisting of at least one omni-directional microphone and three orthogonal figure-of-8 microphones.
- B-format-like is meant B-format that has not to fulfill all qualitative and mathematical demands of a perfect B-format.
- the directionally encoded audio signals may comprise of B-format signals limited to omni-directional and horizontal components.
- the directionally encoded audio signals may also comprise B-format or B-format-like signals that have undergone a spatial transform.
- the primary matrix means comprises signed linear coefficients a / , b j , C j configured to create virtual individual polar microphone directivities derived from said B-format signals;
- U j ( ⁇ ) h j ( OfJ ⁇ W( ⁇ ) + b j cos( ⁇ j) X( ⁇ ) + c j sin( ⁇ j) Y(n) )
- K is an arbitrary positive constant.
- the number (N) of primary virtual microphones and primary matrix output channels are ⁇ 3.
- the time-delay of signals from the N virtual loudspeakers to the chosen center of the secondary microphone array are effectively and essentially the same.
- One embodiment may comprise control means for adjusting coefficients of primary matrix means thus altering primary virtual microphone types.
- a control means may be provided for adjusting coefficients and delays of secondary matrix means, thus altering secondary virtual microphone types, aims, locations, filtering means and delays.
- the primary and secondary matrices may be integrated.
- the invention also relates to a method of transcoding a sound signal where directionally encoded audio signals in form of a B-format (W, X, Y, Z) or B-format-like signals are via a primary matrix, synthesizing virtual coincident polar microphones, encoded to intermediate channels used as virtual loudspeaker feeds.
- the method comprises feeding the output of said virtual loudspeakers to a secondary matrix implementing multiple coincident or non-coincident secondary virtual non-polar or polar microphones thereby transcoding to multiple output channels suitable for subsequent mono, stereo or surround sound loudspeaker replay.
- the directionally encoded audio signals comprises one or several of: recorded B-format; artificially created B-format; measured B-format room impulse responses; artificially created B-format room impulse responses; recorded B-format-like signals; artificially created B-format-like signals; measured B-format-like room impulse responses; and artificially created B-format-like room impulse responses.
- the invention also relates to a computer program, preferably realized as a plug-in, for transcoding a digital surround sound in form of a B-format (W, X, Y, Z) or B-format- like signals where directionally encoded audio signals are via a primary matrix, synthesizing virtual polar microphones, encoded to intermediate channels used as virtual loudspeaker feeds, the program comprising a procedure for feeding virtual loudspeakers via a secondary matrix transcoding to multiple channels corresponding to coincident or non-coincident virtual non-polar or polar microphone signals suitable for subsequent mono, stereo or surround sound loudspeaker replay.
- a computer program preferably realized as a plug-in, for transcoding a digital surround sound in form of a B-format (W, X, Y, Z) or B-format- like signals where directionally encoded audio signals are via a primary matrix, synthesizing virtual polar microphones, encoded to intermediate channels used as virtual loudspeaker feeds
- the program comprising a procedure for feeding
- the invention also relates to a computer readable product having there on an instruction set for transcoding to a digital surround sound in form of a B-format (W, X, Y, Z) or B- format-like signals where directionally encoded audio signals are via a primary matrix, synthesizing virtual polar microphones, encoded to intermediate channels used as virtual loudspeaker feeds, the instruction set comprises feeding virtual loudspeakers via a secondary matrix transcoding to multiple channels corresponding to coincident or non-coincident virtual non-polar or polar microphone signals suitable for subsequent mono, stereo or surround sound loudspeaker replay.
- a computer readable product having there on an instruction set for transcoding to a digital surround sound in form of a B-format (W, X, Y, Z) or B- format-like signals where directionally encoded audio signals are via a primary matrix, synthesizing virtual polar microphones, encoded to intermediate channels used as virtual loudspeaker feeds
- the instruction set comprises feeding virtual loudspeakers via a secondary matrix trans
- Figure 1 is a general schematic of a transcoding system
- Figure 2 is a sample schematic illustration of a 3 x 4 x 5 transcode for ITU surround, see Fig. 3;
- Figure 3 is a schematic illustration of an actual surround replay example for ITU surround
- Figure 4 illustrates sound sample delays of individual secondary virtual microphone signals at a large distance r from virtual loudspeakers
- Figure 6 shows schematically the main steps of a transcoding system according to the invention.
- the examples are only based on the omni-directional component (W) and horizontal components (X, Y).
- the example uses the common 5-channel surround system but may as well use, e.g. 6, 7 or more channels or stereo.
- Figure 1 illustrates a surround processing (transcoding) system comprising: input means (1), primary matrix means (2), virtual loudspeaker feeds (3), secondary matrix means (4), and virtual microphone signals and output means (5).
- the transcode for ITU surround as illustrated in Fig. 2 comprises: W 5 X 5 Y input signals (6), primary matrix means (7), synthesized primary microphone pattern (8), secondary matrix means (9), virtual loudspeaker (10), secondary virtual microphone (11), surround output (12), and coordinate system origin (13).
- Output channels of the primary matrix (7) are created by N primary coincident virtual microphones (8) synthesized from the W,X,Y input (6), and are connected to N virtual loudspeakers (10), typically placed horizontally and equidistant at a typically large radius r, see Fig. 2. With air absorption neglected, and except for a common propagation constant and delay, the feed to a virtual loudspeaker , / is then approximately given by:
- N 4 being the most practical application giving high channel separation j primary virtual microphone and virtual loudspeaker index, 1 ⁇ j ⁇ N h j variable gain factor for primary virtual microphone j, typically h j — 1 Win)
- B-format W sample n (traditionally, a W signal is that of an omni ⁇ directional microphone divided by the factor 4 ⁇ , otherwise this factor is removed from Eq. 1)
- Y( ⁇ ) B-format Y sample n (signal from a figure-of-8 microphone in the lateral left direction) ci j , b j , C j primary matrix coefficients for calculation of feeds to virtual loudspeaker j-
- Fractional delay techniques may be applied as an alternative embodiment s sampling frequency, Hz co speed of sound in air, m/s
- Fi j ideal first-order polar directivity of secondary virtual microphone i evaluated in direction of virtual loudspeaker ⁇ :
- N-channel L-format (uj) as a distribution format, from which many Af-channel surround formats can be derived, but that also allows for recreation of the original B-format in cases where e.g. an ambisonic decode [5] is desired or where the encoded sound-field is to be pre-processed, e.g. via B-format transforms [5], before final decode for loudspeaker replay.
- uj N-channel L-format
- Examples of useful transcoding to stereo are when the virtual secondary microphones are configured as ORTF, NOS, RAI, DIN or spaced omni [I].
- the claimed invention does not directly address the problem of surround replay to a loudspeaker layout differing from the ITU recommendation [11], but it offers the possibility to choose a secondary virtual microphone array that gives a result less dependent on exact loudspeaker layout or that is designed for another loudspeaker layout.
- L-format is not a psycho-acoustic decode, that attempts to take into account aspects of the human hearing and that is aimed for direct physical loudspeaker feeds, but represents intermediate signals suitable for a virtual re-recording of the originally recorded sound field, and the secondary virtual microphones applied do not register energy but pressure including phase.
- the main steps of a method according to one preferred embodiment of the invention are summarised in the block diagram of figure 6.
- the directionally encoded audio signals (24) are via a primary matrix (25), which synthesizes virtual coincident polar microphones, encoded to intermediate channels used as virtual loudspeaker feeds (26).
- the outputs from the virtual loudspeakers are fed to a secondary matrix (27), which implements multiple coincident or non-coincident secondary virtual non-polar or polar microphones.
- the multiple output channels (28) are suitable for subsequent mono, stereo or surround sound loudspeaker replay.
- Example 1 A virtual loudspeaker distance dependence may be included so that the ⁇ y delays instead are calculated as:
- Example 2 The geometrical spread from each virtual loudspeaker, assuming it to behave as a pure point source, may be taken into account where Eq. 2 instead will be calculated as:
- Virtual loudspeakers may apply directivities G ⁇ so that the level at secondary virtual microphones located far off axis of each virtual loudspeaker is attenuated, see Fig. 5, in which (21) a virtual loudspeaker, (22) secondary virtual microphone, and (23) axis of secondary virtual microphone. Eq. 2 will then instead be calculated as:
- Example 5 For some applications it can be beneficial to transcode low and high frequency sound using different methods since the human directional hearing has different properties below and above ca. 500 Hz (head size related to the wavelength of sound).
- two parallel transcoders as described herein may be used.
- the input (1) of the first transcoder is fed by directionally encoded low-pass filtered signals and the input (1) of the second transcoder is fed with high-pass filtered signals.
- the first transcoder uses primary and secondary matrices whose settings are optimized or adjusted for low frequency localization while the second transcoder uses matrices optimized or adjusted for high frequency localization.
- the actual cross-over frequency may vary depending on application, typically between 300 and 700 Hz.
- the outputs (4) of both transcoders are then summed to form new signals for subsequent replay to loudspeakers for listening.
- Example 6 A useful alternative to direct recording of music or other performances in B-format is to measure, or artificially create, B-format room impulse responses using one of the many available impulse response measurement methods (Maximum Length)
- B-format signals obtained in this manner can also be used with the claimed invention but it is equally possible to use the invention on the B-format impulse responses themselves treating them as the input signals (1) of the transcoder.
- the output of the transcoder will then instead be a new set of room impulse responses corresponding to the secondary virtual microphones. This technique is useful for creating several different types of multi-channel or stereo impulse responses, for reverberation purposes, based on the same measured or simulated B-format impulse responses.
Landscapes
- Physics & Mathematics (AREA)
- Engineering & Computer Science (AREA)
- Algebra (AREA)
- General Physics & Mathematics (AREA)
- Mathematical Analysis (AREA)
- Mathematical Optimization (AREA)
- Mathematical Physics (AREA)
- Pure & Applied Mathematics (AREA)
- Theoretical Computer Science (AREA)
- Acoustics & Sound (AREA)
- Signal Processing (AREA)
- Stereophonic System (AREA)
Abstract
Applications Claiming Priority (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
SE0402772A SE528706C2 (sv) | 2004-11-12 | 2004-11-12 | Anordning och processmetod för surroundljud |
SE0402772-8 | 2004-11-12 |
Publications (2)
Publication Number | Publication Date |
---|---|
WO2006052188A1 true WO2006052188A1 (fr) | 2006-05-18 |
WO2006052188B1 WO2006052188B1 (fr) | 2006-12-28 |
Family
ID=33488235
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
PCT/SE2005/001659 WO2006052188A1 (fr) | 2004-11-12 | 2005-11-07 | Procede et dispositif de traitement du son enveloppant |
Country Status (2)
Country | Link |
---|---|
SE (1) | SE528706C2 (fr) |
WO (1) | WO2006052188A1 (fr) |
Cited By (6)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
EP2469741A1 (fr) * | 2010-12-21 | 2012-06-27 | Thomson Licensing | Procédé et appareil pour coder et décoder des trames successives d'une représentation d'ambiophonie d'un champ sonore bi et tridimensionnel |
US9332372B2 (en) | 2010-06-07 | 2016-05-03 | International Business Machines Corporation | Virtual spatial sound scape |
US9830918B2 (en) | 2013-07-05 | 2017-11-28 | Dolby International Ab | Enhanced soundfield coding using parametric component generation |
CN109756683A (zh) * | 2017-11-02 | 2019-05-14 | 深圳市裂石影音科技有限公司 | 全景音视频录制方法、装置、存储介质和计算机设备 |
WO2020087678A1 (fr) * | 2018-11-01 | 2020-05-07 | 华南理工大学 | Procédé de lecture virtuelle d'environnement sonore dans un espace tridimensionnel à canaux multiples |
US20220030370A1 (en) * | 2013-03-12 | 2022-01-27 | Dolby Laboratories Licensing Corporation | Method of rendering one or more captured audio soundfields to a listener |
Citations (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US5671287A (en) * | 1992-06-03 | 1997-09-23 | Trifield Productions Limited | Stereophonic signal processor |
US6072878A (en) * | 1997-09-24 | 2000-06-06 | Sonic Solutions | Multi-channel surround sound mastering and reproduction techniques that preserve spatial harmonics |
US20040054689A1 (en) * | 2002-02-25 | 2004-03-18 | Oak Technology, Inc. | Transcoding media system |
-
2004
- 2004-11-12 SE SE0402772A patent/SE528706C2/sv not_active IP Right Cessation
-
2005
- 2005-11-07 WO PCT/SE2005/001659 patent/WO2006052188A1/fr active Application Filing
Patent Citations (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US5671287A (en) * | 1992-06-03 | 1997-09-23 | Trifield Productions Limited | Stereophonic signal processor |
US6072878A (en) * | 1997-09-24 | 2000-06-06 | Sonic Solutions | Multi-channel surround sound mastering and reproduction techniques that preserve spatial harmonics |
US20040054689A1 (en) * | 2002-02-25 | 2004-03-18 | Oak Technology, Inc. | Transcoding media system |
Cited By (23)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US9332372B2 (en) | 2010-06-07 | 2016-05-03 | International Business Machines Corporation | Virtual spatial sound scape |
KR20180115652A (ko) * | 2010-12-21 | 2018-10-23 | 돌비 인터네셔널 에이비 | 2차원 또는 3차원 음장의 앰비소닉스 표현의 연속 프레임을 인코딩 및 디코딩하는 방법 및 장치 |
JP2012133366A (ja) * | 2010-12-21 | 2012-07-12 | Thomson Licensing | 二次元または三次元音場のアンビソニックス表現の一連のフレームをエンコードおよびデコードする方法および装置 |
KR102010914B1 (ko) | 2010-12-21 | 2019-08-14 | 돌비 인터네셔널 에이비 | 2차원 또는 3차원 음장의 앰비소닉스 표현의 연속 프레임을 인코딩 및 디코딩하는 방법 및 장치 |
EP2469742A3 (fr) * | 2010-12-21 | 2012-09-05 | Thomson Licensing | Procédé et appareil de codage et de décodage de cadres successifs d'une représentation d'ambiophonie de champ sonore bi ou tridimensionnel |
KR20120070521A (ko) * | 2010-12-21 | 2012-06-29 | 톰슨 라이센싱 | 2차원 또는 3차원 음장의 앰비소닉스 표현의 연속 프레임을 인코딩 및 디코딩하는 방법 및 장치 |
US9397771B2 (en) | 2010-12-21 | 2016-07-19 | Dolby Laboratories Licensing Corporation | Method and apparatus for encoding and decoding successive frames of an ambisonics representation of a 2- or 3-dimensional sound field |
JP2016224472A (ja) * | 2010-12-21 | 2016-12-28 | ドルビー・インターナショナル・アーベー | 二次元または三次元音場のアンビソニックス表現の一連のフレームをエンコードおよびデコードする方法および装置 |
JP7342091B2 (ja) | 2010-12-21 | 2023-09-11 | ドルビー・インターナショナル・アーベー | 二次元または三次元音場のアンビソニックス表現の一連のフレームをエンコードおよびデコードする方法および装置 |
KR20190096318A (ko) * | 2010-12-21 | 2019-08-19 | 돌비 인터네셔널 에이비 | 2차원 또는 3차원 음장의 앰비소닉스 표현의 연속 프레임을 인코딩 및 디코딩하는 방법 및 장치 |
EP2469741A1 (fr) * | 2010-12-21 | 2012-06-27 | Thomson Licensing | Procédé et appareil pour coder et décoder des trames successives d'une représentation d'ambiophonie d'un champ sonore bi et tridimensionnel |
JP2022016544A (ja) * | 2010-12-21 | 2022-01-21 | ドルビー・インターナショナル・アーベー | 二次元または三次元音場のアンビソニックス表現の一連のフレームをエンコードおよびデコードする方法および装置 |
CN102547549A (zh) * | 2010-12-21 | 2012-07-04 | 汤姆森特许公司 | 编码解码2或3维声场环绕声表示的连续帧的方法和装置 |
KR101909573B1 (ko) | 2010-12-21 | 2018-10-19 | 돌비 인터네셔널 에이비 | 2차원 또는 3차원 음장의 앰비소닉스 표현의 연속 프레임을 인코딩 및 디코딩하는 방법 및 장치 |
KR102131748B1 (ko) | 2010-12-21 | 2020-07-08 | 돌비 인터네셔널 에이비 | 2차원 또는 3차원 음장의 앰비소닉스 표현의 연속 프레임을 인코딩 및 디코딩하는 방법 및 장치 |
JP2020079961A (ja) * | 2010-12-21 | 2020-05-28 | ドルビー・インターナショナル・アーベー | 二次元または三次元音場のアンビソニックス表現の一連のフレームをエンコードおよびデコードする方法および装置 |
US20220030370A1 (en) * | 2013-03-12 | 2022-01-27 | Dolby Laboratories Licensing Corporation | Method of rendering one or more captured audio soundfields to a listener |
US11770666B2 (en) * | 2013-03-12 | 2023-09-26 | Dolby Laboratories Licensing Corporation | Method of rendering one or more captured audio soundfields to a listener |
US9830918B2 (en) | 2013-07-05 | 2017-11-28 | Dolby International Ab | Enhanced soundfield coding using parametric component generation |
CN109756683B (zh) * | 2017-11-02 | 2024-06-04 | 深圳市裂石影音科技有限公司 | 全景音视频录制方法、装置、存储介质和计算机设备 |
CN109756683A (zh) * | 2017-11-02 | 2019-05-14 | 深圳市裂石影音科技有限公司 | 全景音视频录制方法、装置、存储介质和计算机设备 |
WO2020087678A1 (fr) * | 2018-11-01 | 2020-05-07 | 华南理工大学 | Procédé de lecture virtuelle d'environnement sonore dans un espace tridimensionnel à canaux multiples |
US11962995B2 (en) | 2018-11-01 | 2024-04-16 | South China University Of Technology | Virtual playback method for surround-sound in multi-channel three-dimensional space |
Also Published As
Publication number | Publication date |
---|---|
SE0402772D0 (sv) | 2004-11-12 |
SE528706C2 (sv) | 2007-01-30 |
SE0402772L (sv) | 2006-07-13 |
WO2006052188B1 (fr) | 2006-12-28 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
JP6510021B2 (ja) | オーディオ装置及びそのオーディオ提供方法 | |
US9584912B2 (en) | Spatial audio rendering and encoding | |
Theile | Multichannel natural music recording based on psychoacoustic principles | |
US8705750B2 (en) | Device and method for converting spatial audio signal | |
EP2279628B1 (fr) | Génération d'un son ambiophonique à partir d'un ensemble microphone | |
US20080298610A1 (en) | Parameter Space Re-Panning for Spatial Audio | |
US8605914B2 (en) | Nonlinear filter for separation of center sounds in stereophonic audio | |
WO2012025580A1 (fr) | Procédé et dispositif de reproduction de champ sonore améliorée de signaux d'entrée audio spatialement codés | |
EP3895451B1 (fr) | Procédé et appareil de traitement d'un signal stéréo | |
Pulkki et al. | First‐Order Directional Audio Coding (DirAC) | |
CN102461213A (zh) | 中心声道呈现 | |
WO2006052188A1 (fr) | Procede et dispositif de traitement du son enveloppant | |
Politis et al. | Parametric spatial audio effects | |
WO2017165968A1 (fr) | Système et procédé pour créer un audio binaural tridimensionnel à partir de sources sonores stéréo, mono et multicanaux | |
McCormack et al. | Parametric spatial audio effects based on the multi-directional decomposition of ambisonic sound scenes | |
Kim | Height channels | |
Kim et al. | Multi-channel audio recording based on superdirective beamforming for portable multimedia recording devices | |
Theile | Natural 5.1 music recording based on psychoacoustic principals | |
US20200045419A1 (en) | Stereo unfold technology | |
Pulkki | Evaluating spatial sound with binaural auditory model | |
US20240056735A1 (en) | Stereo headphone psychoacoustic sound localization system and method for reconstructing stereo psychoacoustic sound signals using same | |
Pfanzagl-Cardone | Comparative 3D Audio Microphone Array Tests | |
WO2024081957A1 (fr) | Traitement d'externalisation binaurale | |
Politis et al. | Spatial audio coding with spaced microphone arrays for music recording and reproduction. | |
Pekonen | Microphone techniques for spatial sound |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
AK | Designated states |
Kind code of ref document: A1 Designated state(s): AE AG AL AM AT AU AZ BA BB BG BR BW BY BZ CA CH CN CO CR CU CZ DE DK DM DZ EC EE EG ES FI GB GD GE GH GM HR HU ID IL IN IS JP KE KG KM KN KP KR KZ LC LK LR LS LT LU LV LY MA MD MG MK MN MW MX MZ NA NG NI NO NZ OM PG PH PL PT RO RU SC SD SE SG SK SL SM SY TJ TM TN TR TT TZ UA UG US UZ VC VN YU ZA ZM ZW |
|
AL | Designated countries for regional patents |
Kind code of ref document: A1 Designated state(s): BW GH GM KE LS MW MZ NA SD SL SZ TZ UG ZM ZW AM AZ BY KG KZ MD RU TJ TM AT BE BG CH CY CZ DE DK EE ES FI FR GB GR HU IE IS IT LT LU LV MC NL PL PT RO SE SI SK TR BF BJ CF CG CI CM GA GN GQ GW ML MR NE SN TD TG |
|
121 | Ep: the epo has been informed by wipo that ep was designated in this application | ||
122 | Ep: pct application non-entry in european phase |
Ref document number: 05801475 Country of ref document: EP Kind code of ref document: A1 |