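TWI647961B - Method and apparatus for determining directions of uncorrelated sound sources in a higher order ambisonics representation of a sound field - Google Patents
Method and apparatus for determining directions of uncorrelated sound sources in a higher order ambisonics representation of a sound field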
- Publication number
- TWI647961B
- Authority
- TW
- Taiwan
- Prior art keywords
- sound source
- time frame
- dominant
- hoa
- dominant sound
- Prior art date
Classifications
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04S—STEREOPHONIC SYSTEMS
- H04S3/00—Systems employing more than two channels, e.g. quadraphonic
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L19/008—Multichannel audio signal coding or decoding using interchannel correlation to reduce redundancy, e.g. joint-stereo, intensity-coding or matrixing
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L21/00—Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
- G10L21/02—Speech enhancement, e.g. noise reduction or echo cancellation
- G10L21/0272—Voice signal separating
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04S—STEREOPHONIC SYSTEMS
- H04S2420/00—Techniques used in stereophonic systems covered by H04S but not provided for in its groups
- H04S2420/11—Application of ambisonics in stereophonic audio systems
Landscapes
- Engineering & Computer Science (AREA)
- Physics & Mathematics (AREA)
- Acoustics & Sound (AREA)
- Signal Processing (AREA)
- Multimedia (AREA)
- Health & Medical Sciences (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Human Computer Interaction (AREA)
- Computational Linguistics (AREA)
- Mathematical Physics (AREA)
- Quality & Reliability (AREA)
- Stereophonic System (AREA)
- General Physics & Mathematics (AREA)
- Radar, Positioning & Navigation (AREA)
- Remote Sensing (AREA)
- Measurement Of Mechanical Vibrations Or Ultrasonic Waves (AREA)
Applications Claiming Priority (2)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| 13305156.5 | 2013-02-08 | | |
| EP20130305156 EP2765791A1 (en) | 2013-02-08 | 2013-02-08 | Method and apparatus for determining directions of uncorrelated sound sources in a higher order ambisonics representation of a sound field |
Publications (2)
| Publication Number | Publication Date |
|---|---|
| TW201448616A (zh) | 2014-12-16 |
| TWI647961B (zh) | 2019-01-11 |
Family
ID=47780000
Family Applications (1)
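| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| TW103104224A TWI647961B (zh) | 2013-02-08 | 2014-02-10 | Method and apparatus for determining directions of uncorrelated sound sources in a higher order ambisonics representation of a sound field |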
Country Status (7)
| Country | Link |
|---|---|
| US (1) | US9622008B2 |
| EP (2) | EP2765791A1 |
| JP (1) | JP6374882B2 |
| KR (1) | KR102220187B1 |
| CN (1) | CN104995926B |
| TW (1) | TWI647961B |
| WO (1) | WO2014122287A1 |
Families Citing this family (25)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| EP2665208A1 (en) * | 2012-05-14 | 2013-11-20 | Thomson Licensing | Method and apparatus for compressing and decompressing a Higher Order Ambisonics signal representation |
| EP2743922A1 (en) | 2012-12-12 | 2014-06-18 | Thomson Licensing | Method and apparatus for compressing and decompressing a higher order ambisonics representation for a sound field |
| EP2800401A1 (en) | 2013-04-29 | 2014-11-05 | Thomson Licensing | Method and Apparatus for compressing and decompressing a Higher Order Ambisonics representation |
| US9495968B2 (en) | 2013-05-29 | 2016-11-15 | Qualcomm Incorporated | Identifying sources from which higher order ambisonic audio data is generated |
| US9466305B2 (en) | 2013-05-29 | 2016-10-11 | Qualcomm Incorporated | Performing positional analysis to code spherical harmonic coefficients |
| US9502045B2 (en) | 2014-01-30 | 2016-11-22 | Qualcomm Incorporated | Coding independent frames of ambient higher-order ambisonic coefficients |
| US9922656B2 (en) | 2014-01-30 | 2018-03-20 | Qualcomm Incorporated | Transitioning of ambient higher-order ambisonic coefficients |
| US9852737B2 (en) | 2014-05-16 | 2017-12-26 | Qualcomm Incorporated | Coding vectors decomposed from higher-order ambisonics audio signals |
| US9620137B2 (en) | 2014-05-16 | 2017-04-11 | Qualcomm Incorporated | Determining between scalar and vector quantization in higher order ambisonic coefficients |
| US10770087B2 (en) | 2014-05-16 | 2020-09-08 | Qualcomm Incorporated | Selecting codebooks for coding vectors decomposed from higher-order ambisonic audio signals |
| US9747910B2 (en) | 2014-09-26 | 2017-08-29 | Qualcomm Incorporated | Switching between predictive and non-predictive quantization techniques in a higher order ambisonics (HOA) framework |
| EP3357259B1 (en) | 2015-09-30 | 2020-09-23 | Dolby International AB | Method and apparatus for generating 3d audio content from two-channel stereo content |
| CN105516875B (zh) * | 2015-12-02 | 2020-03-06 | 上海航空电器有限公司 | Device for rapidly measuring the spatial angular resolution of virtual sound generating equipment |
| GR1008860B (el) * | 2015-12-29 | 2016-09-27 | Κωνσταντινος Δημητριου Σπυροπουλος | System for separating speakers from audiovisual data |
| US10089063B2 (en) | 2016-08-10 | 2018-10-02 | Qualcomm Incorporated | Multimedia device for processing spatialized audio based on movement |
| JP6723120B2 (ja) * | 2016-09-05 | 2020-07-15 | 本田技研工業株式会社 | Sound processing device and sound processing method |
| CN107147975B (zh) * | 2017-04-26 | 2019-05-14 | 北京大学 | Ambisonics matching-projection decoding method for irregular loudspeaker placement |
| JP7224302B2 (ja) | 2017-05-09 | 2023-02-17 | ドルビー ラボラトリーズ ライセンシング コーポレイション | Processing of a multi-channel spatial audio format input signal |
| US10405126B2 (en) | 2017-06-30 | 2019-09-03 | Qualcomm Incorporated | Mixed-order ambisonics (MOA) audio data for computer-mediated reality systems |
| FR3074584A1 (fr) * | 2017-12-05 | 2019-06-07 | Orange | Processing of data of a video sequence for zooming on a speaker detected in the sequence |
| CN110751956B (zh) * | 2019-09-17 | 2022-04-26 | 北京时代拓灵科技有限公司 | Immersive audio rendering method and system |
| CN111933182B (zh) * | 2020-08-07 | 2024-04-19 | 抖音视界有限公司 | Sound source tracking method, apparatus, device, and storage medium |
| CN112019971B (zh) * | 2020-08-21 | 2022-03-22 | 安声(重庆)电子科技有限公司 | Sound field construction method and apparatus, electronic device, and computer-readable storage medium |
| US11743670B2 (en) | 2020-12-18 | 2023-08-29 | Qualcomm Incorporated | Correlation-based rendering with multiple distributed streams accounting for an occlusion for six degree of freedom applications |
| CN115038027B (zh) * | 2021-03-05 | 2023-07-07 | 华为技术有限公司 | Method and apparatus for obtaining HOA coefficients |
Citations (3)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US20100329466A1 (en) * | 2009-06-25 | 2010-12-30 | Berges Allmenndigitale Radgivningstjeneste | Device and method for converting spatial audio signal |
| WO2011041834A1 (en) * | 2009-10-07 | 2011-04-14 | The University Of Sydney | Reconstruction of a recorded sound field |
| EP2469741A1 (en) * | 2010-12-21 | 2012-06-27 | Thomson Licensing | Method and apparatus for encoding and decoding successive frames of an ambisonics representation of a 2- or 3-dimensional sound field |
Family Cites Families (12)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| GB9915398D0 (en) | 1999-07-02 | 1999-09-01 | Baker Matthew J | Magnetic particles |
| FR2801108B1 (fr) | 1999-11-16 | 2002-03-01 | Maxmat S A | Chemical or biochemical analyzer with reaction temperature control |
| FR2839565B1 (fr) * | 2002-05-07 | 2004-11-19 | Remy Henri Denis Bruno | Method and system for representing an acoustic field |
| FR2858403B1 (fr) * | 2003-07-31 | 2005-11-18 | Remy Henri Denis Bruno | System and method for determining a representation of an acoustic field |
| EP2297557B1 (en) * | 2008-07-08 | 2013-10-30 | Brüel & Kjaer Sound & Vibration Measurement A/S | Reconstructing an acoustic field |
| ES2472456T3 (es) | 2010-03-26 | 2014-07-01 | Thomson Licensing | Method and device for decoding an audio sound field representation for audio playback |
| WO2012025580A1 (en) * | 2010-08-27 | 2012-03-01 | Sonicemotion Ag | Method and device for enhanced sound field reproduction of spatially encoded audio input signals |
| EP2450880A1 (en) * | 2010-11-05 | 2012-05-09 | Thomson Licensing | Data structure for Higher Order Ambisonics audio data |
| EP2541547A1 (en) * | 2011-06-30 | 2013-01-02 | Thomson Licensing | Method and apparatus for changing the relative positions of sound objects contained within a higher-order ambisonics representation |
| EP2665208A1 (en) | 2012-05-14 | 2013-11-20 | Thomson Licensing | Method and apparatus for compressing and decompressing a Higher Order Ambisonics signal representation |
| EP2738962A1 (en) | 2012-11-29 | 2014-06-04 | Thomson Licensing | Method and apparatus for determining dominant sound source directions in a higher order ambisonics representation of a sound field |
| US9736609B2 (en) * | 2013-02-07 | 2017-08-15 | Qualcomm Incorporated | Determining renderers for spherical harmonic coefficients |
2013
- 2013-02-08: EP application EP20130305156, patent EP2765791A1 (en); not active, withdrawn
2014
- 2014-02-07: KR application KR1020157021230A, patent KR102220187B1 (ko); active
- 2014-02-07: US application US14/766,739, patent US9622008B2 (en); active
- 2014-02-07: WO application PCT/EP2014/052479, patent WO2014122287A1 (en); not active, ceased
- 2014-02-07: EP application EP14703102.5A, patent EP2954700B1 (en); active
- 2014-02-07: JP application JP2015556516A, patent JP6374882B2 (ja); active
- 2014-02-07: CN application CN201480008017.XA, patent CN104995926B (zh); active
- 2014-02-10: TW application TW103104224A, patent TWI647961B (zh); active
Patent Citations (4)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US20100329466A1 (en) * | 2009-06-25 | 2010-12-30 | Berges Allmenndigitale Radgivningstjeneste | Device and method for converting spatial audio signal |
| WO2011041834A1 (en) * | 2009-10-07 | 2011-04-14 | The University Of Sydney | Reconstruction of a recorded sound field |
| EP2469741A1 (en) * | 2010-12-21 | 2012-06-27 | Thomson Licensing | Method and apparatus for encoding and decoding successive frames of an ambisonics representation of a 2- or 3-dimensional sound field |
| EP2469742A2 (en) * | 2010-12-21 | 2012-06-27 | Thomson Licensing | Method and apparatus for encoding and decoding successive frames of an ambisonics representation of a 2- or 3-dimensional sound field |
Also Published As
| Publication number | Publication date |
|---|---|
| JP2016509812A (ja) | 2016-03-31 |
| EP2765791A1 (en) | 2014-08-13 |
| WO2014122287A1 (en) | 2014-08-14 |
| TW201448616A (zh) | 2014-12-16 |
| KR102220187B1 (ko) | 2021-02-25 |
| EP2954700B1 (en) | 2018-03-07 |
| JP6374882B2 (ja) | 2018-08-15 |
| KR20150115779A (ko) | 2015-10-14 |
| CN104995926A (zh) | 2015-10-21 |
| US20150373471A1 (en) | 2015-12-24 |
| EP2954700A1 (en) | 2015-12-16 |
| US9622008B2 (en) | 2017-04-11 |
| CN104995926B (zh) | 2017-12-26 |
Similar Documents
| Publication | Title | Publication Date |
|---|---|---|
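| TWI647961B (zh) | Method and apparatus for determining directions of uncorrelated sound sources in a higher order ambisonics representation of a sound field | |
| JP7158806B2 (ja) | Audio recognition method, method for locating target audio, apparatuses therefor, device, and computer program | |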
| EP2530484B1 (en) | Sound source localization apparatus and method | |
| Yang et al. | SRP-DNN: Learning direct-path phase difference for multiple moving sound source localization | |
| US20040190730A1 (en) | System and process for time delay estimation in the presence of correlated noise and reverberation | |
| KR20180069879A (ko) | Globally optimized least-squares post-filtering for speech enhancement | |
| Traa et al. | Multichannel source separation and tracking with RANSAC and directional statistics | |
| Hosseini et al. | Time difference of arrival estimation of sound source using cross correlation and modified maximum likelihood weighting function | |
| Sharma et al. | Development of a speech separation system using frequency domain blind source separation technique | |
| EP2745293B1 (en) | Signal noise attenuation | |
| Krause et al. | Data diversity for improving DNN-based localization of concurrent sound events | |
| Pirhosseinloo et al. | A new feature set for masking-based monaural speech separation | |
| Zhang et al. | Multi-Target Ensemble Learning for Monaural Speech Separation. | |
| Athanasopoulos et al. | Robust speaker localization for real-world robots | |
| Tourbabin et al. | Direction of arrival estimation in highly reverberant environments using soft time-frequency mask | |
| Yang et al. | A stacked self-attention network for two-dimensional direction-of-arrival estimation in hands-free speech communication | |
| Dehghan Firoozabadi et al. | A novel nested circular microphone array and subband processing-based system for counting and DOA estimation of multiple simultaneous speakers | |
| Wu et al. | Acoustic source tracking in reverberant environment using regional steered response power measurement | |
| CN110838303A (zh) | Speech sound source localization method using a microphone array | |
| Toma et al. | Efficient Detection and Localization of Acoustic Sources with a low complexity CNN network and the Diagonal Unloading Beamforming | |
| Taseska et al. | Minimum Bayes risk signal detection for speech enhancement based on a narrowband DOA model | |
| Firoozabadi et al. | Combination of nested microphone array and subband processing for multiple simultaneous speaker localization | |
| Weisman et al. | Spatial Covariance Matrix Estimation for Reverberant Speech with Application to Speech Enhancement. | |
| Mosayyebpour et al. | Time delay estimation via minimum-phase and all-pass component processing | |
| MESSANA | CNN-based estimation of dereverberated relative harmonics coefficients for localization of acoustic sources |