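CN104995926B - Method and apparatus for determining directions of uncorrelated sound sources in a higher order ambisonics representation of a sound field - Google Patents
Method and apparatus for determining directions of uncorrelated sound sources in a higher order ambisonics representation of a sound field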
- Publication number
- CN104995926B (application CN201480008017.XA)
- Authority
- CN
- China
- Prior art keywords
- dominant
- sound source
- hoa
- time frame
- source
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Active
Classifications
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04S—STEREOPHONIC SYSTEMS
- H04S3/00—Systems employing more than two channels, e.g. quadraphonic
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L19/008—Multichannel audio signal coding or decoding using interchannel correlation to reduce redundancy, e.g. joint-stereo, intensity-coding or matrixing
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L21/00—Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
- G10L21/02—Speech enhancement, e.g. noise reduction or echo cancellation
- G10L21/0272—Voice signal separating
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04S—STEREOPHONIC SYSTEMS
- H04S2420/00—Techniques used in stereophonic systems covered by H04S but not provided for in its groups
- H04S2420/11—Application of ambisonics in stereophonic audio systems
Landscapes
- Engineering & Computer Science (AREA)
- Physics & Mathematics (AREA)
- Acoustics & Sound (AREA)
- Signal Processing (AREA)
- Multimedia (AREA)
- Health & Medical Sciences (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Human Computer Interaction (AREA)
- Computational Linguistics (AREA)
- Mathematical Physics (AREA)
- Quality & Reliability (AREA)
- Stereophonic System (AREA)
- General Physics & Mathematics (AREA)
- Radar, Positioning & Navigation (AREA)
- Remote Sensing (AREA)
- Measurement Of Mechanical Vibrations Or Ultrasonic Waves (AREA)
Applications Claiming Priority (3)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| EP13305156.5 | 2013-02-08 | | |
| EP20130305156 EP2765791A1 (en) | 2013-02-08 | 2013-02-08 | Method and apparatus for determining directions of uncorrelated sound sources in a higher order ambisonics representation of a sound field |
| PCT/EP2014/052479 WO2014122287A1 (en) | 2013-02-08 | 2014-02-07 | Method and apparatus for determining directions of uncorrelated sound sources in a higher order ambisonics representation of a sound field |
Publications (2)
| Publication Number | Publication Date |
|---|---|
| CN104995926A (zh) | 2015-10-21 |
| CN104995926B (zh) | 2017-12-26 |
Family
ID=47780000
Family Applications (1)
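| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| CN201480008017.XA Active CN104995926B (zh) | 2013-02-08 | 2014-02-07 | Method and apparatus for determining directions of uncorrelated sound sources in a higher order ambisonics representation of a sound field |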
Country Status (7)
| Country | Link |
|---|---|
| US (1) | US9622008B2 |
| EP (2) | EP2765791A1 |
| JP (1) | JP6374882B2 |
| KR (1) | KR102220187B1 |
| CN (1) | CN104995926B |
| TW (1) | TWI647961B |
| WO (1) | WO2014122287A1 |
Families Citing this family (25)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| EP2665208A1 (en) * | 2012-05-14 | 2013-11-20 | Thomson Licensing | Method and apparatus for compressing and decompressing a Higher Order Ambisonics signal representation |
| EP2743922A1 (en) | 2012-12-12 | 2014-06-18 | Thomson Licensing | Method and apparatus for compressing and decompressing a higher order ambisonics representation for a sound field |
| EP2800401A1 (en) | 2013-04-29 | 2014-11-05 | Thomson Licensing | Method and Apparatus for compressing and decompressing a Higher Order Ambisonics representation |
| US9495968B2 (en) | 2013-05-29 | 2016-11-15 | Qualcomm Incorporated | Identifying sources from which higher order ambisonic audio data is generated |
| US9466305B2 (en) | 2013-05-29 | 2016-10-11 | Qualcomm Incorporated | Performing positional analysis to code spherical harmonic coefficients |
| US9502045B2 (en) | 2014-01-30 | 2016-11-22 | Qualcomm Incorporated | Coding independent frames of ambient higher-order ambisonic coefficients |
| US9922656B2 (en) | 2014-01-30 | 2018-03-20 | Qualcomm Incorporated | Transitioning of ambient higher-order ambisonic coefficients |
| US9852737B2 (en) | 2014-05-16 | 2017-12-26 | Qualcomm Incorporated | Coding vectors decomposed from higher-order ambisonics audio signals |
| US9620137B2 (en) | 2014-05-16 | 2017-04-11 | Qualcomm Incorporated | Determining between scalar and vector quantization in higher order ambisonic coefficients |
| US10770087B2 (en) | 2014-05-16 | 2020-09-08 | Qualcomm Incorporated | Selecting codebooks for coding vectors decomposed from higher-order ambisonic audio signals |
| US9747910B2 (en) | 2014-09-26 | 2017-08-29 | Qualcomm Incorporated | Switching between predictive and non-predictive quantization techniques in a higher order ambisonics (HOA) framework |
| EP3357259B1 (en) | 2015-09-30 | 2020-09-23 | Dolby International AB | Method and apparatus for generating 3d audio content from two-channel stereo content |
| CN105516875B (zh) * | 2015-12-02 | 2020-03-06 | 上海航空电器有限公司 | Device for rapidly measuring the spatial angular resolution of a virtual sound generating device |
| GR1008860B (el) * | 2015-12-29 | 2016-09-27 | Κωνσταντινος Δημητριου Σπυροπουλος | System for separating speakers from audiovisual data |
| US10089063B2 (en) | 2016-08-10 | 2018-10-02 | Qualcomm Incorporated | Multimedia device for processing spatialized audio based on movement |
| JP6723120B2 (ja) * | 2016-09-05 | 2020-07-15 | 本田技研工業株式会社 | Acoustic processing device and acoustic processing method |
| CN107147975B (zh) * | 2017-04-26 | 2019-05-14 | 北京大学 | Ambisonics matching projection decoding method for irregular loudspeaker placement |
| JP7224302B2 (ja) | 2017-05-09 | 2023-02-17 | ドルビー ラボラトリーズ ライセンシング コーポレイション | Processing of a multi-channel spatial audio format input signal |
| US10405126B2 (en) | 2017-06-30 | 2019-09-03 | Qualcomm Incorporated | Mixed-order ambisonics (MOA) audio data for computer-mediated reality systems |
| FR3074584A1 (fr) * | 2017-12-05 | 2019-06-07 | Orange | Processing of data of a video sequence for zooming in on a speaker detected in the sequence |
| CN110751956B (zh) * | 2019-09-17 | 2022-04-26 | 北京时代拓灵科技有限公司 | Immersive audio rendering method and system |
| CN111933182B (zh) * | 2020-08-07 | 2024-04-19 | 抖音视界有限公司 | Sound source tracking method, apparatus, device and storage medium |
| CN112019971B (zh) * | 2020-08-21 | 2022-03-22 | 安声(重庆)电子科技有限公司 | Sound field construction method and apparatus, electronic device and computer-readable storage medium |
| US11743670B2 (en) | 2020-12-18 | 2023-08-29 | Qualcomm Incorporated | Correlation-based rendering with multiple distributed streams accounting for an occlusion for six degree of freedom applications |
| CN115038027B (zh) * | 2021-03-05 | 2023-07-07 | 华为技术有限公司 | Method and apparatus for obtaining HOA coefficients |
Citations (3)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
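| CN1659926A (zh) * | 2002-05-07 | 2005-08-24 | 雷米·布鲁诺 | Method and system for representing a sound field |
| CN1849844A (zh) * | 2003-07-31 | 2006-10-18 | 特因诺夫音频公司 | System and method for determining a representation of a sound field |
| CN102089634A (zh) * | 2008-07-08 | 2011-06-08 | 布鲁尔及凯尔声音及振动测量公司 | Reconstructing an acoustic field |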
Family Cites Families (12)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| GB9915398D0 (en) | 1999-07-02 | 1999-09-01 | Baker Matthew J | Magnetic particles |
| FR2801108B1 (fr) | 1999-11-16 | 2002-03-01 | Maxmat S A | Chemical or biochemical analyzer with reaction temperature regulation |
| ES2690164T3 (es) * | 2009-06-25 | 2018-11-19 | Dts Licensing Limited | Device and method for converting a spatial audio signal |
| EP2486561B1 (en) * | 2009-10-07 | 2016-03-30 | The University Of Sydney | Reconstruction of a recorded sound field |
| ES2472456T3 (es) | 2010-03-26 | 2014-07-01 | Thomson Licensing | Method and device for decoding an audio sound field representation for audio playback |
| WO2012025580A1 (en) * | 2010-08-27 | 2012-03-01 | Sonicemotion Ag | Method and device for enhanced sound field reproduction of spatially encoded audio input signals |
| EP2450880A1 (en) * | 2010-11-05 | 2012-05-09 | Thomson Licensing | Data structure for Higher Order Ambisonics audio data |
| EP2469741A1 (en) * | 2010-12-21 | 2012-06-27 | Thomson Licensing | Method and apparatus for encoding and decoding successive frames of an ambisonics representation of a 2- or 3-dimensional sound field |
| EP2541547A1 (en) * | 2011-06-30 | 2013-01-02 | Thomson Licensing | Method and apparatus for changing the relative positions of sound objects contained within a higher-order ambisonics representation |
| EP2665208A1 (en) | 2012-05-14 | 2013-11-20 | Thomson Licensing | Method and apparatus for compressing and decompressing a Higher Order Ambisonics signal representation |
| EP2738962A1 (en) | 2012-11-29 | 2014-06-04 | Thomson Licensing | Method and apparatus for determining dominant sound source directions in a higher order ambisonics representation of a sound field |
| US9736609B2 (en) * | 2013-02-07 | 2017-08-15 | Qualcomm Incorporated | Determining renderers for spherical harmonic coefficients |
2013
- 2013-02-08: EP application EP20130305156, published as EP2765791A1, status: Withdrawn
2014
- 2014-02-07: KR application KR1020157021230A, published as KR102220187B1, status: Active
- 2014-02-07: US application US14/766,739, published as US9622008B2, status: Active
- 2014-02-07: WO application PCT/EP2014/052479, published as WO2014122287A1, status: Ceased
- 2014-02-07: EP application EP14703102.5A, published as EP2954700B1, status: Active
- 2014-02-07: JP application JP2015556516A, published as JP6374882B2, status: Active
- 2014-02-07: CN application CN201480008017.XA, published as CN104995926B, status: Active
- 2014-02-10: TW application TW103104224A, published as TWI647961B, status: Active
Also Published As
| Publication number | Publication date |
|---|---|
| JP2016509812A (ja) | 2016-03-31 |
| EP2765791A1 (en) | 2014-08-13 |
| WO2014122287A1 (en) | 2014-08-14 |
| TW201448616A (zh) | 2014-12-16 |
| KR102220187B1 (ko) | 2021-02-25 |
| EP2954700B1 (en) | 2018-03-07 |
| JP6374882B2 (ja) | 2018-08-15 |
| KR20150115779A (ko) | 2015-10-14 |
| CN104995926A (zh) | 2015-10-21 |
| TWI647961B (zh) | 2019-01-11 |
| US20150373471A1 (en) | 2015-12-24 |
| EP2954700A1 (en) | 2015-12-16 |
| US9622008B2 (en) | 2017-04-11 |
Similar Documents
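| Publication | Title | Publication Date |
|---|---|---|
| CN104995926B (zh) | Method and apparatus for determining directions of uncorrelated sound sources in a higher order ambisonics representation of a sound field | |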
| US11922965B2 (en) | Direction of arrival estimation apparatus, model learning apparatus, direction of arrival estimation method, model learning method, and program | |
| US9689959B2 (en) | Method, apparatus and computer program product for determining the location of a plurality of speech sources | |
| Li et al. | Online localization and tracking of multiple moving speakers in reverberant environments | |
| Lima et al. | A volumetric SRP with refinement step for sound source localization | |
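| JP2007523514A (ja) | Adaptive beamformer, sidelobe canceller, method, apparatus, and computer program | |
| JP2017503388A (ja) | Extraction of reverberant sound using a microphone array | |
| WO2016119388A1 (zh) | Method and apparatus for constructing a focused covariance matrix based on a speech signal | |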
| Hosseini et al. | Time difference of arrival estimation of sound source using cross correlation and modified maximum likelihood weighting function | |
| Dang et al. | An iteratively reweighted steered response power approach to multisource localization using a distributed microphone network | |
| Tengan et al. | Multi-source direction-of-arrival estimation using steered response power and group-sparse optimization | |
| Krause et al. | Data diversity for improving DNN-based localization of concurrent sound events | |
| JP2019054344A (ja) | Filter coefficient calculation device, sound pickup device, method therefor, and program | |
| Toma et al. | Efficient Detection and Localization of Acoustic Sources with a low complexity CNN network and the Diagonal Unloading Beamforming | |
| Dehghan Firoozabadi et al. | A novel nested circular microphone array and subband processing-based system for counting and DOA estimation of multiple simultaneous speakers | |
| Günther et al. | Microphone utility estimation in acoustic sensor networks using single-channel signal features | |
| CN109901113B (zh) | Speech signal localization method, apparatus and system for complex environments | |
| Kienegger et al. | Adaptive Rotary Steering with Joint Autoregression for Robust Extraction of Closely Moving Speakers in Dynamic Scenarios | |
| Li et al. | A cascaded multiple-speaker localization and tracking system | |
| Firoozabadi et al. | Multi-speaker localization by central and lateral microphone arrays based on the combination of 2D-SRP and subband GEVD algorithms | |
| Mosayyebpour et al. | Time delay estimation via minimum-phase and all-pass component processing | |
| Ayllón et al. | Real-time multiple doa estimation of speech sources in wireless acoustic sensor networks | |
| Wang et al. | IPDnet2: an efficient and improved inter-channel phase difference estimation network for sound source localization | |
| de Groot et al. | Loudspeaker Beamforming to Enhance Speech Recognition Performance of Voice Driven Applications | |
| Mitchell et al. | Improved direction of arrival estimations with a wearable microphone array for dynamic environments by reliability weighting | |
Legal Events
| Date | Code | Title | Description |
|---|---|---|---|
| | C06 | Publication | |
| | PB01 | Publication | |
| | C10 | Entry into substantive examination | |
| | SE01 | Entry into force of request for substantive examination | |
| | C41 | Transfer of patent application or patent right or utility model | |
| | TA01 | Transfer of patent application right | Effective date of registration: 2016-07-13. Address after: Amsterdam. Applicant after: Dolby International AB. Address before: Issy-les-Moulineaux, France. Applicant before: Thomson Licensing SA. |
| | GR01 | Patent grant | |