KR102220187B1 - Method and apparatus for determining directions of uncorrelated sound sources in a higher order ambisonics representation of a sound field
- Publication number: KR102220187B1
- Application number: KR1020157021230A
- Authority: KR (South Korea)
- Prior art keywords: time frame, sound sources, directions, HOA, previous time
- Prior art date
- Legal status: Active (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Classifications
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04S—STEREOPHONIC SYSTEMS
- H04S3/00—Systems employing more than two channels, e.g. quadraphonic
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L19/008—Multichannel audio signal coding or decoding using interchannel correlation to reduce redundancy, e.g. joint-stereo, intensity-coding or matrixing
-
- G—PHYSICS
- G01—MEASURING; TESTING
- G01S—RADIO DIRECTION-FINDING; RADIO NAVIGATION; DETERMINING DISTANCE OR VELOCITY BY USE OF RADIO WAVES; LOCATING OR PRESENCE-DETECTING BY USE OF THE REFLECTION OR RERADIATION OF RADIO WAVES; ANALOGOUS ARRANGEMENTS USING OTHER WAVES
- G01S3/00—Direction-finders for determining the direction from which infrasonic, sonic, ultrasonic or electromagnetic waves, or particle emission, not having a directional significance, are being received
- G01S3/80—Direction-finders for determining the direction from which infrasonic, sonic, ultrasonic or electromagnetic waves, or particle emission, not having a directional significance, are being received using ultrasonic, sonic or infrasonic waves
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L21/00—Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
- G10L21/02—Speech enhancement, e.g. noise reduction or echo cancellation
- G10L21/0272—Voice signal separating
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04S—STEREOPHONIC SYSTEMS
- H04S2420/00—Techniques used stereophonic systems covered by H04S but not provided for in its groups
- H04S2420/11—Application of ambisonics in stereophonic audio systems
Landscapes
- Engineering & Computer Science (AREA)
- Physics & Mathematics (AREA)
- Acoustics & Sound (AREA)
- Signal Processing (AREA)
- Multimedia (AREA)
- Health & Medical Sciences (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Human Computer Interaction (AREA)
- Computational Linguistics (AREA)
- Mathematical Physics (AREA)
- Quality & Reliability (AREA)
- Stereophonic System (AREA)
- General Physics & Mathematics (AREA)
- Radar, Positioning & Navigation (AREA)
- Remote Sensing (AREA)
- Measurement Of Mechanical Vibrations Or Ultrasonic Waves (AREA)
Applications Claiming Priority (3)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| EP13305156.5 | 2013-02-08 | ||
| EP20130305156 EP2765791A1 (en) | 2013-02-08 | 2013-02-08 | Method and apparatus for determining directions of uncorrelated sound sources in a higher order ambisonics representation of a sound field |
| PCT/EP2014/052479 WO2014122287A1 (en) | 2013-02-08 | 2014-02-07 | Method and apparatus for determining directions of uncorrelated sound sources in a higher order ambisonics representation of a sound field |
Publications (2)
| Publication Number | Publication Date |
|---|---|
| KR20150115779A (ko) | 2015-10-14 |
| KR102220187B1 (ko) | 2021-02-25 |
Family
ID=47780000
Family Applications (1)
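| Application Number | Status | Publication | Priority Date | Filing Date | Title |
|---|---|---|---|---|---|
| KR1020157021230A | Active | KR102220187B1 (ko) | 2013-02-08 | 2014-02-07 | Method and apparatus for determining directions of uncorrelated sound sources in a higher order ambisonics representation of a sound field |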
Country Status (7)
| Country | Link |
|---|---|
| US (1) | US9622008B2 |
| EP (2) | EP2765791A1 |
| JP (1) | JP6374882B2 |
| KR (1) | KR102220187B1 |
| CN (1) | CN104995926B |
| TW (1) | TWI647961B |
| WO (1) | WO2014122287A1 |
Families Citing this family (25)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| EP2665208A1 (en) * | 2012-05-14 | 2013-11-20 | Thomson Licensing | Method and apparatus for compressing and decompressing a Higher Order Ambisonics signal representation |
| EP2743922A1 (en) | 2012-12-12 | 2014-06-18 | Thomson Licensing | Method and apparatus for compressing and decompressing a higher order ambisonics representation for a sound field |
| EP2800401A1 (en) | 2013-04-29 | 2014-11-05 | Thomson Licensing | Method and Apparatus for compressing and decompressing a Higher Order Ambisonics representation |
| US9495968B2 (en) | 2013-05-29 | 2016-11-15 | Qualcomm Incorporated | Identifying sources from which higher order ambisonic audio data is generated |
| US9466305B2 (en) | 2013-05-29 | 2016-10-11 | Qualcomm Incorporated | Performing positional analysis to code spherical harmonic coefficients |
| US9502045B2 (en) | 2014-01-30 | 2016-11-22 | Qualcomm Incorporated | Coding independent frames of ambient higher-order ambisonic coefficients |
| US9922656B2 (en) | 2014-01-30 | 2018-03-20 | Qualcomm Incorporated | Transitioning of ambient higher-order ambisonic coefficients |
| US9852737B2 (en) | 2014-05-16 | 2017-12-26 | Qualcomm Incorporated | Coding vectors decomposed from higher-order ambisonics audio signals |
| US9620137B2 (en) | 2014-05-16 | 2017-04-11 | Qualcomm Incorporated | Determining between scalar and vector quantization in higher order ambisonic coefficients |
| US10770087B2 (en) | 2014-05-16 | 2020-09-08 | Qualcomm Incorporated | Selecting codebooks for coding vectors decomposed from higher-order ambisonic audio signals |
| US9747910B2 (en) | 2014-09-26 | 2017-08-29 | Qualcomm Incorporated | Switching between predictive and non-predictive quantization techniques in a higher order ambisonics (HOA) framework |
| EP3357259B1 (en) | 2015-09-30 | 2020-09-23 | Dolby International AB | Method and apparatus for generating 3d audio content from two-channel stereo content |
| CN105516875B (zh) * | 2015-12-02 | 2020-03-06 | 上海航空电器有限公司 | Device for rapidly measuring the spatial angular resolution of a virtual sound generating device |
| GR1008860B (el) * | 2015-12-29 | 2016-09-27 | Κωνσταντινος Δημητριου Σπυροπουλος | System for separating speakers from audiovisual data |
| US10089063B2 (en) | 2016-08-10 | 2018-10-02 | Qualcomm Incorporated | Multimedia device for processing spatialized audio based on movement |
| JP6723120B2 (ja) * | 2016-09-05 | 2020-07-15 | 本田技研工業株式会社 | Acoustic processing device and acoustic processing method |
| CN107147975B (zh) * | 2017-04-26 | 2019-05-14 | 北京大学 | Ambisonics matching-projection decoding method for irregular loudspeaker placement |
| JP7224302B2 (ja) | 2017-05-09 | 2023-02-17 | ドルビー ラボラトリーズ ライセンシング コーポレイション | Processing of multi-channel spatial audio format input signals |
| US10405126B2 (en) | 2017-06-30 | 2019-09-03 | Qualcomm Incorporated | Mixed-order ambisonics (MOA) audio data for computer-mediated reality systems |
| FR3074584A1 (fr) * | 2017-12-05 | 2019-06-07 | Orange | Processing of data of a video sequence for zooming on a speaker detected in the sequence |
| CN110751956B (zh) * | 2019-09-17 | 2022-04-26 | 北京时代拓灵科技有限公司 | Immersive audio rendering method and system |
| CN111933182B (zh) * | 2020-08-07 | 2024-04-19 | 抖音视界有限公司 | Sound source tracking method, apparatus, device, and storage medium |
| CN112019971B (zh) * | 2020-08-21 | 2022-03-22 | 安声(重庆)电子科技有限公司 | Sound field construction method and apparatus, electronic device, and computer-readable storage medium |
| US11743670B2 (en) | 2020-12-18 | 2023-08-29 | Qualcomm Incorporated | Correlation-based rendering with multiple distributed streams accounting for an occlusion for six degree of freedom applications |
| CN115038027B (zh) * | 2021-03-05 | 2023-07-07 | 华为技术有限公司 | Method and apparatus for obtaining HOA coefficients |
Citations (1)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| WO2011117399A1 (en) | 2010-03-26 | 2011-09-29 | Thomson Licensing | Method and device for decoding an audio soundfield representation for audio playback |
Family Cites Families (14)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| GB9915398D0 (en) | 1999-07-02 | 1999-09-01 | Baker Matthew J | Magnetic particles |
| FR2801108B1 (fr) | 1999-11-16 | 2002-03-01 | Maxmat S A | Chemical or biochemical analyzer with regulation of the reaction temperature |
| FR2839565B1 (fr) * | 2002-05-07 | 2004-11-19 | Remy Henri Denis Bruno | Method and system for representing an acoustic field |
| FR2858403B1 (fr) * | 2003-07-31 | 2005-11-18 | Remy Henri Denis Bruno | System and method for determining a representation of an acoustic field |
| EP2297557B1 (en) * | 2008-07-08 | 2013-10-30 | Brüel & Kjaer Sound & Vibration Measurement A/S | Reconstructing an acoustic field |
| ES2690164T3 (es) * | 2009-06-25 | 2018-11-19 | Dts Licensing Limited | Device and method for converting a spatial audio signal |
| EP2486561B1 (en) * | 2009-10-07 | 2016-03-30 | The University Of Sydney | Reconstruction of a recorded sound field |
| WO2012025580A1 (en) * | 2010-08-27 | 2012-03-01 | Sonicemotion Ag | Method and device for enhanced sound field reproduction of spatially encoded audio input signals |
| EP2450880A1 (en) * | 2010-11-05 | 2012-05-09 | Thomson Licensing | Data structure for Higher Order Ambisonics audio data |
| EP2469741A1 (en) * | 2010-12-21 | 2012-06-27 | Thomson Licensing | Method and apparatus for encoding and decoding successive frames of an ambisonics representation of a 2- or 3-dimensional sound field |
| EP2541547A1 (en) * | 2011-06-30 | 2013-01-02 | Thomson Licensing | Method and apparatus for changing the relative positions of sound objects contained within a higher-order ambisonics representation |
| EP2665208A1 (en) | 2012-05-14 | 2013-11-20 | Thomson Licensing | Method and apparatus for compressing and decompressing a Higher Order Ambisonics signal representation |
| EP2738962A1 (en) | 2012-11-29 | 2014-06-04 | Thomson Licensing | Method and apparatus for determining dominant sound source directions in a higher order ambisonics representation of a sound field |
| US9736609B2 (en) * | 2013-02-07 | 2017-08-15 | Qualcomm Incorporated | Determining renderers for spherical harmonic coefficients |
- 2013-02-08: EP application EP20130305156, published as EP2765791A1 (status: withdrawn)
- 2014-02-07: KR application KR1020157021230A, published as KR102220187B1 (status: active)
- 2014-02-07: US application US14/766,739, published as US9622008B2 (status: active)
- 2014-02-07: WO application PCT/EP2014/052479, published as WO2014122287A1 (status: ceased)
- 2014-02-07: EP application EP14703102.5A, published as EP2954700B1 (status: active)
- 2014-02-07: JP application JP2015556516A, published as JP6374882B2 (status: active)
- 2014-02-07: CN application CN201480008017.XA, published as CN104995926B (status: active)
- 2014-02-10: TW application TW103104224A, published as TWI647961B (status: active)
Non-Patent Citations (1)
| Title |
|---|
| Andrew Wabnitz, et al. TIME DOMAIN RECONSTRUCTION OF SPATIAL SOUND FIELDS USING COMPRESSED SENSING. IEEE International Conference on Acoustics, Speech and Signal Processing. 2011.05.22. |
Also Published As
| Publication number | Publication date |
|---|---|
| JP2016509812A (ja) | 2016-03-31 |
| EP2765791A1 (en) | 2014-08-13 |
| WO2014122287A1 (en) | 2014-08-14 |
| TW201448616A (zh) | 2014-12-16 |
| EP2954700B1 (en) | 2018-03-07 |
| JP6374882B2 (ja) | 2018-08-15 |
| KR20150115779A (ko) | 2015-10-14 |
| CN104995926A (zh) | 2015-10-21 |
| TWI647961B (zh) | 2019-01-11 |
| US20150373471A1 (en) | 2015-12-24 |
| EP2954700A1 (en) | 2015-12-16 |
| US9622008B2 (en) | 2017-04-11 |
| CN104995926B (zh) | 2017-12-26 |
Similar Documents
| Publication | Title |
|---|---|
| KR102220187B1 (ko) | Method and apparatus for determining directions of uncorrelated sound sources in a higher order ambisonics representation of a sound field |
| US10609501B2 (en) | Method and apparatus for compressing and decompressing a higher order ambisonics representation for a sound field |
| US11943604B2 (en) | Spatial audio processing |
| CN111209903B (zh) | Face detection method and device |
| WO2016119388A1 (zh) | Method and apparatus for constructing a focused covariance matrix based on a speech signal |
| CN108701461A (zh) | Improved ambisonic encoder for a sound source having a plurality of reflections |
| EP4050602B1 (en) | Conference device with voice direction estimation |
| Dang et al. | An iteratively reweighted steered response power approach to multisource localization using a distributed microphone network |
| Poschadel et al. | Investigations on higher-order spherical harmonic input features for deep learning-based multiple speaker detection and localization |
| CN115116460B (zh) | Audio signal enhancement method, apparatus, device, storage medium, and program product |
| CN113835065B (zh) | Deep-learning-based sound source direction determination method, apparatus, device, and medium |
| CN116609726A (zh) | Sound source localization method and apparatus |
| RU2847553C1 (ru) | Method and apparatus for compressing and decompressing a higher order ambisonics representation for a sound field |
| RU2823441C2 (ru) | Method and apparatus for compressing and decompressing a higher order ambisonics representation for a sound field |
| US20070160241A1 (en) | Determination of the adequate measurement window for sound source localization in echoic environments |
| RU2823441C9 (ru) | Method and apparatus for compressing and decompressing a higher order ambisonics representation for a sound field |
| CN112822482A (zh) | Method and device for determining an evaluation score of an audio/video call |
| Cohen et al. | Synthetic Aperture Local Conformal Autoencoder for Semi-Supervised Speaker's DOA Tracking |
| KR20240097694A (ko) | Method for determining an impulse response and electronic device performing the method |
| Donohue et al. | Computational strategy for accelerating robust sound source detection in dynamic scenes |
| CN121093729A (zh) | Method for characterizing an indoor sound field and time-domain characterization model of an indoor sound field |
| CN121093728A (zh) | Method for characterizing an indoor sound field and time-domain/time-frequency-domain hybrid model framework |
| CN121805951A (zh) | Sound source localization method and apparatus based on a complex-valued residual shrinkage recurrent neural network |
| Qahani et al. | Realtime stereo localization of a moving speaker using filtered spectrogram correlation map algorithm |
| CN114333783A (zh) | Audio endpoint detection method and device |
Legal Events
| Code | Title | Description |
|---|---|---|
| PA0105 | International application | Patent event date: 20150805; Patent event code: PA01051R01D; Comment text: International Patent Application |
| PG1501 | Laying open of application | |
| N231 | Notification of change of applicant | |
| PN2301 | Change of applicant | Patent event date: 20160831; Patent event code: PN23011R01D; Comment text: Notification of Change of Applicant |
| A201 | Request for examination | |
| PA0201 | Request for examination | Patent event date: 20190201; Patent event code: PA02012R01D; Comment text: Request for Examination of Application |
| E902 | Notification of reason for refusal | |
| PE0902 | Notice of grounds for rejection | Patent event date: 20200821; Patent event code: PE09021S01D; Comment text: Notification of reason for refusal |
| E701 | Decision to grant or registration of patent right | |
| PE0701 | Decision of registration | Patent event date: 20201118; Patent event code: PE07011S01D; Comment text: Decision to Grant Registration |
| GRNT | Written decision to grant | |
| PR0701 | Registration of establishment | Patent event date: 20210219; Patent event code: PR07011E01D; Comment text: Registration of Establishment |
| PR1002 | Payment of registration fee | Payment date: 20210222; Start annual number: 1; End annual number: 3 |
| PG1601 | Publication of registration | |
| PR1001 | Payment of annual fee | Payment date: 20240130; Start annual number: 4; End annual number: 4 |
| PR1001 | Payment of annual fee | Payment date: 20250210; Start annual number: 5; End annual number: 5 |